Skip to main content

Documentation Index

Fetch the complete documentation index at: https://runcrate.ai/docs/llms.txt

Use this file to discover all available pages before exploring further.

For Agents

Fetch the complete documentation index at: https://runcrate.ai/docs/llms.txt
Runcrate is the complete platform for AI teams to access open-source models and GPU compute. One account gives you production inference for 140+ models, on-demand GPU instances, dedicated clusters, and the SDKs to build with all of it.

Quickstart

Make your first API call in under 60 seconds.

Model Catalog

Browse 140+ open-source models across text, image, video, and audio.

SDKs

Python and TypeScript clients. Drop-in OpenAI SDK replacements.

API Reference

Full REST API documentation for inference and infrastructure.

The Runcrate Platform

Everything your AI team needs: production inference, GPU compute, and dedicated clusters — all under one account and one bill.

Models API

Chat, image, video, TTS, and transcription via OpenAI-compatible endpoints.

GPU Instances

Deploy containers with dedicated NVIDIA GPUs in 60 seconds.

Storage

Persistent volumes with built-in file explorer.

Dedicated Clusters

Reserved bare-metal from 16 to 128+ nodes.

MCP Server

Control Runcrate from Claude, Cursor, or any AI assistant.

Explore use cases

See how teams use Runcrate to build AI products, run inference at scale, train models, and deploy custom servers.

AI SaaS Backend

Production AI backend with chat, image, and RAG.

RAG Pipeline

Retrieval-augmented generation with embeddings and vector search.

Fine-tune LLMs

Fine-tune Llama, Mistral, or Qwen on your own data.

Video Generation

Generate videos with Kling, Veo, Sora, and Seedance.

AI Chatbot (Next.js)

Streaming chatbot with the Vercel AI SDK and Runcrate.

ComfyUI in the Cloud

Run ComfyUI on a GPU instance with persistent model storage.

Deploy Llama

Self-host Llama 3 with vLLM for custom inference.

Voice Cloning

Clone any voice and generate speech with TTS models.

Start building

Python

Official Python client. Drop-in replacement for the OpenAI SDK.

TypeScript

Official TypeScript client for Node.js and edge runtimes.

Vercel AI SDK

First-class Runcrate provider for the Vercel AI SDK.

CLI

Full terminal control. Deploy instances, SSH in, transfer files, manage volumes — all from your terminal.