Documentation Index
Fetch the complete documentation index at: https://runcrate.ai/docs/llms.txt
Use this file to discover all available pages before exploring further.
Runcrate is the complete platform for AI teams to access open-source models and GPU compute. One account gives you production inference for 140+ models, on-demand GPU instances, dedicated clusters, and the SDKs to build with all of it.For Agents
Fetch the complete documentation index at: https://runcrate.ai/docs/llms.txt
Quickstart
Make your first API call in under 60 seconds.
Model Catalog
Browse 140+ open-source models across text, image, video, and audio.
SDKs
Python and TypeScript clients. Drop-in OpenAI SDK replacements.
API Reference
Full REST API documentation for inference and infrastructure.
The Runcrate Platform
Everything your AI team needs: production inference, GPU compute, and dedicated clusters — all under one account and one bill.Models API
Chat, image, video, TTS, and transcription via OpenAI-compatible endpoints.
GPU Instances
Deploy containers with dedicated NVIDIA GPUs in 60 seconds.
Storage
Persistent volumes with built-in file explorer.
Dedicated Clusters
Reserved bare-metal from 16 to 128+ nodes.
MCP Server
Control Runcrate from Claude, Cursor, or any AI assistant.
Explore use cases
See how teams use Runcrate to build AI products, run inference at scale, train models, and deploy custom servers.AI SaaS Backend
Production AI backend with chat, image, and RAG.
RAG Pipeline
Retrieval-augmented generation with embeddings and vector search.
Fine-tune LLMs
Fine-tune Llama, Mistral, or Qwen on your own data.
Video Generation
Generate videos with Kling, Veo, Sora, and Seedance.
AI Chatbot (Next.js)
Streaming chatbot with the Vercel AI SDK and Runcrate.
ComfyUI in the Cloud
Run ComfyUI on a GPU instance with persistent model storage.
Deploy Llama
Self-host Llama 3 with vLLM for custom inference.
Voice Cloning
Clone any voice and generate speech with TTS models.
Start building
Python
Official Python client. Drop-in replacement for the OpenAI SDK.
TypeScript
Official TypeScript client for Node.js and edge runtimes.
Vercel AI SDK
First-class Runcrate provider for the Vercel AI SDK.
CLI
Full terminal control. Deploy instances, SSH in, transfer files, manage volumes — all from your terminal.