DeepSeek V4 Pro offers a 1M token context window — enough to process entire codebases, full-length books, or hundreds of pages of legal contracts in a single request. Access it through the Runcrate Models API with no GPU management and no waitlists.Documentation Index
Fetch the complete documentation index at: https://runcrate.ai/docs/llms.txt
Use this file to discover all available pages before exploring further.
Why DeepSeek V4 Pro
| Feature | Detail |
|---|---|
| Context window | 1,000,000 tokens |
| Architecture | Mixture-of-Experts |
| Best for | Long document analysis, code generation, multi-step reasoning |
| API compatibility | OpenAI-compatible chat completions |
Basic chat completion
Long document analysis (1M context)
The 1M context window means you can pass an entire document — a legal contract, research paper, or codebase — directly in the prompt. No chunking, no RAG pipeline, no lost context.Streaming
Addstream: true to any request and iterate over chunks as they arrive. Works with the OpenAI SDK in both Python (for chunk in stream) and TypeScript (for await ... of stream).
Next steps
- Analyze long documents with AI — compare 1M-context models side by side.
- Extract structured data — combine DeepSeek V4 with schema-based extraction.
- Model catalog — browse all available models and pricing.