DEEPSEEK API
Run DeepSeek's frontier models through an OpenAI-compatible endpoint. V3.2 handles general chat and code generation at high throughput. R1 delivers extended chain-of-thought reasoning for math, science, and complex analysis. Both models, one API key.

QUICK START
```python
from openai import OpenAI

client = OpenAI(
    base_url="https://api.runcrate.ai/v1",
    api_key="rc_live_YOUR_API_KEY",
)

response = client.chat.completions.create(
    model="deepseek-ai/DeepSeek-V3",
    messages=[
        {"role": "user", "content": "Explain the transformer architecture in simple terms."}
    ],
)

print(response.choices[0].message.content)
```

AVAILABLE MODELS
| Model | Provider | Pricing | Details |
|---|---|---|---|
| deepseek-ai/DeepSeek-V3 | DeepSeek | Per-token | 128K context, MoE architecture |
| deepseek-ai/DeepSeek-R1 | DeepSeek | Per-token | Extended reasoning, chain-of-thought |
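Calling R1 from the table works the same way as the quick start. A minimal sketch, assuming the API key lives in a RUNCRATE_API_KEY environment variable; the variable name and the optional `reasoning_content` field are assumptions, not documented Runcrate behavior:

```python
import os

# Build the request payload separately so it can be inspected or reused.
# The model id comes from the table above; everything else mirrors the quick start.
def build_r1_request(problem: str) -> dict:
    return {
        "model": "deepseek-ai/DeepSeek-R1",
        "messages": [{"role": "user", "content": problem}],
    }

request = build_r1_request("Prove that the sum of two odd integers is even.")

# Only call the API when a key is configured; the SDK import is kept inside
# the branch so the payload helper above stays dependency-free.
if os.environ.get("RUNCRATE_API_KEY"):
    from openai import OpenAI

    client = OpenAI(
        base_url="https://api.runcrate.ai/v1",
        api_key=os.environ["RUNCRATE_API_KEY"],
    )
    response = client.chat.completions.create(**request)
    message = response.choices[0].message
    # Some OpenAI-compatible gateways expose R1's chain of thought as a
    # separate `reasoning_content` attribute; fall back to the answer alone.
    print(getattr(message, "reasoning_content", "") or "")
    print(message.content)
```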
WHY RUNCRATE
Efficient MoE inference: DeepSeek V3.2 uses a Mixture-of-Experts architecture that activates only the parameters needed for each token, delivering frontier-quality output at lower compute cost.
Extended reasoning: DeepSeek R1 generates detailed chain-of-thought traces before answering, excelling at math olympiad problems, code debugging, and multi-step logic.
128K context: Feed entire codebases, research papers, or lengthy documents into a single prompt without truncation.
Drop-in compatibility: Uses the standard OpenAI chat completions format. Swap the base URL and model name, and keep everything else the same.
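The long-context workflow above can be sketched as a small helper that packs whole files into one user message. The delimiter format is an arbitrary choice, and staying inside the 128K-token window is left to the caller:

```python
from pathlib import Path

def files_to_messages(paths: list[str], question: str) -> list[dict]:
    """Pack the full text of several files, plus a question, into a single user message."""
    parts = []
    for p in paths:
        # Label each file so the model can tell the sources apart.
        parts.append(f"--- {p} ---\n{Path(p).read_text(encoding='utf-8')}")
    parts.append(question)
    return [{"role": "user", "content": "\n\n".join(parts)}]
```

Pass the result as `messages=` to `client.chat.completions.create` exactly as in the quick start.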
COMPARISON
| Feature | Runcrate | DeepSeek Platform |
|---|---|---|
| API format | OpenAI-compatible | OpenAI-compatible |
| Region availability | US + EU | China-hosted |
| Rate limits | Flexible, credit-based | Strict tiers |
| Other models | 200+ (Llama, Qwen, Mistral...) | DeepSeek only |
| Billing | Prepaid credits (USD) | DeepSeek tokens (CNY) |
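Because both endpoints in the comparison speak the OpenAI chat completions format, switching providers is a configuration swap. A sketch of the only fields that change; the environment-variable names and the OpenAI model id are illustrative assumptions:

```python
import os

# The only settings that differ between providers when using the OpenAI SDK.
PROFILES = {
    "runcrate": {
        "base_url": "https://api.runcrate.ai/v1",
        "api_key_env": "RUNCRATE_API_KEY",  # assumed variable name
        "model": "deepseek-ai/DeepSeek-V3",
    },
    "openai": {
        "base_url": "https://api.openai.com/v1",
        "api_key_env": "OPENAI_API_KEY",
        "model": "gpt-4o",  # illustrative; pick whatever you use today
    },
}

def client_kwargs(provider: str) -> dict:
    """Keyword arguments for OpenAI(...); the rest of your code stays unchanged."""
    profile = PROFILES[provider]
    return {
        "base_url": profile["base_url"],
        "api_key": os.environ.get(profile["api_key_env"], ""),
    }
```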
FAQ