WHISPER API

Whisper inference, no GPU required.

Run OpenAI's Whisper Large V3 and Whisper V3 Turbo without managing GPUs. Send audio, get transcripts back. The API is OpenAI-compatible, so your existing code works after changing only the base URL. For latency-sensitive workloads, Turbo runs about 8x faster at nearly identical accuracy.

Turbo price: $0.02/min
Languages: 100+
Max file size: 25 MB

QUICK START

Integrate in minutes.

from openai import OpenAI

client = OpenAI(
    base_url="https://api.runcrate.ai/v1",
    api_key="rc_live_YOUR_API_KEY",
)

# Use Turbo for faster results at near-identical accuracy
with open("podcast.mp3", "rb") as audio_file:
    transcript = client.audio.transcriptions.create(
        model="openai/whisper-large-v3-turbo",
        file=audio_file,
    )
print(transcript.text)
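Uploads are capped at 25 MB, so it can save a failed round trip to check the file size before sending. A minimal sketch (the helper name and constant are our own, not part of the API):

```python
import os

MAX_UPLOAD_BYTES = 25 * 1024 * 1024  # 25 MB upload cap

def fits_upload_limit(path: str) -> bool:
    """Return True if the file fits under the 25 MB upload limit."""
    return os.path.getsize(path) <= MAX_UPLOAD_BYTES
```

Files over the limit need to be split into shorter segments before uploading.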

AVAILABLE MODELS

Models you can use today.

openai/whisper-large-v3
OpenAI · $0.045/min
Highest accuracy, best for critical transcription
openai/whisper-large-v3-turbo
OpenAI · $0.02/min
8x faster inference, ideal for real-time
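If you switch between the two models programmatically, a small lookup keeps the trade-off explicit. This is an illustrative helper of our own, using the model IDs and per-minute prices listed above:

```python
# Model catalogue from the table above: priority -> (model id, USD per minute)
MODELS = {
    "accuracy": ("openai/whisper-large-v3", 0.045),
    "speed": ("openai/whisper-large-v3-turbo", 0.02),
}

def pick_model(priority: str) -> str:
    """Return the model ID for a given priority: 'accuracy' or 'speed'."""
    model_id, _price = MODELS[priority]
    return model_id
```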

WHY RUNCRATE

Built for production.

No GPU Management

Skip the CUDA setup, model loading, and GPU provisioning. Send a file, receive text. Runcrate handles the infrastructure.

Turbo Mode

Whisper V3 Turbo delivers results 8x faster than the full model with near-identical word error rates. Pay less, get results faster.
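At the listed rates, the savings are easy to estimate. A quick sketch using the per-minute prices from the models table (the helper itself is our own):

```python
# Per-minute prices from the models table (USD)
PRICE_PER_MIN = {
    "openai/whisper-large-v3": 0.045,
    "openai/whisper-large-v3-turbo": 0.02,
}

def transcription_cost(model: str, minutes: float) -> float:
    """Estimated cost in USD for transcribing `minutes` of audio."""
    return round(PRICE_PER_MIN[model] * minutes, 2)

# A 60-minute podcast: $2.70 with Large V3, $1.20 with Turbo
```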

Automatic Language Detection

Whisper auto-detects the spoken language from 100+ options. No need to specify the language parameter for most use cases.

Production Ready

Handles concurrent requests, retries transparently, and scales automatically. No cold starts, no queue management.
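Retries happen transparently on the service side; if you also want client-side retries for transient network failures, the OpenAI Python SDK accepts a `max_retries` client option, or you can schedule your own exponential backoff. A sketch of a backoff schedule (the function and its default values are illustrative, not part of the API):

```python
def backoff_delays(attempts: int, base: float = 0.5, cap: float = 8.0) -> list[float]:
    """Exponential backoff delays in seconds: base * 2^n, capped at `cap`."""
    return [min(base * (2 ** n), cap) for n in range(attempts)]

# backoff_delays(5) -> [0.5, 1.0, 2.0, 4.0, 8.0]
```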


Start transcribing with Whisper.