Skip to main content
POST
/
chat
/
completions
Chat / LLM completion
curl --request POST \
  --url https://api.runcrate.ai/v1/chat/completions \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '
{
  "model": "<string>",
  "messages": [
    {
      "content": "<string>",
      "name": "<string>"
    }
  ],
  "max_tokens": 2,
  "temperature": 1,
  "top_p": 0.5,
  "stop": "<string>",
  "frequency_penalty": 0,
  "presence_penalty": 0,
  "stream": false,
  "tools": [
    {}
  ],
  "tool_choice": "<string>",
  "response_format": {}
}
'
{
  "id": "<string>",
  "created": 123,
  "model": "<string>",
  "choices": [
    {
      "index": 123,
      "message": {
        "content": "<string>",
        "name": "<string>"
      }
    }
  ],
  "usage": {
    "prompt_tokens": 123,
    "completion_tokens": 123,
    "total_tokens": 123
  }
}

Documentation Index

Fetch the complete documentation index at: https://runcrate.ai/docs/llms.txt

Use this file to discover all available pages before exploring further.

Authorizations

Authorization
string
header
required

Use a Runcrate API key with the rc_live_* prefix as the bearer token. Create one at https://www.runcrate.ai/dashboard/api-keys.

Body

application/json
model
string
required

Model id from the catalog (e.g. deepseek/deepseek-v3.2).

messages
object[]
required
max_tokens
integer
Required range: x >= 1
temperature
number
Required range: 0 <= x <= 2
top_p
number
Required range: 0 <= x <= 1
stop
frequency_penalty
number
Required range: -2 <= x <= 2
presence_penalty
number
Required range: -2 <= x <= 2
stream
boolean
default:false
tools
object[]
tool_choice
response_format
object

Response

Completion (or SSE stream when stream=true).

id
string
object
enum<string>
Available options:
chat.completion
created
integer
model
string
choices
object[]
usage
object