Chat / LLM completion

curl --request POST \ --url https://api.runcrate.ai/v1/chat/completions \ --header 'Authorization: Bearer <token>' \ --header 'Content-Type: application/json' \ --data ' { "model": "<string>", "messages": [ { "content": "<string>", "name": "<string>" } ], "max_tokens": 2, "temperature": 1, "top_p": 0.5, "stop": "<string>", "frequency_penalty": 0, "presence_penalty": 0, "stream": false, "tools": [ {} ], "tool_choice": "<string>", "response_format": {} } '

{ "id": "<string>", "object": "chat.completion", "created": 123, "model": "<string>", "choices": [ { "index": 123, "message": { "content": "<string>", "name": "<string>" } } ], "usage": { "prompt_tokens": 123, "completion_tokens": 123, "total_tokens": 123 } }

Authorizations

Authorization

string

header

required

Use a Runcrate API key with the rc_live_* prefix as the bearer token. Create one at https://www.runcrate.ai/dashboard/api-keys.

Body

application/json

model

string

required

Model id from the catalog (e.g. deepseek/deepseek-v3.2).

messages

object[]

required

Show child attributes

max_tokens

integer

Required range: x >= 1

temperature

number

Required range: 0 <= x <= 2

top_p

number

Required range: 0 <= x <= 1

stop

frequency_penalty

number

Required range: -2 <= x <= 2

presence_penalty

number

Required range: -2 <= x <= 2

stream

boolean

default:false

tools

object[]

tool_choice

response_format

object

Response

Completion (or SSE stream when stream=true).

string

object

enum<string>

Available options:

chat.completion

created

integer

model

string

choices

object[]

Show child attributes

usage

object

Show child attributes