Detailed guide for using this model with llama.cpp: https://github.com/ggml-org/llama.cpp/discussions/15396
Quick start:
llama-server -hf ggml-org/gpt-oss-120b-GGUF -c 0 --jinja
Here -hf downloads the quantized model directly from Hugging Face, -c 0 uses the context length stored in the model, and --jinja enables the chat template embedded in the GGUF. Once the server is up, open http://localhost:8080 in your browser.
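llama-server also exposes an OpenAI-compatible HTTP API on the same port. A minimal sketch of a chat request with curl (the prompt text and max_tokens value are just illustrative):
curl http://localhost:8080/v1/chat/completions \
  -H "Content-Type: application/json" \
  -d '{
        "messages": [{"role": "user", "content": "Hello! What can you do?"}],
        "max_tokens": 128
      }'
Because the API is OpenAI-compatible, existing OpenAI client libraries can be pointed at http://localhost:8080/v1 instead of the official endpoint.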