NVIDIA · Hopper · 2023
141GB HBM3e at Hopper prices — the cost/perf sweet spot for production LLM inference.
Cloud rental
On-demand · per-second billing
$2.10–$2.40/hr range
Or $1.57/hr reserved (30% off)
Pricing across clouds
Same GPU, cheapest-first. Prices reflect publicly listed hourly rates for NVIDIA H200 on each provider. Runcrate is the lowest published rate.
Workloads
Real workloads sized for the NVIDIA H200, with concrete performance numbers. Click to deploy preconfigured.
Deploy
Skip the dashboard if you don't need it. SDK, Python, or cURL — copy the snippet, paste your API key, ship.
import Runcrate from "@runcrate/sdk";
const rc = new Runcrate({ apiKey: "rc_live_••••••••••••••••" });
const instance = await rc.instances.create({
gpu: "h200",
region: "auto",
image: "runcrate/vllm:latest",
});
console.log(`SSH: ssh root@${instance.host}`);Benchmarks
Side-by-side specs vs the closest alternatives. Bars are normalized to the highest value in each metric.
FP16
VRAM
Bandwidth
Alternatives
Same Runcrate platform, different price/performance. Hover any GPU to switch.
Hopper · 141GB HBM3e · 134 TFLOPS FP16
Hopper · 80GB HBM3 · 120 TFLOPS FP16
Hopper · 80GB HBM3 · 102 TFLOPS FP16
CDNA 2 · 128GB HBM2e · 95.7 TFLOPS FP16
Blackwell · 192GB HBM3e · 180 TFLOPS FP16
Full specs
Memory
Compute
Power & form factor
Cluster
FAQ
Ready when you are
Available across 6 regions · per-second billing · no commitments. Ship today.
Other GPUs
NVIDIA Hopper
NVIDIA Ampere
NVIDIA Ada Lovelace