NVIDIA

NVIDIA · Blackwell · 2024

NVIDIA GB200.

Grace + Blackwell unified-memory superchip — frontier-model training at the largest scale.

192 GB
HBM3e
8.0 TB/s
Bandwidth
180
TFLOPS FP16
8× GPU
NVLink
Reserved capacity

Cloud rental

$3.75/hr

On-demand · per-second billing

$3.50$4.00/hr range

Or $2.63/hr reserved (30% off)

Pricing across clouds

NVIDIA GB200 cloud rental price comparison.

Same GPU, cheapest-first. Prices reflect publicly listed hourly rates for NVIDIA GB200 on each provider. Runcrate is the lowest published rate.

RuncrateCheapest
$3.75/hr
Oracle
$5.85/hr
AWS
$6.20/hr
Azure
$6.50/hr
GCP
$6.95/hr

Deploy

One command. NVIDIA GB200 in 60 seconds.

Skip the dashboard if you don't need it. SDK, Python, or cURL — copy the snippet, paste your API key, ship.

import Runcrate from "@runcrate/sdk";

const rc = new Runcrate({ apiKey: "rc_live_••••••••••••••••" });

const instance = await rc.instances.create({
  gpu: "gb200",
  region: "auto",
  image: "runcrate/vllm:latest",
});

console.log(`SSH: ssh root@${instance.host}`);

Benchmarks

How NVIDIA GB200 stacks up.

Side-by-side specs vs the closest alternatives. Bars are normalized to the highest value in each metric.

FP16

NVIDIA GB200180 TFLOPS
NVIDIA B200180 TFLOPS
NVIDIA GB300240 TFLOPS
NVIDIA H200134 TFLOPS

VRAM

NVIDIA GB200192 GB
NVIDIA B200192 GB
NVIDIA GB300256 GB
NVIDIA H200141 GB

Bandwidth

NVIDIA GB2008.0 TB/s
NVIDIA B2008.0 TB/s
NVIDIA GB30010.0 TB/s
NVIDIA H2004.8 TB/s

Full specs

NVIDIA GB200 specifications.

Memory

VRAM192 GB
Memory typeHBM3e
Bandwidth8.0 TB/s

Compute

FP3290 TFLOPS
FP16180 TFLOPS
INT83600 TOPS
Tensor cores18,432

Power & form factor

TDP1000 W
Form factorSuperchip
ArchitectureBlackwell
Released2024

Cluster

Max GPUs/node8
InterconnectNVLink
ManufacturerNVIDIA
Available regions2

FAQ

Frequently asked.

Ready when you are

Deploy NVIDIA GB200
in 60 seconds.

Reserved capacity available · per-second billing · contact us for an immediate quote.