NVIDIA

NVIDIA · Hopper · 2023

NVIDIA H200.

141GB HBM3e at Hopper prices — the cost/perf sweet spot for production LLM inference.

Available now · 6 regions
From $2.10/hr

Memory

141 GB

HBM3e

Bandwidth

4.8 TB/s

Memory I/O

FP16

134

TFLOPS

TDP

700 W

Hopper

Workloads

What you'll actually use this for.

Real workloads sized for the NVIDIA H200, with concrete performance numbers. Click to deploy preconfigured.

Specs

Full specifications.

GPU Memory141 GB HBM3e
Memory Bandwidth4800 GB/s
FP32 Performance67 TFLOPS
FP16 Performance134 TFLOPS
INT8 Performance2680 TOPS
Tensor Cores16,896
CUDA Cores16,896
TDP700 W
ArchitectureHopper
Form FactorSXM
NVLinkYes
Max GPUs per Node8
Release Year2023

FAQ

Frequently asked.

Rent the NVIDIA H200 on Runcrate.

Available now · 6 regions · per-second billing · from $2.10/hr

Other GPUs