NVIDIA

NVIDIA · Hopper · 2022

NVIDIA H100.

The most-deployed datacenter GPU for AI training and inference. Available in SXM (NVLink, 3.35 TB/s) and PCIe (standard servers, lower cost) form factors — both with 80GB HBM3. From $1.25/hr.

80 GB
HBM3
3.35 TB/s
SXM bandwidth
120
TFLOPS FP16 (SXM)
8× GPU
NVLink (SXM)
Available now · 8 regions

Cloud rental

$1.25/hr

PCIe from $1.25/hr · avg $1.40/hr

SXM from $1.35/hr · avg $1.50/hr

Per-second billing · no commitments

Choose your variant

H100 SXM vs PCIe.

Same Hopper architecture, same 80GB HBM3. SXM trades a higher price for NVLink and 67% more memory bandwidth. PCIe fits standard servers at a lower hourly rate.

NVIDIA H100 SXM

Most popular
$1.50/hr avg($1.35–$1.65)
  • 80GB HBM3
  • 3,350 GB/s bandwidth
  • 120 TFLOPS FP16
  • NVLink · 8 GPUs/node

NVIDIA H100 PCIe

Best value
$1.40/hr avg($1.25–$1.55)
  • 80GB HBM3
  • 2,000 GB/s bandwidth
  • 102 TFLOPS FP16
  • PCIe · 4 GPUs/node

Pricing across clouds

NVIDIA H100 SXM cloud rental price comparison.

Same GPU, cheapest-first. Prices reflect publicly listed hourly rates for NVIDIA H100 SXM on each provider. Runcrate is the lowest published rate.

RuncrateCheapest
$1.50/hr
RunPod
$1.99/hr
Oracle
$2.40/hr
Lambda
$2.49/hr
GCP
$3.67/hr
Azure
$3.96/hr
AWS
$4.10/hr

Workloads

What you'll actually use this for.

Real workloads sized for the NVIDIA H100 SXM, with concrete performance numbers. Click to deploy preconfigured.

Deploy

One command. NVIDIA H100 SXM in 60 seconds.

Skip the dashboard if you don't need it. SDK, Python, or cURL — copy the snippet, paste your API key, ship.

import Runcrate from "@runcrate/sdk";

const rc = new Runcrate({ apiKey: "rc_live_••••••••••••••••" });

const instance = await rc.instances.create({
  gpu: "h100-sxm",
  region: "auto",
  image: "runcrate/vllm:latest",
});

console.log(`SSH: ssh root@${instance.host}`);

Decision guide

Which H100 should I choose?

Both variants share the same Hopper architecture and 80GB HBM3. The difference is interconnect, bandwidth, and price.

Choose SXM if

  • Multi-GPU distributed training (2-8 GPUs with NVLink)
  • Workloads that saturate memory bandwidth (large-batch inference, long-context)
  • Production serving that needs maximum throughput per GPU
  • Fine-tuning 70B+ models across multiple GPUs

Choose PCIe if

  • Single-GPU inference for models under 70B
  • Budget-conscious teams that need Hopper at the lowest hourly rate
  • Standard server deployments without SXM baseboard requirements
  • QLoRA fine-tuning on a single GPU

Full specs

H100 SXM vs PCIe — side by side.

SpecH100 SXMH100 PCIe
Price (avg)$1.50/hr$1.40/hr
Price (range)$1.35–$1.65/hr$1.25–$1.55/hr
VRAM80 GB80 GB
Memory typeHBM3HBM3
Memory bandwidth3.4 TB/s2.0 TB/s
FP3260 TFLOPS51 TFLOPS
FP16120 TFLOPS102 TFLOPS
INT82400 TOPS2040 TOPS
Tensor cores16,89614,592
CUDA cores16,89614,592
TDP700W350W
Form factorSXMPCIe
NVLinkYesNo
Max GPUs/node84
ArchitectureHopperHopper
Release year20222022

Benchmarks

How NVIDIA H100 SXM stacks up.

Side-by-side specs vs the closest alternatives. Bars are normalized to the highest value in each metric.

FP16

NVIDIA H100 SXM120 TFLOPS
AMD MI250X95.7 TFLOPS
NVIDIA A100 SXM78 TFLOPS
NVIDIA A100 PCIe78 TFLOPS

VRAM

NVIDIA H100 SXM80 GB
AMD MI250X128 GB
NVIDIA A100 SXM80 GB
NVIDIA A100 PCIe80 GB

Bandwidth

NVIDIA H100 SXM3.4 TB/s
AMD MI250X3.3 TB/s
NVIDIA A100 SXM2.0 TB/s
NVIDIA A100 PCIe1.9 TB/s

FAQ

Frequently asked.

Ready when you are

Deploy NVIDIA H100 SXM
in 60 seconds.

Available across 8 regions · per-second billing · no commitments. Ship today.