Hopper-architecture GPU with an extended 141 GB of HBM3e memory. Roughly 2,000 TFLOPS of FP16 compute for high-performance workloads. 87K tok/s in MLPerf inference. Available from 5 providers starting at $4.50/hr.
High-bandwidth memory capacity
FP16 compute performance
GPU architecture generation (e.g., Hopper)
Thermal design power consumption
Lowest available cloud rental price
Average price across providers
Number of cloud providers offering this GPU
Last pricing data refresh date
LLM training time-to-convergence
Software stack maturity and framework support
LLM inference throughput per GPU
Raw compute throughput and memory bandwidth
Market supply and cloud instance availability
AMD MI300X achieves highest per-GPU LLM inference throughput in MLPerf Inference v5.1, delivering 21,150 tokens/s per GPU on llama2-70b, outperforming NVIDIA H100 (15,610 tok/s), B200 (13,015 tok/s), and H200 (10,917 tok/s). Industry-standard benchmark validates AMD's competitiveness in AI inference.
Installed AI compute from NVIDIA chips has more than doubled annually since 2020, with new flagship chips accounting for most compute within 3 years of release.
AWS has launched P5e instances featuring NVIDIA H200 GPUs, now generally available in US East and West regions, with EU availability expected in Q1 2026.
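One quick way to confirm where H200-backed capacity is offered is to query the EC2 API directly. The sketch below is a minimal example, assuming boto3 is installed with credentials configured and that p5e.48xlarge is the relevant H200 instance type in your account's catalog; verify the exact instance type name in the AWS console for your region.

```python
# Sketch: list Availability Zones in a region that offer an H200-backed instance type.
# The instance type name "p5e.48xlarge" is an assumption -- confirm it for your account.
import boto3

ec2 = boto3.client("ec2", region_name="us-east-1")
resp = ec2.describe_instance_type_offerings(
    LocationType="availability-zone",
    Filters=[{"Name": "instance-type", "Values": ["p5e.48xlarge"]}],
)
for offering in resp["InstanceTypeOfferings"]:
    print(offering["InstanceType"], "offered in", offering["Location"])
```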
NVIDIA H200's 141GB HBM3e memory requires updated CUDA drivers and framework versions. Teams should verify compatibility before migration from H100.
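As a minimal pre-migration sanity check, something like the following PyTorch sketch confirms that the runtime actually sees the H200, its full memory, and a recent CUDA runtime. The minimum CUDA version used here is an illustrative assumption, not an official NVIDIA requirement; substitute whatever your framework vendor documents.

```python
# Minimal sketch: verify the driver/runtime stack sees an H200 before moving H100 workloads.
# Assumes a CUDA build of PyTorch; the version threshold below is a placeholder assumption.
import torch

assert torch.cuda.is_available(), "No CUDA-capable GPU visible to PyTorch"

props = torch.cuda.get_device_properties(0)
print("GPU:", props.name)                                       # expect an H200 device name
print("Memory (GB):", round(props.total_memory / 1024**3, 1))   # ~141 GB on H200
print("Compute capability:", f"{props.major}.{props.minor}")    # 9.0 for Hopper

if torch.version.cuda:
    runtime = tuple(int(p) for p in torch.version.cuda.split("."))
    # Placeholder threshold -- replace with the version your framework vendor requires.
    if runtime < (12, 2):
        print(f"Warning: CUDA runtime {torch.version.cuda} may predate H200 support")
    else:
        print("CUDA runtime:", torch.version.cuda)
```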
SemiAnalysis launched InferenceMAX, an open-source nightly benchmark comparing GPU inference performance across NVIDIA (H100, H200, B200, GB200 NVL72) and AMD (MI300X, MI325X, MI355X). Endorsed by Jensen Huang, Lisa Su, OpenAI, and Microsoft. First multi-vendor benchmark with TCO and power efficiency metrics.