SemiAnalysis launched InferenceMAX, an open-source nightly benchmark comparing GPU inference performance across NVIDIA (H100, H200, B200, GB200 NVL72) and AMD (MI300X, MI325X, MI355X). Endorsed by Jensen Huang, Lisa Su, OpenAI, and Microsoft. First multi-vendor benchmark with TCO and power efficiency metrics.
InferenceMAX runs nightly benchmarks across 7 GPU types using vLLM, SGLang, TRT-LLM
Oct 9, 2025Expanding to Google TPU and AWS Trainium within 2 months for true multi-vendor comparison
Oct 9, 2025Benchmark includes TCO per million tokens and tokens per provisioned MW metrics
Oct 9, 2025