Claude Opus 4.5

Models

Anthropic's flagship model known for nuanced reasoning and code generation

Consistent top-tier performance across benchmarks

Metrics

elo1,410

providerAnthropic

price input$15.00

price output$75.00

context window200,000

Score Breakdown

code94

math91

reasoning92

intelligence95

Compatibility

1111111195 1111111192 1111111190

Scoring Methodology

intelligence30% weight

Overall reasoning and task completion ability

Source: LMArena ELO, Artificial Analysis Intelligence Index, HuggingFace MMLU-PRO

math20% weight

Mathematical reasoning and problem solving

Source: MATH benchmark, GSM8K, HuggingFace MATH-Lvl5

code20% weight

Code generation, understanding, and debugging

Source: HumanEval, MBPP, SWE-bench

reasoning15% weight

Multi-step logical reasoning

Source: ARC-Challenge, BBH, MMLU-Pro, HuggingFace BBH

instruction_following15% weight

Ability to follow complex instructions accurately

Source: HuggingFace IFEval

Related Signals

Gemini 3 Pro Takes LMArena Lead

Models1d ago

Gemini 3 Pro has reached 1490 ELO on LMArena, surpassing Claude Opus 4.5 and GPT-5.1 to claim the top position in human preference rankings.

LangGraph Reaches Production Maturity

Frameworks1d ago

LangGraph has matured into a production-ready framework for complex agent orchestration, with ThoughtWorks recommending 'Adopt' status in their Technology Radar.

Data Sources

lmarena.ai

artificialanalysis.ai

Last updated: December 24, 2025