Anthropic flagship model with 200K context. Record ARC-AGI performance. Exceptional reasoning (98/100) and intelligence (97/100). Best for research and complex tasks.
Epoch Capabilities Index score
Normalized ECI score (0-100)
Maximum input token capacity
Organization that created the model
Model development country
Access method (API, download, etc.)
Model weights publicly available
Open source availability
Epoch AI benchmark identifier
Code generation, understanding, and debugging
Mathematical reasoning and problem solving
Multi-step logical reasoning
Overall reasoning and task completion ability
| Dimension | Score |
|---|---|
| Intelligence | 96.0 |
| Reasoning | 98.0 |
| Math | 95.0 |
| Code | 96.0 |
| Instruction Following | 96.0 |
| Overall | 96 |