OpenAI next-gen reasoning model with 200K context. Record ARC-AGI scores. Exceptional reasoning (99/100) and math (98/100). Best for research and complex problem-solving.
Epoch Capabilities Index score
Normalized ECI score (0-100)
Maximum input token capacity
Organization that created the model
Model development country
Access method (API, download, etc.)
Model weights publicly available
Open source availability
Epoch AI benchmark identifier
Code generation, understanding, and debugging
Mathematical reasoning and problem solving
Multi-step logical reasoning
Overall reasoning and task completion ability
| Dimension | Score |
|---|---|
| Intelligence | 95.0 |
| Reasoning | 100.0 |
| Math | 100.0 |
| Code | 100.0 |
| Instruction Following | 98.0 |
| Overall | 95 |