OpenAI reasoning-first model with 200K context. Exceptional reasoning (98/100) and math (96/100). Uses extended thinking for complex multi-step problems.
Epoch Capabilities Index score
Normalized ECI score (0-100)
Maximum input token capacity
Organization that created the model
Model development country
Access method (API, download, etc.)
Model weights publicly available
Open source availability
Epoch AI benchmark identifier
Code generation, understanding, and debugging
Mathematical reasoning and problem solving
Multi-step logical reasoning
Overall reasoning and task completion ability
| Dimension | Score |
|---|---|
| Intelligence | 90.0 |
| Reasoning | 100.0 |
| Math | 100.0 |
| Code | 100.0 |
| Instruction Following | 96.0 |
| Overall | 90 |