OpenAI's frontier model, with an Epoch Capabilities Index (ECI) score of 152. Exceptional across reasoning, coding, and math benchmarks.
LMArena human preference ranking
Epoch Capabilities Index score
Normalized ECI score (0-100)
Maximum input token capacity
Total compute used during training
Code generation, understanding, and debugging
Mathematical reasoning and problem solving
Multi-step logical reasoning
Overall reasoning and task completion ability
| Dimension | Score |
|---|---|
| Intelligence | 98.0 |
| Reasoning | 100.0 |
| Math | 100.0 |
| Code | 100.0 |
| Overall | 98.0 |