DeepSeek reasoning model with a 128K-token context window. Uses chain-of-thought reasoning. Exceptional math (96/100) and reasoning (96/100) scores. Open-source competitor to OpenAI's o1.
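For illustration, a minimal sketch of surfacing the model's chain-of-thought, assuming the OpenAI-compatible DeepSeek endpoint, the `deepseek-reasoner` model name, and a `reasoning_content` response field; hosted variants may differ.

```python
# Minimal sketch (assumptions noted above): query DeepSeek R1 and read its
# chain-of-thought separately from the final answer.
from openai import OpenAI

client = OpenAI(api_key="YOUR_API_KEY", base_url="https://api.deepseek.com")

resp = client.chat.completions.create(
    model="deepseek-reasoner",
    messages=[{"role": "user", "content": "What is 17 * 24?"}],
)

message = resp.choices[0].message
print("chain of thought:", message.reasoning_content)  # intermediate reasoning trace
print("final answer:", message.content)
```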
LMArena human preference ranking
Epoch Capabilities Index score
Normalized ECI score (0-100)
Maximum input token capacity
Total compute used during training
Code generation, understanding, and debugging
Mathematical reasoning and problem solving
Multi-step logical reasoning
Overall reasoning and task completion ability
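Taken together, the fields above can be modeled as one record per model. The sketch below is a hypothetical Python schema; the field names and their mapping to the descriptions are assumptions, not taken from this dataset.

```python
from dataclasses import dataclass
from typing import Optional

@dataclass
class ModelRecord:
    """Hypothetical per-model record for the fields described above."""
    name: str
    lmarena_rank: Optional[int] = None             # LMArena human preference ranking
    eci_score: Optional[float] = None              # Epoch Capabilities Index score
    eci_normalized: Optional[float] = None         # Normalized ECI score (0-100)
    context_window_tokens: Optional[int] = None    # Maximum input token capacity
    training_compute_flop: Optional[float] = None  # Total compute used during training
    code: Optional[float] = None          # Code generation, understanding, and debugging
    math: Optional[float] = None          # Mathematical reasoning and problem solving
    reasoning: Optional[float] = None     # Multi-step logical reasoning
    intelligence: Optional[float] = None  # Overall reasoning and task completion ability
```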
| Dimension | Score (0-100) |
|---|---|
| Intelligence | 88.0 |
| Reasoning | 77.0 |
| Math | 94.0 |
| Code | 91.0 |
| Overall | 88 |
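How the Overall figure relates to the four dimension scores is not stated here; the snippet below is a minimal sketch assuming a simple rounded mean, which happens to reproduce 88 for these values but is only an assumption.

```python
# Assumption: Overall is the rounded mean of the four dimension scores.
scores = {"Intelligence": 88.0, "Reasoning": 77.0, "Math": 94.0, "Code": 91.0}

mean_score = sum(scores.values()) / len(scores)
print(f"mean = {mean_score:.1f}, rounded = {round(mean_score)}")  # mean = 87.5, rounded = 88
```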
Labs such as OpenAI and Anthropic claim that the scaling of RL-based reasoning training cannot be sustained beyond one to two years due to compute infrastructure limits, suggesting the exceptional 2024-2025 capability growth could slow.
DeepSeek V3 used 10x less training compute than Llama 3 through multi-head latent attention (MLA), mixture-of-experts (MoE) innovations, and multi-token prediction, demonstrating 3x yearly algorithmic efficiency gains.
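As a back-of-envelope check, a compute-reduction factor R achieved over t years corresponds to an annualized algorithmic efficiency gain of R^(1/t); the two-year window used below is an illustrative assumption, not a figure from the source.

```latex
% Annualized algorithmic efficiency gain g from a compute-reduction factor R over t years
\[
  g = R^{1/t}, \qquad
  R = 10,\ t = 2 \;\Rightarrow\; g = 10^{1/2} \approx 3.2\times \text{ per year.}
\]
```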
The best open models runnable on consumer GPUs lag frontier AI by only about a year across the GPQA, MMLU, and LMArena benchmarks, suggesting rapid democratization of capabilities, with regulatory implications.
xAI's Grok Code Fast 1 has surged to the #1 position on OpenRouter with 572.7B tokens processed weekly, more than 3x the second-place model. This dethroned mimo-v2-flash, which dropped from #1 (170.9B tokens) to #9 (77.6B), signaling a major shift toward specialized coding models.
DeepSeek R1 has emerged as the leading open-source model for mathematical reasoning, outperforming many closed-source alternatives on the MATH and GSM8K benchmarks.