← Back to models8285
D
DeepSeek R1
Models
Leading open-source model with strong reasoning capabilities
Open-source leader with excellent math performance
Metrics
elo1,280
providerDeepSeek
price input$0.55
price output$2.19
context window64,000
Score Breakdown
code85
math88
reasoning72
intelligence83
Compatibility
Scoring Methodology
intelligence30% weight
Overall reasoning and task completion ability
Source: LMArena ELO, Artificial Analysis Intelligence Index, HuggingFace MMLU-PRO
math20% weight
Mathematical reasoning and problem solving
Source: MATH benchmark, GSM8K, HuggingFace MATH-Lvl5
code20% weight
Code generation, understanding, and debugging
Source: HumanEval, MBPP, SWE-bench
reasoning15% weight
Multi-step logical reasoning
Source: ARC-Challenge, BBH, MMLU-Pro, HuggingFace BBH
instruction_following15% weight
Ability to follow complex instructions accurately
Source: HuggingFace IFEval
Related Signals
DeepSeek R1 Leads Open-Source Math Benchmarks
Models1d ago
DeepSeek R1 has emerged as the leading open-source model for mathematical reasoning, outperforming many closed-source alternatives on MATH and GSM8K benchmarks.
Data Sources
Last updated: December 24, 2025