Reasoning-optimized models (o1-class, GPT-5, Claude 4.5, Gemini 3) now account for over 50% of all LLM token usage, up from a negligible share in early 2025. This reflects a fundamental shift from single-pass text completion to multi-step, deliberative inference.
Reasoning models exceeded 50% of total tokens by late 2025, up from near-zero in Q1 (Dec 1, 2025)
xAI's Grok Code Fast 1 now leads reasoning traffic, ahead of Gemini 2.5 Pro (Dec 1, 2025)
Average prompt tokens increased 4x (1.5K to 6K+), and completions nearly tripled (Dec 1, 2025)