Back to Benchmarks

VPCT

Visual perception and comprehension tasks

Frontier
Category:specialized
EDI:145.0
Slope:1.61
View Source

Leaderboard

(11 models)
RankModelScoreStderr
1Gemini 3 Pro91.00
2GPT-5.284.00
3o4-mini-2025-04-16 medium57.50
4o352.00
5Gemini 2.5 Pro (Jun 2025)48.00
6GPT-4.145.00
7GPT-4o40.00
8Claude Opus 4.540.00
9Claude 3.7 Sonnet39.00
10o137.00
11gpt-4o-mini-2024-07-1834.00

Data source: Epoch AI, “Data on AI Benchmarking”. Published at epoch.ai

Licensed under CC-BY 4.0