Note: Overall leaderboard rankings may not reflect true model quality — individual benchmarks give a clearer picture. ARC-Challenge MMLU GPQA GSM8K Artificial Analysis Intelligence Index v4.0
← Back to leaderboard

openai/o1

3 benchmarks
Artificial Analysis Agentic Index (Maximum Reasoning) 31.1 Artificial Analysis Intelligence Index v4.0 (Maximum Reasoning) 30.7 Artificial Analysis Coding Index (Maximum Reasoning) 20.5