Note: Overall leaderboard rankings may not reflect true model quality — individual benchmarks give a clearer picture. ARC-Challenge MMLU GPQA GSM8K Artificial Analysis Intelligence Index v4.0
← Back to leaderboard

CritPt

2 models

Top 10 Models Performance

anthropic/claude-opus-4.8 ######################################## 21
x-ai/grok-4.3 ############### 8
68.8K – 862.0B
Rank Model Score
🥇 anthropic/claude-opus-4.8 21
🥈 x-ai/grok-4.3 8