Note: Overall leaderboard rankings may not reflect true model quality — individual benchmarks give a clearer picture. ARC-Challenge MMLU GPQA GSM8K Artificial Analysis Intelligence Index v4.0

← Back to leaderboard

CritPt

2 models

Top 10 Models Performance

anthropic/claude-opus-4.8	########################################	21
x-ai/grok-4.3	###############	8

Rank	Model	Score
🥇	anthropic/claude-opus-4.8	21
🥈	x-ai/grok-4.3	8

JavaScript enhances filtering and charts. All data is rendered server-side.

View the sitemap for available pages.