Note: Overall leaderboard rankings may not reflect true model quality — individual benchmarks give a clearer picture. ARC-Challenge MMLU GPQA GSM8K Artificial Analysis Intelligence Index
← Back to leaderboard

MATH-Vision

2 models

Top 10 Models Performance

google/gemma-4-31b-it ######################################## 85.6
google/gemma-4-e4b-it ############################ 59.5
Rank Model Score
🥇 google/gemma-4-31b-it 85.6
🥈 google/gemma-4-e4b-it 59.5