Note: Overall leaderboard rankings may not reflect true model quality; individual benchmarks give a clearer picture.

MATH-hard

2 models

Top 10 Models Performance

zyphra/zaya1-base ######################################## 54.15
qwen/qwen3-1.7b-base ######################### 33.2
Rank  Model                 Score
🥇    zyphra/zaya1-base     54.15
🥈    qwen/qwen3-1.7b-base  33.2