Note: Overall leaderboard rankings may not reflect true model quality; individual benchmarks give a clearer picture.

BoolQ

4 models

Top 10 Models Performance

google/gemma-3-27b-pt ######################################## 82.4
google/gemma-3-12b-pt ###################################### 78.8
google/gemma-3-4b-pt ################################### 72.3
google/gemma-3-1b-pt ############################### 63.2
Rank Model Score
🥇 google/gemma-3-27b-pt 82.4
🥈 google/gemma-3-12b-pt 78.8
🥉 google/gemma-3-4b-pt 72.3
4 google/gemma-3-1b-pt 63.2