Note: Overall leaderboard rankings may not reflect true model quality — individual benchmarks give a clearer picture. ARC-Challenge MMLU GPQA GSM8K Artificial Analysis Intelligence Index v4.0
← Back to leaderboard

simpleQA

1 models

Top 10 Models Performance

tencent/hy3-preview-base ######################################## 26.47
68.8K – 862.0B
Rank Model Score
🥇 tencent/hy3-preview-base 26.47