Note: Overall leaderboard rankings may not reflect true model quality; individual benchmarks give a clearer picture.

amd/amd-llama-135m

10 benchmarks
Benchmark       Score
SciQ            76.1
PIQA            64.2
WinoGrande      50.12
ARC-Easy        43.64
WSC             36.54
Lambada         33.3
HellaSwag       30.48
MMLU            23.02
LogiQA          21.2
ARC-Challenge   19.11