Note: Overall leaderboard rankings may not reflect true model quality — individual benchmarks give a clearer picture. ARC-Challenge MMLU GPQA GSM8K Artificial Analysis Intelligence Index
← Back to leaderboard

raincandy-u/rain-100m

4 benchmarks
TruthfulQA 16.28 MMLU 9.06 ARC-Challenge 3.3 WikiText-2 (-ppl) -107.9683