Leaderboards

Model benchmarks and community rankings

Performance scores from standardized evaluations

MMLU

Massive Multitask Language Understanding - Tests knowledge across 57 subjects

Scores from official papers and third-party evaluations. Results may vary.

Based on ratings, engagement, and contributions

Want to see your work on the leaderboard?