Hugging Face's logo Hugging Face
  • Models
  • Datasets
  • Spaces
  • Docs
  • Enterprise
  • Pricing

  • Log In
  • Sign Up
OpenEvals 's Collections
YourBench
Archived Open LLM Leaderboard (2024-2025)
Research collaborations
Leaderboards related tools
Archived Open LLM Leaderboard (2023-2024)

Archived Open LLM Leaderboard (2024-2025)

updated Apr 2

This leaderboard has been evaluating LLMs from Jun 2024 on IFEval, MuSR, GPQA, MATH, BBH and MMLU-Pro

Upvote
-

  • Running
    124
    124

    Open-LLM performances are plateauing, let’s make the leaderboard steep again

    🏔

    Explore and compare advanced language models on a new leaderboard

    Note Blog on why we made a new version of the Open LLM Leaderboard


  • Running on CPU Upgrade
    13.5k
    13.5k

    Open LLM Leaderboard

    🏆

    Track, rank and evaluate open LLMs and chatbots

    Note The actual leaderboard! With a stylish new ux :)


  • open-llm-leaderboard/contents

    Viewer • Updated Mar 20 • 4.58k • 9.03k • 19

    Note If you want to download the main leaderboard table, you'll find the dataset here!


  • open-llm-leaderboard/results

    Preview • Updated Mar 15 • 3.01k • 15

    Note To extract more detailed aggregated results for each model, look here!


  • open-llm-leaderboard/requests

    Preview • Updated Mar 17 • 44.3k • 12

    Note All models ever submitted to the leaderboard


  • Running on CPU Upgrade
    105
    105

    Open LLM Leaderboard Model Comparator

    🏆

    Compare Open LLM Leaderboard results

Upvote
-
  • Collection guide
  • Browse collections
Company
TOS Privacy About Jobs
Website
Models Datasets OCR模型免费转Markdown Pricing 模型下载攻略