Running 3 3 41 LLMs Evaluated Locally on 19 Benchmarks ⚡ 41 open-source LLMs benchmarked locally on 19 tasks.
Running 17 17 Bringing paper to life: A modern template for scientific writing 📝 Create a modern scientific paper template
Running on CPU Upgrade 223 223 MMLU-Pro Leaderboard 🥇 More advanced and challenging multi-task evaluation