evaluation-guidebook
huggingface/evaluation-guidebook
Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!
Stars: 2,063
Forks: 119
Open issues: 4
Watchers: 2,063
Size: 1.1 MB
Jupyter Notebook, Other
evaluation, evaluation-metrics, guidebook, large-language-models, llm, machine-learning, tutorial
Created: Oct 9, 2024
Updated: Feb 25, 2026
Last push: Dec 3, 2025