evaluation-guidebook
huggingface/evaluation-guidebook
Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!
2,111 stars
Forks
123
Open issues
5
Watchers
2,111
Size
1.1 MB
Jupyter Notebook, Other
evaluation, evaluation-metrics, guidebook, large-language-models, llm, machine-learning, tutorial
Created: Oct 9, 2024
Updated: May 30, 2026
Last push: Dec 3, 2025