evaluation-guidebook
huggingface/evaluation-guidebook
Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!
Stars: 2,091
Forks: 122
Open issues: 5
Watchers: 2,091
Size: 1.1 MB
Languages: Jupyter Notebook, Other
Topics: evaluation, evaluation-metrics, guidebook, large-language-models, llm, machine-learning, tutorial
Created: Oct 9, 2024
Updated: Apr 13, 2026
Last push: Dec 3, 2025