tatsu-lab/alpaca_eval

An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.

Stars: 1,954
Forks: 303
Open issues: 28
Watchers: 1,954
Size: 307.6 MB
Language: Jupyter Notebook
License: Apache License 2.0
Topics: deep-learning, evaluation, foundation-models, instruction-following, large-language-models, leaderboard, nlp, rlhf
Created: May 25, 2023
Updated: Feb 26, 2026
Last push: Aug 9, 2025