alpaca_eval
tatsu-lab/alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
1,954stars
Forks
303
Open issues
28
Watchers
1,954
Size
307.6 MB
Jupyter NotebookApache License 2.0
deep-learningevaluationfoundation-modelsinstruction-followinglarge-language-modelsleaderboardnlprlhf
Created: May 25, 2023
Updated: Feb 26, 2026
Last push: Aug 9, 2025