alpaca_eval
tatsu-lab/alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
1,966 stars
Forks: 307
Open issues: 29
Watchers: 1,966
Size: 307.6 MB
Jupyter Notebook
Apache License 2.0
deep-learning, evaluation, foundation-models, instruction-following, large-language-models, leaderboard, nlp, rlhf
Created: May 25, 2023
Updated: Apr 13, 2026
Last push: Aug 9, 2025