alpaca_eval
tatsu-lab/alpaca_eval
An automatic evaluator for instruction-following language models. Human-validated, high-quality, cheap, and fast.
1,990stars
Forks
309
Open issues
29
Watchers
1,990
Size
307.6 MB
Jupyter NotebookApache License 2.0
deep-learningevaluationfoundation-modelsinstruction-followinglarge-language-modelsleaderboardnlprlhf
Created: May 25, 2023
Updated: May 29, 2026
Last push: Aug 9, 2025