evals
openai/evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
18,197stars
Forks
2,922
Open issues
184
Watchers
18,197
Size
6.5 MB
PythonOther
Created: Jan 23, 2023
Updated: Apr 14, 2026
Last push: Apr 13, 2026