openai-evals
NousResearch/openai-evals
Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.
Stars: 3
Forks: 1
Open issues: 0
Watchers: 3
Size: 6.3 MB
Other
Created: Nov 29, 2025
Updated: May 9, 2026
Last push: Nov 29, 2025
Fork