⭐ Star AlbumentationsX on GitHub — 325+ stars and counting!

Star on GitHub
NousResearch

openai-evals

NousResearch/openai-evals

Evals is a framework for evaluating LLMs and LLM systems, and an open-source registry of benchmarks.

3stars
Forks
1
Open issues
0
Watchers
3
Size
6.3 MB
Other
Created: Nov 29, 2025
Updated: May 9, 2026
Last push: Nov 29, 2025
Fork