openai/mle-bench

MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering.

Stars: 1,330
Forks: 219
Open issues: 4
Watchers: 1,330
Size: 0.7 MB
Languages: Python, Other
Created: Oct 8, 2024
Updated: Feb 27, 2026
Last push: Feb 26, 2026