mle-bench
openai/mle-bench
MLE-bench is a benchmark for measuring how well AI agents perform at machine learning engineering
1,459stars
Forks
237
Open issues
4
Watchers
1,459
Size
0.8 MB
PythonOther
Created: Oct 8, 2024
Updated: Apr 14, 2026
Last push: Mar 20, 2026