Skip to yearly menu bar Skip to main content


Oral

MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering

Jun Shern Chan ⋅ Neil Chowdhury ⋅ Oliver Jaffe ⋅ James Aung ⋅ Dane Sherburn ⋅ Evan Mays ⋅ Giulio Starace ⋅ Kevin Liu ⋅ Leon Maksin ⋅ Tejal Patwardhan ⋅ Aleksander Madry ⋅ Lilian Weng
2025 Oral

Abstract

Video

Chat is not available.