Skip to yearly menu bar Skip to main content


Oral Sat, Apr 25, 2026 • 7:06 AM – 7:16 AM PDT

The Art of Scaling Reinforcement Learning Compute for LLMs

Devvrit Khatri · Lovish Madaan · Rishabh Tiwari · Rachit Bansal · Venkata Sai Surya Subramanyam Duvvuri · Manzil Zaheer · Inderjit Dhillon · David Brandfonbrener · Rishabh Agarwal

Abstract

Log in and register to view live content