Poster

The Art of Scaling Reinforcement Learning Compute for LLMs

Devvrit Khatri · Lovish Madaan · Rishabh Tiwari · Rachit Bansal · Venkata Sai Surya Subramanyam Duvvuri · Manzil Zaheer · Inderjit Dhillon · David Brandfonbrener · Rishabh Agarwal

Abstract
