Skip to yearly menu bar Skip to main content


Poster

Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models

Michael Noukhovitch · Shengyi Huang · Sophie Xhonneux · Arian Hosseini · Rishabh Agarwal · Aaron Courville
2025 Poster

Abstract

Video

Chat is not available.