Skip to yearly menu bar Skip to main content


Poster

Asynchronous RLHF: Faster and More Efficient Off-Policy RL for Language Models

Michael Noukhovitch ⋅ Shengyi Huang ⋅ Sophie Xhonneux ⋅ Arian Hosseini ⋅ Rishabh Agarwal ⋅ Aaron Courville
2025 Poster

Abstract

Video

Chat is not available.