Skip to yearly menu bar Skip to main content


Poster

Regressing the Relative Future: Efficient Policy Optimization for Multi-turn RLHF

Zhaolin Gao ⋅ Wenhao Zhan ⋅ Jonathan Chang ⋅ Gokul Swamy ⋅ Kianté Brantley ⋅ Jason Lee ⋅ Wen Sun
2025 Poster

Abstract

Video

Chat is not available.