Skip to yearly menu bar Skip to main content


Oral

Rethinking Reward Modeling in Preference-based Large Language Model Alignment

Hao Sun ⋅ Yunyi Shen ⋅ Jean-Francois Ton
2025 Oral

Abstract

Video

Chat is not available.