Skip to yearly menu bar Skip to main content


West-of-N: Synthetic Preference Generation for Improved Reward Modeling

AlizĂ©e Pace · Jonathan Mallinson · Eric Malmi · Sebastian Krause · Aliaksei Severyn

Abstract

Chat is not available.