Skip to yearly menu bar Skip to main content


Poster

Unintentional Unalignment: Likelihood Displacement in Direct Preference Optimization

Noam Razin · Sadhika Malladi · Adithya Bhaskar · Danqi Chen · Sanjeev Arora · Boris Hanin
2025 Poster

Abstract

Video

Chat is not available.