Poster

Towards Robust Alignment of Language Models: Distributionally Robustifying Direct Preference Optimization

Junkang Wu ⋅ Yuexiang Xie ⋅ Zhengyi Yang ⋅ Jiancan Wu ⋅ Jiawei Chen ⋅ Jinyang Gao ⋅ Bolin Ding ⋅ Xiang Wang ⋅ Xiangnan He
2025 Poster
