

Poster

Uni-DPO: A Unified Paradigm for Dynamic Preference Optimization of LLMs

Shangpin Peng · Weinong Wang · Zhuotao Tian · Senqiao Yang · Xing W · Haotian Xu · Chengquan Zhang · Takashi Isobe · Baotian Hu · Min Zhang

Abstract
