Skip to yearly menu bar Skip to main content


Poster

Expert Divergence Learning for MoE-based Language Models

Jiaang Li · Haibin Chen · langming liu · Yujin Yuan · Yadao Wang · Yizhen Zhang · Chengting Yu · Xin Tong · Weidong Zhang · Shilei Liu · wenbo su · Bo Zheng

Abstract

Log in and register to view live content