Poster

Towards Greater Leverage: Scaling Laws for Efficient Mixture-of-Experts Language Models

Changxin Tian · Kunlong Chen · Jia Liu · Ziqi Liu · Zhiqiang Zhang · Jun Zhou

Abstract
