Skip to yearly menu bar Skip to main content


Poster

Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks

Taishi Nakamura · Satoki Ishikawa · Masaki Kawamura · Okamoto · Daisuke Nohara · Jun Suzuki · Rio Yokota

Abstract

Log in and register to view live content