Skip to yearly menu bar Skip to main content


Oral Fri, Apr 24, 2026 • 6:54 AM – 7:04 AM PDT

Optimal Sparsity of Mixture-of-Experts Language Models for Reasoning Tasks

Taishi Nakamura · Satoki Ishikawa · Masaki Kawamura · Okamoto · Daisuke Nohara · Jun Suzuki · Rio Yokota

Abstract

Log in and register to view live content