Skip to yearly menu bar Skip to main content


From Dense to Dynamic: Token-Difficulty Driven MoEfication of Pre-Trained LLMs

Kumari Nishu · Sachin Mehta · Samira Abnar · Mehrdad Farajtabar · Maxwell Horton · Mahyar Najibi · Moin Nabi · Minsik Cho · Devang Naik

Abstract

Chat is not available.