Skip to yearly menu bar Skip to main content


Poster

Hymba: A Hybrid-head Architecture for Small Language Models

Xin Dong ⋅ Yonggan Fu ⋅ Shizhe Diao ⋅ Wonmin Byeon ⋅ ZIJIA CHEN ⋅ Ameya Mahabaleshwarkar ⋅ Shih-Yang Liu ⋅ Matthijs Van keirsbilck ⋅ Min-Hung Chen ⋅ Yoshi Suhara ⋅ Yingyan Celine Lin ⋅ Jan Kautz ⋅ Pavlo Molchanov
2025 Poster

Abstract

Video

Chat is not available.