Skip to yearly menu bar Skip to main content


Poster

MHLA: Restoring Expressivity of Linear Attention via Token-Level Multi-Head

Kewei Zhang · Ye Huang · Yufan Deng · Jincheng YU · Junsong Chen · Huan Ling · Enze Xie · Zhou Daquan

Abstract

Log in and register to view live content