Skip to yearly menu bar Skip to main content


Poster

Tactic: Adaptive Sparse Attention with Clustering and Distribution Fitting for Long-Context LLMs

Tian Tang · Kan Zhu · Qinyu Xu · Zhan Jin · Yile Gu · Zhichen Zeng · Rohan Kadekodi · Liangyu Zhao · Ang Li · Arvind Krishnamurthy · Baris Kasikci

Abstract

Log in and register to view live content