

Poster

FlexPrefill: A Context-Aware Sparse Attention Mechanism for Efficient Long-Sequence Inference

Xunhao Lai · Jianqiao Lu · Yao Luo · Yiyuan Ma · Xun Zhou
2025 Poster

Abstract
