Skip to yearly menu bar Skip to main content


Poster

R-Sparse: Rank-Aware Activation Sparsity for Efficient LLM Inference

Zhenyu Zhang ⋅ Zechun Liu ⋅ Yuandong Tian ⋅ Harshit Khaitan ⋅ Zhangyang Wang ⋅ Steven Li
2025 Poster

Abstract

Video

Chat is not available.