Poster

SWIFT: On-the-Fly Self-Speculative Decoding for LLM Inference Acceleration

Heming Xia ⋅ Yongqi Li ⋅ Jun Zhang ⋅ Cunxiao Du ⋅ Wenjie Li
2025 Poster

Abstract
