

Poster

Distributed Speculative Inference (DSI): Speculation Parallelism for Provably Faster Lossless Language Model Inference

Nadav Timor ⋅ Jonathan Mamou ⋅ Daniel Korat ⋅ Moshe Berchansky ⋅ Oren Pereg ⋅ Moshe Wasserblat ⋅ Tomer Galanti ⋅ Michal Gordon-Kiwkowitz ⋅ David Harel
2025 Poster

