Skip to yearly menu bar Skip to main content


Poster

PARD: Accelerating LLM Inference with Low‑Cost PARallel Draft Model Adaptation

Zihao An · Huajun Bai · Ziqiong Liu · Dong Li · Emad Barsoum

Abstract

Log in and register to view live content