Skip to yearly menu bar Skip to main content


Poster

QLLM: Accurate and Efficient Low-Bitwidth Quantization for Large Language Models

Jing Liu · Ruihao Gong · Xiuying Wei · Zhiwei Dong · Jianfei Cai · Bohan Zhuang
2024 Poster

Abstract

Video

Chat is not available.