Skip to yearly menu bar Skip to main content


Poster

OSTQuant: Refining Large Language Model Quantization with Orthogonal and Scaling Transformations for Better Distribution Fitting

Xing Hu ⋅ Yuan Cheng ⋅ Dawei Yang ⋅ Zhixuan Chen ⋅ Dawei Yang ⋅ Jiangyong Yu ⋅ XUCHEN ⋅ Zhihang Yuan ⋅ Zhe jiang ⋅ Sifan Zhou
2025 Poster

Abstract

Video

Chat is not available.