ResQ: Mixed-Precision Quantization of Large Language Models with Low-Rank Residuals

Utkarsh Saxena · Sayeh Sharify · Kaushik Roy · Xin Wang

Abstract
