Pivoting Factorization: A Compact Meta Low-Rank Representation of Sparsity for Efficient Inference in Large Language Models

Jialin Zhao · Yingtao Zhang · Carlo Vittorio Cannistraci

Abstract
