

Poster

WINA: Weight Informed Neuron Activation for Accelerating Large Language Model Inference

Sihan Chen · Dan Zhao · Jongwoo Ko · Colby Banbury · Huiping Zhuang · Luming Liang · Pashmina Cameron · Tianyi Chen

Abstract
