Skip to yearly menu bar Skip to main content


Simple linear attention language models balance the recall-throughput tradeoff

Simran Arora · Sabri Eyuboglu · Michael Zhang · Aman Timalsina · Silas Alberti · James Y Zou · Atri Rudra · Christopher Re

Abstract

Chat is not available.