Skip to yearly menu bar Skip to main content


Llamba: Scaling Distilled Recurrent Models for Efficient Language Processing

Aviv Bick · Tobias Katsch · Nimit Sohoni · Arjun Desai · Albert Gu

Abstract

Video

Chat is not available.