

Poster

One Step of Gradient Descent is Provably the Optimal In-Context Learner with One Layer of Linear Self-Attention

Arvind Mahankali · Tatsunori Hashimoto · Tengyu Ma
2024 Poster

Abstract

