Skip to yearly menu bar Skip to main content


Poster

Deconstructing What Makes a Good Optimizer for Autoregressive Language Models

Rosie Zhao ⋅ Depen Morwani ⋅ David Brandfonbrener ⋅ Nikhil Vyas ⋅ Sham Kakade
2025 Poster

Abstract

Video

Chat is not available.