Skip to yearly menu bar Skip to main content


Poster

How Does Critical Batch Size Scale in Pre-training?

Hanlin Zhang ⋅ Depen Morwani ⋅ Nikhil Vyas ⋅ Jingfeng Wu ⋅ Difan Zou ⋅ Udaya Ghai ⋅ Dean Foster ⋅ Sham Kakade
2025 Poster

Abstract

Video

Chat is not available.