Poster

Language models scale reliably with over-training and on downstream tasks

Samir Yitzhak Gadre ⋅ Georgios Smyrnis ⋅ Vaishaal Shankar ⋅ Suchin Gururangan ⋅ Mitchell Wortsman ⋅ Rulin Shao ⋅ Jean Mercat ⋅ Alex Fang ⋅ Jeffrey Li ⋅ Sedrick Keh ⋅ Rui Xin ⋅ Marianna Nezhurina ⋅ Igor Vasiljevic ⋅ Luca Soldaini ⋅ Jenia Jitsev ⋅ Alex Dimakis ⋅ Gabriel Ilharco ⋅ Pang Wei Koh ⋅ Shuran Song ⋅ Thomas Kollar ⋅ Yair Carmon ⋅ Achal Dave ⋅ Reinhard Heckel ⋅ Niklas Muennighoff ⋅ Ludwig Schmidt
2025 Poster

Abstract
