Skip to yearly menu bar Skip to main content


Poster

Scaling with Collapse: Efficient and Predictable Training of LLM Families

Shane Bergsma · Bin Zhang · Nolan Dey · Shaheer Muhammad · Gurpreet Gosal · Joel Hestness

Abstract

Log in and register to view live content