Why do Learning Rates Transfer? Reconciling Optimization and Scaling Limits for Deep Learning

Alexandru Meterez · Lorenzo Noci · Thomas Hofmann · Antonio Orvieto

Abstract
