

Contributed Talk 2: Catapults in SGD: spikes in the training loss and their impact on generalization through feature learning

Libin Zhu · Chaoyue Liu · Adityanarayanan Radhakrishnan · Misha Belkin

