Skip to yearly menu bar Skip to main content


Poster

Large Batch Optimization for Deep Learning: Training BERT in 76 minutes

Xiaodan Song · Jing Li · Cho-Jui Hsieh · Yang You · Srinadh Bhojanapalli · Jonathan Hseu · Sashank Reddi · Kurt Keutzer · Jim Demmel · Sanjiv Kumar

Abstract

Chat is not available.