Poster

GShard: Scaling Giant Models with Conditional Computation and Automatic Sharding

Dmitry Lepikhin · HyoukJoong Lee · Yuanzhong Xu · Dehao Chen · Orhan Firat · Yanping Huang · Maxim Krikun · Noam Shazeer · Zhifeng Chen
2021 Poster

Abstract
