Skip to yearly menu bar Skip to main content


Poster

SPG: Sandwiched Policy Gradient for Masked Diffusion Language Models

Chenyu Wang · Paria Rashidinejad · Andy (DiJia) Su · Song Jiang · Sid Wang · Siyan Zhao · Cai Zhou · Shannon Shen · Feiyu Chen · Tommi Jaakkola · Yuandong Tian · Bo Liu

Abstract

Log in and register to view live content