Skip to yearly menu bar Skip to main content


Workshop

The Mirage of Action-Dependent Baselines in Reinforcement Learning

George Tucker ⋅ Surya Bhupatiraju ⋅ Shixiang Gu ⋅ Richard E Turner ⋅ Zoubin Ghahramani ⋅ Sergey Levine
[ PDF

Abstract

Video

Chat is not available.