Skip to yearly menu bar Skip to main content


Poster

V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control

Dhruva Tirumala · Arun Ahuja · Martin Riedmiller · Jack Rae · Hubert Soyer · Seb Noury · Nicolas Heess · Jost Tobias Springenberg · Francis Song · SIQI LIU · Abbas Abdolmaleki · Aidan Clark · Dan Belov · Matthew Botvinick

Abstract

Chat is not available.