Virtual presentation / poster accept

Mind the Gap: Offline Policy Optimization for Imperfect Rewards

Jianxiong Li ⋅ Xiao Hu ⋅ Haoran Xu ⋅ Jingjing Liu ⋅ Xianyuan Zhan ⋅ Qing-Shan Jia ⋅ Ya-Qin Zhang
