Skip to yearly menu bar Skip to main content


Poster

Joint Reward and Policy Learning with Demonstrations and Human Feedback Improves Alignment

Chenliang Li ⋅ Siliang Zeng ⋅ Zeyi Liao ⋅ Jiaxiang Li ⋅ Dongyeop Kang ⋅ Alfredo Garcia ⋅ Mingyi Hong
2025 Poster

Abstract

Video

Chat is not available.