Skip to yearly menu bar Skip to main content


Poster

MA-RLHF: Reinforcement Learning from Human Feedback with Macro Actions

Yekun Chai ⋅ Haoran Sun ⋅ Huang Fang ⋅ Shuohuan Wang ⋅ Yu Sun ⋅ hua wu
2025 Poster

Abstract

Video

Chat is not available.