Skip to yearly menu bar Skip to main content


Poster

VerifyBench: Benchmarking Reference-based Reward Systems for Large Language Models

Yuchen Yan · Jin Jiang · Zhenbang Ren · Yijun Li · Xudong Cai · Yang Liu · Xin Xu · Mengdi Zhang · Jian Shao · Yongliang Shen · Jun Xiao · Yueting Zhuang

Abstract

Log in and register to view live content