Skip to yearly menu bar Skip to main content


Poster

VitaBench: Benchmarking LLM Agents with Versatile Interactive Tasks in Real-world Applications

Wei He · Yueqing Sun · Hongyan Hao · Zhikang Xia · Xueyuan Hao · Qi GU · Hui Su · Xunliang Cai

Abstract

Log in and register to view live content