Skip to yearly menu bar Skip to main content


Poster

ManagerBench: Evaluating the Safety-Pragmatism Trade-off in Autonomous LLMs

Adi Simhi · Jonathan Herzig · Martin Tutek · Itay Itzhak · Idan Szpektor · Yonatan Belinkov

Abstract

Log in and register to view live content