Skip to yearly menu bar Skip to main content


Poster

ST-WebAgentBench: A Benchmark for Evaluating Safety and Trustworthiness in Web Agents

Ido Levy · Ben wiesel · Sami Marreed · Alon Oved · Avi Yaeli · Segev Shlomov

Abstract

Log in and register to view live content