Poster

AgentHarm: A Benchmark for Measuring Harmfulness of LLM Agents

Maksym Andriushchenko · Alexandra Souly · Mateusz Dziemian · Derek Duenas · Maxwell Lin · Justin Wang · Dan Hendrycks · Andy Zou · Zico Kolter · Matt Fredrikson · Yarin Gal · Xander Davies
2025 Poster
