Skip to yearly menu bar Skip to main content


Oral

Cybench: A Framework for Evaluating Cybersecurity Capabilities and Risks of Language Models

Andy K Zhang ⋅ Neil Perry ⋅ Riya Dulepet ⋅ Joey Ji ⋅ Celeste Menders ⋅ Justin Lin ⋅ Eliot Jones ⋅ Gashon Hussein ⋅ Samantha Liu ⋅ Donovan Jasper ⋅ Pura Peetathawatchai ⋅ Ari Glenn ⋅ Vikram Sivashankar ⋅ Daniel Zamoshchin ⋅ Leo Glikbarg ⋅ Derek Askaryar ⋅ Haoxiang Yang ⋅ Aolin Zhang ⋅ Rishi Alluri ⋅ Nathan Tran ⋅ Rinnara Sangpisit ⋅ Kenny Oseleononmen ⋅ Dan Boneh ⋅ Daniel Ho ⋅ Percy Liang
2025 Oral

Abstract

Video

Chat is not available.