Skip to yearly menu bar Skip to main content


Poster

SafeDialBench: A Fine-Grained Safety Evaluation Benchmark for Large Language Models in Multi-Turn Dialogues with Diverse Jailbreak Attacks

Hongye Cao · Yanming Wang · Sijia Jing · Ziyue Peng · Zhixin Bai · Zhe Cao · Meng Fang · Fan Feng · JIAHENG LIU · Boyan Wang · Tianpei Yang · Jing Huo · Yang Gao · Fanyu Meng · Xi Yang · Chao Deng · Junlan Feng

Abstract

Log in and register to view live content