Poster

AdvChain: Adversarial Chain-of-Thought Tuning for Robust Safety Alignment of Large Reasoning Models

Zihao Zhu · Xinyu Wu · Gehan Hu · Siwei Lyu · Ke Xu · Baoyuan Wu

Abstract
