

Poster

Strategic Obfuscation of Deceptive Reasoning in Language Models

Arun Jose · Niels Warncke · Mia Taylor

Abstract
