Skip to yearly menu bar Skip to main content


Large Language Models can Strategically Deceive their Users when Put Under Pressure

Jérémy Scheurer · Mikita Balesni · Marius Hobbhahn

Abstract

Chat is not available.