Skip to yearly menu bar Skip to main content


Poster

BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games

Davide Paglieri · Bartłomiej Cupiał · Samuel Coward · Ulyana Piterbarg · Maciej Wołczyk · Akbir Khan · Eduardo Pignatelli · Łukasz Kuciński · Lerrel Pinto · Rob Fergus · Jakob Foerster · Jack Parker-Holder · Tim Rocktaeschel
2025 Poster

Abstract

Video

Chat is not available.