Skip to yearly menu bar Skip to main content


Poster

BALROG: Benchmarking Agentic LLM and VLM Reasoning On Games

Davide Paglieri ⋅ Bartłomiej Cupiał ⋅ Samuel Coward ⋅ Ulyana Piterbarg ⋅ Maciej Wołczyk ⋅ Akbir Khan ⋅ Eduardo Pignatelli ⋅ Łukasz Kuciński ⋅ Lerrel Pinto ⋅ Rob Fergus ⋅ Jakob Foerster ⋅ Jack Parker-Holder ⋅ Tim Rocktaeschel
2025 Poster

Abstract

Video

Chat is not available.