Skip to yearly menu bar Skip to main content


Beyond Seeing: Evaluating Multimodal LLMs on Tool-Enabled Image Perception, Transformation, and Reasoning

Xingang Guo ⋅ Utkarsh Tyagi ⋅ Advait Gosai ⋅ Paula Vergara ⋅ Jayeon Park ⋅ Ernesto Montoya ⋅ Chen Bo Calvin Zhang ⋅ Bin Hu ⋅ Yunzhong He ⋅ Bing Liu ⋅ Rakshith Sharma Srinivasa

Abstract

Chat is not available.