

Poster

Multi-modal Agent Tuning: Building a VLM-Driven Agent for Efficient Tool Usage

Zhi Gao ⋅ Bofei Zhang ⋅ Pengxiang Li ⋅ Xiaojian Ma ⋅ Tao Yuan ⋅ Yue Fan ⋅ Yuwei Wu ⋅ Yunde Jia ⋅ Song-Chun Zhu ⋅ Qing Li
2025 Poster


