Toggle Poster Visibility
Oral
Thu Apr 24 12:30 AM -- 12:42 AM (PDT) @ Garnet 213-215 None
MLE-bench: Evaluating Machine Learning Agents on Machine Learning Engineering
[
OpenReview]
Oral
Thu Apr 24 12:42 AM -- 12:54 AM (PDT) @ Garnet 213-215 None
MMQA: Evaluating LLMs with Multi-Table Multi-Hop Complex Questions
[
OpenReview]
Oral
Thu Apr 24 12:54 AM -- 01:06 AM (PDT) @ Garnet 213-215 None
MMIE: Massive Multimodal Interleaved Comprehension Benchmark for Large Vision-Language Models
[
OpenReview]
Oral
Thu Apr 24 01:06 AM -- 01:18 AM (PDT) @ Garnet 213-215 None
Dynamic Multimodal Evaluation with Flexible Complexity by Vision-Language Bootstrapping
[
OpenReview]
Oral
Thu Apr 24 01:18 AM -- 01:30 AM (PDT) @ Garnet 213-215 None
PathGen-1.6M: 1.6 Million Pathology Image-text Pairs Generation through Multi-agent Collaboration
[
OpenReview]
Oral
Thu Apr 24 01:30 AM -- 01:42 AM (PDT) @ Garnet 213-215 None
Two Effects, One Trigger: On the Modality Gap, Object Bias, and Information Imbalance in Contrastive Vision-Language Models
[
OpenReview]
Successful Page Load