Skip to yearly menu bar Skip to main content


Poster

JudgeBench: A Benchmark for Evaluating LLM-Based Judges

Sijun Tan · Siyuan Zhuang · Kyle Montgomery · William Tang · Alejandro Cuadron · Chenguang Wang · Raluca Popa · Ion Stoica
2025 Poster

Abstract

Video

Chat is not available.