Toggle Poster Visibility
Oral
Thu Apr 24 07:30 PM -- 07:42 PM (PDT) @ Garnet 212-213 None
MAP: Multi-Human-Value Alignment Palette
[
Slides]
[
OpenReview]
Oral
Thu Apr 24 07:42 PM -- 07:54 PM (PDT) @ Garnet 212-213 None
Limits to scalable evaluation at the frontier: LLM as judge won’t beat twice the data
[
OpenReview]
Oral
Thu Apr 24 07:54 PM -- 08:06 PM (PDT) @ Garnet 212-213 None
Trust or Escalate: LLM Judges with Provable Guarantees for Human Agreement
[
OpenReview]
Oral
Thu Apr 24 08:06 PM -- 08:18 PM (PDT) @ Garnet 212-213 None
AI as Humanity’s Salieri: Quantifying Linguistic Creativity of Language Models via Systematic Attribution of Machine Text against Web Text
[
Slides]
[
OpenReview]
Oral
Thu Apr 24 08:18 PM -- 08:30 PM (PDT) @ Garnet 212-213 None
Consistency Checks for Language Model Forecasters
[
OpenReview]
Oral
Thu Apr 24 08:30 PM -- 08:42 PM (PDT) @ Garnet 212-213 None
Probabilistic Learning to Defer: Handling Missing Expert Annotations and Controlling Workload Distribution
[
Slides]
[
OpenReview]
Successful Page Load