|
Sun 6:00 p.m. - 6:05 p.m.
|
Opening Remarks 📖
(
Intro
)
>
SlidesLive Video
|
🔗
|
|
Sun 6:05 p.m. - 6:35 p.m.
|
Invited Talk: Peter Henderson 🤝🗣️ Copyright Law and Foundation Model Design
(
Invited Talk
)
>
SlidesLive Video
|
Peter Henderson
🔗
|
|
Sun 6:35 p.m. - 7:05 p.m.
|
Invited Talk: Danqi Chen 🤝🗣️ How Data Domains Improve Language Model Pre-Training
(
Invited Talk
)
>
SlidesLive Video
|
Danqi Chen
🔗
|
|
Sun 7:05 p.m. - 7:20 p.m.
|
Coffee Break ☕
|
🔗
|
|
Sun 7:20 p.m. - 7:35 p.m.
|
Spotlight Presentation: Xinran Gu 📊 Data Mixing Can Induce Phase Transitions in Knowledge Acquisition
(
Presentation
)
>
SlidesLive Video
|
Kaifeng Lyu
🔗
|
|
Sun 7:35 p.m. - 7:50 p.m.
|
Spotlight Presentation: Edward Yeo 📊 Demystifying Long CoT Reasoning in LLMs
(
Presentation
)
>
SlidesLive Video
|
Xiang Yue
🔗
|
|
Sun 7:50 p.m. - 9:00 p.m.
|
Poster Session I 🪧
(
Poster Session
)
>
|
🔗
|
|
Sun 9:00 p.m. - 10:30 p.m.
|
Lunch Break 🍲 Lunch Box Bento Provided
(
Lunch
)
>
|
🔗
|
|
Sun 10:30 p.m. - 11:00 p.m.
|
Invited Talk: Vahab Mirrokni 🤝🗣️Data for LLMs: From Mixture and Efficiency to Privacy, and Reasoning
(
Invited Talk
)
>
SlidesLive Video
|
Vahab Mirrokni
🔗
|
|
Sun 11:00 p.m. - 11:30 p.m.
|
Invited Talk: Kyle Lo 🤝🗣️ The OLMo Cookbook: Open Recipes for Language Model Data Curation
(
Invited Talk
)
>
SlidesLive Video
|
Kyle Lo
🔗
|
|
Sun 11:30 p.m. - 11:45 p.m.
|
Spotlight Presentation: Zheng Xu 📊 Synthesizing Privacy-Preserving Text Data via Finetuning without Finetuning Billion-Scale LLMs
(
Presentation
)
>
SlidesLive Video
|
🔗
|
|
Sun 11:45 p.m. - 12:00 a.m.
|
Spotlight Presentation: Brandon Trabucco 📊 Towards Internet-Scale Training For Agents
(
Presentation
)
>
link
SlidesLive Video
|
Brandon Trabucco
🔗
|
|
Mon 12:00 a.m. - 12:30 a.m.
|
Coffee Break ☕
|
🔗
|
|
Mon 12:30 a.m. - 1:00 a.m.
|
Invited Talk: Bryan Low 🤝🗣️ Data-centric AI Research @ GLOW.AI
(
Invited Talk
)
>
SlidesLive Video
|
Bryan Kian Hsiang Low
🔗
|
|
Mon 1:00 a.m. - 1:05 a.m.
|
Closing Remarks 📗
(
Outro
)
>
|
🔗
|
|
Mon 1:05 a.m. - 3:00 a.m.
|
Poster Session II 🪧
(
Poster Session
)
>
|
🔗
|
|
-
|
Chameleon: A Flexible Data-mixing Framework for Language Model Pretraining and Finetuning
(
Poster
)
>
link
|
Wanyun Xie · Francesco Tonin · Volkan Cevher
🔗
|
|
-
|
The Price is Right? Making Data Valuations Incentive-Compatible
(
Poster
)
>
link
SlidesLive Video
|
Dongyang Fan · Tyler Rotello · Sai Karimireddy
🔗
|
|
-
|
Language Model Preference Evaluation with Multiple Weak Evaluators
(
Poster
)
>
link
|
Zhengyu Hu · Jieyu Zhang · Zhihan Xiong · Alexander Ratner · Hui Xiong · Ranjay Krishna
🔗
|
|
-
|
Building Bridges, Not Walls: Advancing Interpretability by Unifying Feature, Data, and Model Component Attribution
(
Poster
)
>
link
|
Shichang Zhang · Tessa Han · Usha Bhalla · Hima Lakkaraju
🔗
|
|
-
|
Revisiting Multi-Modal LLM Evaluation
(
Poster
)
>
link
|
Jian Lu · Shikhar Srivastava · Junyu Chen · Robik Shrestha · Manoj Acharya · Kushal Kafle · Christopher Kanan
🔗
|
|
-
|
$f$-SCRUB: Unbounded Machine Unlearning Via $f$-divergences
(
Poster
)
>
link
SlidesLive Video
|
Amirhossein Bagheri · Radmehr Karimian · Gholamali Aminian
🔗
|
|
-
|
Preserving Product Fidelity in Large Scale Image Recontextualization with Diffusion Models
(
Poster
)
>
link
SlidesLive Video
|
Ishaan Malhi · Praneet Dutta · Ellie Talius · Sally Ma · Brendan Driscoll · Krista Holden · Garima Pruthi · Arunachalam Narayanaswamy
🔗
|
|
-
|
TOWARD EFFICIENT INFLUENCE FUNCTION: DROPOUT AS A COMPRESSION TOOL
(
Poster
)
>
link
SlidesLive Video
|
Yuchen Zhang · Mohammad Mohammadi Amiri
🔗
|
|
-
|
Explaining Length Bias in LLM-Based Preference Evaluations
(
Poster
)
>
link
|
Zhengyu Hu · Linxin Song · Jieyu Zhang · Zheyuan Xiao · Zhengyu Chen · Hui Xiong
🔗
|
|
-
|
DUET: Optimizing Training Data Mixtures via Feedback from Unseen Evaluation Tasks
(
Poster
)
>
link
SlidesLive Video
|
Zhiliang Chen · Gregory Kang Ruey Lau · Chuan Sheng Foo · Bryan Kian Hsiang Low
🔗
|
|
-
|
Differentially Private Synthetic Data via APIs 3: Using Simulators Instead of Foundation Model
(
Poster
)
>
link
|
Zinan Lin · Tadas Baltrusaitis · Sergey Yekhanin
🔗
|
|
-
|
Approximations to worst-case data dropping: unmasking failure modes
(
Poster
)
>
link
SlidesLive Video
|
Jenny Huang · David Burt · Yunyi Shen · Tin Nguyen · Tamara Broderick
🔗
|
|
-
|
Adversarial Attacks on Data Attribution
(
Poster
)
>
link
|
Xinhe Wang · Pingbang Hu · Junwei Deng · Jiaqi Ma
🔗
|
|
-
|
Data Efficient Pre-training for Language Models: An Empirical Study of Compute Efficiency and Linguistic Competence
(
Poster
)
>
link
|
Andreas Paraskeva · Max van Duijn · Maarten de Rijke · Suzan Verberne · Jan Rijn
🔗
|
|
-
|
Autoregressive Optimal Design for Language Models
(
Poster
)
>
link
|
Rohan Deb · Kiran Thekumparampil · Kousha Kalantari · Gaurush Hiranandani · Shoham Sabach · Branislav Kveton
🔗
|
|
-
|
PiKE: Adaptive Data Mixing for Multi-Task Learning Under Low Gradient Conflicts
(
Poster
)
>
link
|
Zeman Li · Yuan Deng · Peilin Zhong · Meisam Razaviyayn · Vahab Mirrokni
🔗
|
|
-
|
Defending LVLMs Against Vision Attacks through Partial-Perception Supervision
(
Poster
)
>
link
|
Qi Zhou · Tianlin Li · Qing Guo · Dongxia Wang · Yun Lin · Yang Liu · Jin Song Dong
🔗
|
|
-
|
OpenRAG: Optimizing RAG End-to-End via In-Context Retrieval Learning
(
Poster
)
>
link
|
Jiawei Zhou · Lei Chen
🔗
|
|
-
|
Nepotistically Trained Generative Image Models Collapse
(
Poster
)
>
link
SlidesLive Video
|
Maty Bohacek · Hany Farid
🔗
|
|
-
|
MMA: Benchmarking Multi-Modal Large Language Model in Ambiguity Contexts
(
Poster
)
>
link
|
Ru Wang · Selena Song · Liang Ding · Mingming Gong · Yusuke Iwasawa · Yutaka Matsuo · Jiaxian Guo
🔗
|
|
-
|
TsKAN: An Transparent Architecture for Improving the Interpretability of Multivariate Time Series Forecasting
(
Poster
)
>
link
|
Zechuan Chen · TianMing Sha · Ziyi Tang · Keze Wang
🔗
|
|
-
|
Revisiting Semi-supervised Adversarial Robustness via Noise-aware Online Robust Distillation
(
Poster
)
>
link
|
Tsung-Han Wu · Hung-Ting Su · Shang-Tse Chen · Winston Hsu
🔗
|
|
-
|
Privacy Attacks on Image AutoRegressive Models
(
Poster
)
>
link
|
Antoni Kowalczuk · Jan Dubiński · Franziska Boenisch · Adam Dziedzic
🔗
|
|
-
|
Tracing the Misuse of Personalized Textual Embeddings for Text-to-Image Models
(
Poster
)
>
link
|
Weitao Feng · Jiyan He · Jie Zhang · Tianyi Wei · Wenbo Zhou · Qing Guo · Weiming Zhang · Tianwei Zhang · Nenghai Yu
🔗
|
|
-
|
RichSpace: Enriching Text-to-Video Prompt Space via Text Embedding Interpolation
(
Poster
)
>
link
|
Yuefan Cao · Chengyue Gong · Xiaoyu Li · Yingyu Liang · Zhizhou Sha · Zhenmei Shi · Zhao Song
🔗
|
|
-
|
Be like a Goldfish, Don't Memorize! Mitigating Memorization in Generative LLMs
(
Poster
)
>
link
|
11 presenters
Abhimanyu Hans · Yuxin Wen · Neel Jain · John Kirchenbauer · Hamid Kazemi · Prajwal Singhania · Siddharth Singh · Gowthami Somepalli · Jonas Geiping · Abhinav Bhatele · Tom Goldstein
🔗
|
|
-
|
NICE: Non-Differentiable Evaluation Metric-Based Data Selection for Instruction Tuning
(
Poster
)
>
link
|
Jingtan Wang · Xiaoqiang Lin · Rui Qiao · Pang Wei Koh · Chuan Sheng Foo · Bryan Kian Hsiang Low
🔗
|
|
-
|
STAMP Your Content: Proving Dataset Membership via Watermarked Rephrasings
(
Poster
)
>
link
|
Saksham Rastogi · Pratyush Maini · Danish Pruthi
🔗
|
|
-
|
Blind Baselines Beat Membership Inference Attacks for Foundation Models
(
Poster
)
>
link
|
Debeshee Das · Jie Zhang · Florian Tramer
🔗
|
|
-
|
RepFair-QGAN: Alleviating Representation Bias in Quantum Generative Adversarial Networks Using Gradient Clipping
(
Poster
)
>
link
|
Kamil Sabbagh · Hadi Salloum · Yaroslav Kholodov
🔗
|
|
-
|
The Delta Learning Hypothesis: Preference Tuning on Weak Data Can Yield Strong Gains
(
Poster
)
>
link
|
Scott Geng · Hamish Ivison · Chun-Liang Li · Maarten Sap · Jerry Li · Ranjay Krishna · Pang Wei Koh
🔗
|
|
-
|
Diversity Measurement and Subset Selection for Instruction Tuning Datasets
(
Poster
)
>
link
|
Peiqi Wang · Yikang Shen · Gavin (Zhen) Guo · Matthew Stallone · Yoon Kim · Polina Golland · Rameswar Panda
🔗
|
|
-
|
Unstable Unlearning: The Hidden Risk of Concept Resurgence in Diffusion Models
(
Poster
)
>
link
|
Vinith Suriyakumar · Rohan Alur · Ayush Sekhari · Manish Raghavan · Ashia Wilson
🔗
|
|
-
|
Reward-Augmented Data Enhances Direct Preference Alignment of LLMs
(
Poster
)
>
link
|
Shenao Zhang · Zhihan Liu · Boyi Liu · Yufeng Zhang · Yingxiang Yang · Yongfei Liu · Liyu Chen · TAO SUN · Zhaoran Wang
🔗
|
|
-
|
Beyond ordinary Lipschitz constraints: Differentially Private optimization with TNC
(
Poster
)
>
link
|
Difei Xu · Meng Ding · Zihang Xiang · Jinhui Xu · Di Wang
🔗
|
|
-
|
Template Matters: Understanding the Role of Instruction Templates in Multimodal Language Model Evaluation and Training
(
Poster
)
>
link
|
Shijian Wang · Linxin Song · Jieyu Zhang · Ryotaro Shimizu · Ao Luo · Li Yao · Cunjian Chen · Julian McAuley · Hanqian Wu
🔗
|
|
-
|
Information-theoretic Quantification of Inherent Discrimination Bias in Training Data for Supervised Learning
(
Poster
)
>
link
SlidesLive Video
|
Sokrat Aldarmini · Mohamed Nafea
🔗
|
|
-
|
Data Mixing Can Induce Phase Transitions in Knowledge Acquisition
(
Poster
)
>
link
|
Xinran Gu · Kaifeng Lyu · Jiazheng Li · Jingzhao Zhang
🔗
|
|
-
|
Privacy Auditing for Large Language Models with Natural Identifiers
(
Poster
)
>
link
|
Lorenzo Rossi · Bartłomiej Marek · Franziska Boenisch · Adam Dziedzic
🔗
|
|
-
|
SubLIME*: Data Efficient Foundation Model Evaluation across Modalities, Languages and Benchmarks
(
Poster
)
>
link
|
Mahammad Parwez Alam · Gayathri Saranathan · Cong Xu · Javier Aula-Blasco · Martin Foltin · Tarun Kumar · Soon Wong · Suparna Bhattacharya
🔗
|
|
-
|
BenchAgents: Automated Benchmark Creation with Agent Interaction
(
Poster
)
>
link
|
Natasha Butt · Varun Chandrasekaran · Neel Joshi · Besmira Nushi · Vidhisha Balachandran
🔗
|
|
-
|
Enhancing Multilingual LLM Pretraining with Model-Based Data Selection
(
Poster
)
>
link
SlidesLive Video
|
Bettina Messmer · Vinko Sabolčec · Martin Jaggi
🔗
|
|
-
|
The surprising amount of arbitrariness in Shapley-value data valuation
(
Poster
)
>
link
|
Hannah Diehl · Ashia Wilson
🔗
|
|
-
|
Training and Evaluating Language Models with Template-based Data Generation
(
Poster
)
>
link
|
Yifan Zhang
🔗
|
|
-
|
D$^3$: A Large Dataset for Training Code Language Models to Act Diff-by-Diff
(
Poster
)
>
link
|
Ulyana Piterbarg · Kanishk Gandhi · Lerrel Pinto · Noah Goodman · Rob Fergus
🔗
|
|
-
|
Towards Internet-Scale Training For Agents
(
Poster
)
>
link
|
Brandon Trabucco · Gunnar Sigurdsson · Robinson Piramuthu · Russ Salakhutdinov
🔗
|
|
-
|
MFC-Bench: Benchmarking Multimodal Fact-Checking with Large Vision-Language Models
(
Poster
)
>
link
|
Shengkang Wang · Hongzhan Lin · Ziyang Luo · Zhen Ye · Guang Chen · Jing Ma
🔗
|
|
-
|
Improving Influence-based Instruction Tuning Data Selection for Balanced Learning of Diverse Capabilities
(
Poster
)
>
link
|
Qirun Dai · Dylan Zhang · Jiaqi Ma · Hao Peng
🔗
|
|
-
|
Common Functional Decompositions Can Mis-attribute Differences in Outcomes Between Populations
(
Poster
)
>
link
SlidesLive Video
|
Manuel Quintero · William Stephenson · Advik Shreekumar · Tamara Broderick
🔗
|
|
-
|
Aioli: A Unified Optimization Framework for Language Model Data Mixing
(
Poster
)
>
link
|
Mayee Chen · Michael Hu · Nicholas Lourie · Kyunghyun Cho · Christopher Re
🔗
|
|
-
|
Robust In-Context Learning via Multi-Armed Bandit-Based Partition Selection
(
Poster
)
>
link
|
Varul Srivastava · Sankarshan Damle · Manisha Padala
🔗
|
|
-
|
Synthesizing Physical Backdoor Datasets: An Automated Framework Leveraging Deep Generative Models
(
Poster
)
>
link
SlidesLive Video
|
Sze Jue Yang · Chinh La · Hung Quang Nguyen · Eugene Bagdasarian · Kok-Seng Wong · Anh T Tran · Chee Seng Chan · Khoa Doan
🔗
|
|
-
|
LoBAM: LoRA-Based Backdoor Attack on Model Merging
(
Poster
)
>
link
|
Ming Yin · Jingyang Zhang · Jingwei Sun · Minghong Fang · Hai Li · Yiran Chen
🔗
|
|
-
|
Context-Guided Responsible Data Augmentation with Diffusion Models
(
Poster
)
>
link
|
Khawar Islam · NAVEED AKHTAR
🔗
|
|
-
|
Context-Parametric Inversion: Why Instruction Finetuning Can Worsen Context Reliance
(
Poster
)
>
link
|
Sachin Goyal · Christina Baek · Zico Kolter · Aditi Raghunathan
🔗
|
|
-
|
Investigating Memorization in Video Diffusion Models
(
Poster
)
>
link
|
Chen Chen · Enhuai Liu · Daochang Liu · Mubarak Shah · Chang Xu
🔗
|
|
-
|
Utilizing Language Models For Synthetic Knowledge Graph Generation
(
Poster
)
>
link
|
Shuran Fu · Peihua Mai · Zhang Jingqi · Yan Pang
🔗
|
|
-
|
The Emperor's New Clothes in Benchmarking? A Rigorous Examination of Mitigation Strategies for LLM Benchmark Data Contamination
(
Poster
)
>
link
|
Yifan Sun · Han Wang · Dongbai Li · Gang Wang · Huan Zhang
🔗
|
|
-
|
Unlocking Post-hoc Dataset Inference with Synthetic Data
(
Poster
)
>
link
|
Bihe Zhao · Pratyush Maini · Franziska Boenisch · Adam Dziedzic
🔗
|
|
-
|
A Missing Testbed for LLM Pre-Training Membership Inference Attacks
(
Poster
)
>
link
|
Mingjian Jiang · Ken Liu · Sanmi Koyejo
🔗
|
|
-
|
Abg-SciQA: A dataset for Understanding and Resolving Ambiguity in Scientific Questions
(
Poster
)
>
link
SlidesLive Video
|
Tiejin Chen · Kuan-Ru Liou · Mithun Shivakoti · Aaryan Gaur · Pragya Kumari · Meiqi Guo · Hua Wei
🔗
|
|
-
|
Why Does Private Fine-Tuning Resist Differential Privacy Noise? A Representation Learning Perspective
(
Poster
)
>
link
|
Yue Zhao · Yutong Xia · Chendi Wang
🔗
|
|
-
|
Understanding Private Learning From Feature Perspective
(
Poster
)
>
link
|
Meng Ding · Mingxi Lei · Shaopeng Fu · Di Wang · Jinhui Xu
🔗
|
|
-
|
Proper Dataset Valuation by Pointwise Mutual Information
(
Poster
)
>
link
|
Shuran Zheng · Xuan Qi · Rui Chen · Yongchan Kwon · James Y Zou
🔗
|
|
-
|
Synthesizing Privacy-Preserving Text Data via Finetuning *without* Finetuning Billion-Scale LLMs
(
Poster
)
>
link
|
Bowen Tan · Zheng Xu · Eric P Xing · Zhiting Hu · Shanshan Wu
🔗
|
|
-
|
On the Power of Context-Enhanced Learning in LLMs
(
Poster
)
>
link
|
Xingyu Zhu · Abhishek Panigrahi · Sanjeev Arora
🔗
|
|
-
|
Contrastive Private Data Synthesis via Weighted Multi-PLM Fusion
(
Poster
)
>
link
|
TIANYUAN ZOU · Yang Liu · Peng Li · Yufei Xiong · Jianqing Zhang · Jingjing Liu · Ye Ouyang · Xiaozhou Ye · Yaqin Zhang
🔗
|
|
-
|
Lightweight Dataset Pruning without Full Training via Example Difficulty and Prediction Uncertainty
(
Poster
)
>
link
|
Yeseul Cho · Baekrok Shin · Changmin Kang · Chulhee Yun
🔗
|
|
-
|
How much of my dataset did you use? Quantitative Data Usage Inference in Machine Learning
(
Poster
)
>
link
|
Yao Tong · Jiayuan Ye · Sajjad Zarifzadeh · Reza Shokri
🔗
|
|
-
|
Editable Concept Bottleneck Models
(
Poster
)
>
link
|
Lijie Hu · Chenyang Ren · Zhengyu Hu · Hongbin Lin · Chenglong Wang · Zhen Tan · Weimin Lyu · Jingfeng Zhang · Hui Xiong · Di Wang
🔗
|
|
-
|
Generalizing from SIMPLE to HARD Visual Reasoning: Can We Mitigate Modality Imbalance in VLMs?
(
Poster
)
>
link
SlidesLive Video
|
Simon Park · Abhishek Panigrahi · Yun Cheng · Dingli Yu · Anirudh Goyal · Sanjeev Arora
🔗
|
|
-
|
Demystifying Long CoT Reasoning in LLMs
(
Poster
)
>
link
|
Edward Yeo · Yuxuan Tong · Xinyao Niu · Graham Neubig · Xiang Yue
🔗
|
|
-
|
Rule-Based Rating and Selection of LLM Training Data
(
Poster
)
>
link
|
Xiaomin Li · Mingye Gao · Zhiwei Zhang · Chang Yue · Hong Hu
🔗
|
|
-
|
Query-dependent Prompt Optimization via Multi-Loop Offline Reinforcement Learning
(
Poster
)
>
link
|
Yilun Kong · Hangyu Mao · Qi Zhao · Bin Zhang · Jingqing Ruan · Li Shen · Yongzhe Chang · Xueqian Wang · Rui Zhao · Dacheng Tao
🔗
|
|
-
|
ADSO: Adaptive Data Mixture & Scale Optimization. A Multi-Scale Multi-Fidelity Bayesian Optimization Approach.
(
Poster
)
>
link
|
Andrew Siah · Haozhe Chen · C. Daniel Guetta · Tianyi Peng · Hongseok Namkoong · Thomson Yen
🔗
|
|
-
|
Towards Comprehensive Preference Data Collection for Reward Modeling
(
Poster
)
>
link
|
Yulan Hu · Qingyang Li · Sheng Ouyang · Ge Chen · Jinman Zhao · Yong Liu
🔗
|
|
-
|
Towards Human-Guided, Data-Centric LLM Co-Pilots
(
Poster
)
>
link
|
Evgeny Saveliev · Jiashuo Liu · Nabeel Seedat · Anders Boyd · Mihaela van der Schaar
🔗
|
|
-
|
Improving Multimodal Large Language Models in Low-Resource Language Contexts
(
Poster
)
>
link
|
Yufei Gao · Feijiaying · Guohang Yan · Yunshi Lan
🔗
|
|
-
|
Enhancing Interpretability in Generative AI Through Search-Based Data Influence Analysis
(
Poster
)
>
link
SlidesLive Video
|
Theodoros Aivalis · Iraklis A. Klampanos · Antonis Troumpoukis · Joemon Jose
🔗
|
|
-
|
KGGen: Text To Knowledge Graph
(
Poster
)
>
link
|
Belinda Mo · Kyssen Yu · Joshua Kazdan · Proud Mpala · Lisa Yu · Chris Cundy · Charilaos Kanatsoulis · Sanmi Koyejo
🔗
|
|
-
|
Position: What's the next frontier for Data-centric AI? Data Savvy Agents!
(
Poster
)
>
link
|
Nabeel Seedat · Jiashuo Liu · Mihaela van der Schaar
🔗
|
|
-
|
Domain-Specific Benchmarking of Vision-Language Models: A Task Augmentation Framework Using Metadata
(
Poster
)
>
link
|
Tim Rädsch · Leon Mayer · Simon Pavicic · Ali Emre Kavur · Marcel Knopp · Barış Öztürk · Klaus Maier-Hein · Paul Jaeger · Fabian Isensee · Annika Reinke
🔗
|
|
-
|
Model Collapse in the Self-Consuming Chain of Diffusion Finetuning: A Novel Perspective from Quantitative Trait Modeling
(
Poster
)
>
link
SlidesLive Video
|
Youngseok Yoon · Dainong Hu · Iain Weissburg · Yao Qin · Haewon Jeong
🔗
|
|
-
|
A Versatile Influence Function for Data Attribution with Non-Decomposable Loss
(
Poster
)
>
link
|
Junwei Deng · Weijing Tang · Jiaqi Ma
🔗
|
|
-
|
PhantomWiki: On-Demand Datasets for Reasoning and Retrieval Evaluation
(
Poster
)
>
link
SlidesLive Video
|
Albert Gong · Kamilė Stankevičiūtė · Chao Wan · Anmol Kabra · Raphael Thesmar · Johann Lee · Julius Klenke · Carla Gomes · Kilian Weinberger
🔗
|