ICLR 2021 Tuesday 05/4

Timezone: Europe/Vienna

Invited Talk

Commonsense AI: Myth and Truth

Yejin Choi

1:00 AM - 2:00 AM

Despite considerable advances in deep learning, AI remains narrow and brittle. One fundamental limitation is its lack of commonsense intelligence: trivial for humans, but mysteriously hard for machines. In this talk, I'll discuss the myth and truth about commonsense AI: the blend between symbolic and neural knowledge, the continuum between knowledge and reasoning, and the interplay between reasoning and language generation.

Speaker Bio

Yejin Choi is a Brett Helsel associate professor at the Paul G. Allen School of Computer Science & Engineering at the University of Washington and also a senior research manager at AI2 overseeing the project Mosaic. Her research interests include language grounding with vision, physical and social commonsense knowledge, language generation with long-term coherence, conversational AI, and AI for social good. She is a co-recipient of the AAAI Outstanding Paper Award in 2020, a recipient of the Borg Early Career Award (BECA) in 2018, among the IEEE’s AI Top 10 to Watch in 2015, a co-recipient of the Marr Prize at ICCV 2013, and a faculty advisor for the Sounding Board team that won the inaugural Alexa Prize Challenge in 2017. Her work on detecting deceptive reviews, predicting literary success, and interpreting bias and connotation has been featured by numerous media outlets including NBC News for New York, NPR Radio, New York Times, and Bloomberg Business Week. She received her Ph.D. in Computer Science from Cornell University.

Poster

Poster Session 03

2:00 AM - 4:00 AM

75 Events in this session

Benefit of deep learning with non-convex noisy gradient descent: Provable excess risk bound and superiority to kernel methods

Taiji Suzuki · Akiyama Shunta

DeBERTa: Decoding-enhanced BERT with Disentangled Attention

Pengcheng He · Xiaodong Liu · Jianfeng Gao · Weizhu Chen

Decentralized Attribution of Generative Models

Changhoon Kim · Yi Ren · 'YZ' Yezhou Yang

Latent Skill Planning for Exploration and Transfer

Kevin Xie · Homanga Bharadhwaj · Danijar Hafner · Animesh Garg · Florian Shkurti

Learning Energy-Based Models by Diffusion Recovery Likelihood

Ruiqi Gao · Yang Song · Ben Poole · Yingnian Wu · Durk Kingma

MONGOOSE: A Learnable LSH Framework for Efficient Neural Network Training

Beidi Chen · Zichang Liu · Binghui Peng · Zhaozhuo Xu · Jonathan L Li · Tri Dao · Zhao Song · Anshumali Shrivastava · Christopher Re

Random Feature Attention

Hao Peng · Nikolaos Pappas · Dani Yogatama · Roy Schwartz · Noah Smith · Lingpeng Kong

Regularized Inverse Reinforcement Learning

Wonseok Jeon · Chen-Yang Su · Paul Barde · Thang Doan · Derek Nowrouzezahrai · Joelle Pineau

Remembering for the Right Reasons: Explanations Reduce Catastrophic Forgetting

Sayna Ebrahimi · Suzanne Petryk · Akash Gokul · William Gan · Joseph E Gonzalez · Marcus Rohrbach · Trevor Darrell

Representation Learning for Sequence Data with Deep Autoencoding Predictive Components

Junwen Bai · Weiran Wang · Yingbo Zhou · Caiming Xiong

Robust Curriculum Learning: from clean label detection to noisy label self-correction

Tianyi Zhou · Shengjie Wang · Jeff Bilmes

Sequential Density Ratio Estimation for Simultaneous Optimization of Speed and Accuracy

Akinori Ebihara · Taiki Miyagawa · Kazuyuki Sakurai · Hitoshi Imaoka

SOLAR: Sparse Orthogonal Learned and Random Embeddings

Tharun Kumar Reddy Medini · Beidi Chen · Anshumali Shrivastava

The Intrinsic Dimension of Images and Its Impact on Learning

Phil Pope · Chen Zhu · Ahmed Abdelkader · Micah Goldblum · Tom Goldstein

When does preconditioning help or hurt generalization?

Shun-ichi Amari · Jimmy Ba · Roger Grosse · Xuechen Li · Atsushi Nitanda · Taiji Suzuki · Denny Wu · Ji Xu

When Optimizing $f$-Divergence is Robust with Label Noise

Jiaheng Wei · Yang Liu

Zero-shot Synthesis with Group-Supervised Learning

Yunhao Ge · Sami Abu-El-Haija · Gan Xin · Laurent Itti

Bypassing the Ambient Dimension: Private SGD with Gradient Subspace Identification

Yingxue Zhou · Steven Wu · Arindam Banerjee

Contextual Transformation Networks for Online Continual Learning

Quang Pham · Chenghao Liu · Doyen Sahoo · Steven Hoi

Deep Partition Aggregation: Provable Defenses against General Poisoning Attacks

Alexander Levine · Soheil Feizi

DeLighT: Deep and Light-weight Transformer

Sachin Mehta · Marjan Ghazvininejad · Srini Iyer · Luke Zettlemoyer · Hannaneh Hajishirzi

Explaining the Efficacy of Counterfactually Augmented Data

Divyansh Kaushik · Amrith Setlur · Eduard H Hovy · Zachary Lipton

Federated Learning Based on Dynamic Regularization

Durmus Alp Emre Acar · Yue Zhao · Ramon Matas · Matthew Mattina · Paul Whatmough · Venkatesh Saligrama

Improved Estimation of Concentration Under $\ell_p$-Norm Distance Metrics Using Half Spaces

Jack Prescott · Xiao Zhang · David Evans

Incorporating Symmetry into Deep Dynamics Models for Improved Generalization

Rui Wang · Robin Walters · Rose Yu

Layer-adaptive Sparsity for the Magnitude-based Pruning

Jaeho Lee · Sejun Park · Sangwoo Mo · Sungsoo Ahn · Jinwoo Shin

Learning a Latent Simplex in Input Sparsity Time

Ainesh Bakshi · Chiranjib Bhattacharyya · Ravi Kannan · David Woodruff · Samson Zhou

Learning A Minimax Optimizer: A Pilot Study

Jiayi Shen · Xiaohan Chen · Howard Heaton · Tianlong Chen · Jialin Liu · Wotao Yin · Zhangyang Wang

Long Live the Lottery: The Existence of Winning Tickets in Lifelong Learning

Tianlong Chen · Zhenyu Zhang · Sijia Liu · Shiyu Chang · Zhangyang Wang

MALI: A memory efficient and reverse accurate integrator for Neural ODEs

Juntang Zhuang · Nicha C Dvornek · Sekhar Tatikonda · James S Duncan

Meta-Learning with Neural Tangent Kernels

Yufan Zhou · Zhenyi Wang · Jiayi Xian · Changyou Chen · Jinhui Xu

MixKD: Towards Efficient Distillation of Large-scale Language Models

Kevin Liang · Weituo Hao · Dinghan Shen · Yufan Zhou · Weizhu Chen · Changyou Chen · Lawrence Carin

Model Patching: Closing the Subgroup Performance Gap with Data Augmentation

Karan Goel · Albert Gu · Yixuan Li · Christopher Re

MoPro: Webly Supervised Learning with Momentum Prototypes

Junnan Li · Caiming Xiong · Steven Hoi

Neural Attention Distillation: Erasing Backdoor Triggers from Deep Neural Networks

Yige Li · Xixiang Lyu · Nodens Koren · Lingjuan Lyu · Bo Li · Xingjun Ma

Offline Model-Based Optimization via Normalized Maximum Likelihood Estimation

Justin Fu · Sergey Levine

On Dyadic Fairness: Exploring and Mitigating Bias in Graph Connections

Peizhao Li · Yifei Wang · Han Zhao · Pengyu Hong · Hongfu Liu

One Network Fits All? Modular versus Monolithic Task Formulations in Neural Networks

Atish Agarwala · Abhimanyu Das · Brendan Juba · Rina Panigrahy · Vatsal Sharan · Xin Wang · Qiuyi Zhang

On Fast Adversarial Robustness Adaptation in Model-Agnostic Meta-Learning

Ren Wang · Kaidi Xu · Sijia Liu · Pin-Yu Chen · Tsui-Wei Weng · Chuang Gan · Meng Wang

Online Adversarial Purification based on Self-supervised Learning

Changhao Shi · Chester Holtz · Gal Mishne

On the geometry of generalization and memorization in deep neural networks

Cory Stephenson · Suchismita Padhy · Abhinav Ganesh · Yue Hui · Hanlin Tang · SueYeon Chung

Optimal Regularization can Mitigate Double Descent

Preetum Nakkiran · Prayaag Venkat · Sham M Kakade · Tengyu Ma

Optimizing Memory Placement using Evolutionary Graph Reinforcement Learning

Shauharda Khadka · Estelle Aflalo · Mattias Marder · Avrech Ben-David · Santiago Miret · Shie Mannor · Tamir Hazan · Hanlin Tang · Somdeb Majumdar

Parrot: Data-Driven Behavioral Priors for Reinforcement Learning

Avi Singh · Huihan Liu · Gaoyue Zhou · Albert Yu · Nicholas Rhinehart · Sergey Levine

Partitioned Learned Bloom Filters

Kapil Vaidya · Eric Knorr · Michael Mitzenmacher · Tim Kraska

Personalized Federated Learning with First Order Model Optimization

Michael Zhang · Karan Sapra · Sanja Fidler · Serena Yeung · Jose M. Alvarez

PlasticineLab: A Soft-Body Manipulation Benchmark with Differentiable Physics

Zhiao Huang · Yuanming Hu · Tao Du · Siyuan Zhou · Hao Su · Joshua B Tenenbaum · Chuang Gan

Proximal Gradient Descent-Ascent: Variable Convergence under KŁ Geometry

Ziyi Chen · Yi Zhou · Tengyu Xu · Yingbin Liang

PseudoSeg: Designing Pseudo Labels for Semantic Segmentation

Yuliang Zou · Zizhao Zhang · Han Zhang · Chun-Liang Li · Xiao Bian · Jia-Bin Huang · Tomas Pfister

Regularization Matters in Policy Optimization - An Empirical Study on Continuous Control

Zhuang Liu · Xuanlin Li · Bingyi Kang · Trevor Darrell

Rethinking Architecture Selection in Differentiable NAS

Ruochen Wang · Minhao Cheng · Xiangning Chen · Xiaocheng Tang · Cho-Jui Hsieh

Rethinking Positional Encoding in Language Pre-training

Guolin Ke · Di He · Tie-Yan Liu

Robust and Generalizable Visual Representation Learning via Random Convolutions

Zhenlin Xu · Deyi Liu · Junlin Yang · Colin Raffel · Marc Niethammer

Robust Reinforcement Learning on State Observations with Learned Optimal Adversary

Huan Zhang · Hongge Chen · Duane S Boning · Cho-Jui Hsieh

SAFENet: A Secure, Accurate and Fast Neural Network Inference

Qian Lou · Yilin Shen · Hongxia Jin · Lei Jiang

Score-Based Generative Modeling through Stochastic Differential Equations

Yang Song · Jascha Sohl-Dickstein · Durk Kingma · Abhishek Kumar · Stefano Ermon · Ben Poole

SCoRe: Pre-Training for Context Representation in Conversational Semantic Parsing

Tao Yu · Rui Zhang · Alex Polozov · Christopher Meek · Ahmed H Awadallah

Selective Classification Can Magnify Disparities Across Groups

Erik Jones · Shiori Sagawa · Pang Wei Koh · Ananya Kumar · Percy Liang

Self-training For Few-shot Transfer Across Extreme Task Differences

Cheng Perng Phoo · Bharath Hariharan

Semi-supervised Keypoint Localization

Olga Moskvyak · Frederic Maire · Feras Dayoub · Mahsa Baktashmotlagh

SenSeI: Sensitive Set Invariance for Enforcing Individual Fairness

Mikhail Yurochkin · Yuekai Sun

Spatio-Temporal Graph Scattering Transform

Chao Pan · Siheng Chen · Antonio Ortega

Taking Notes on the Fly Helps Language Pre-Training

Qiyu Wu · Chen Xing · Yatao Li · Guolin Ke · Di He · Tie-Yan Liu

The Deep Bootstrap Framework: Good Online Learners are Good Offline Generalizers

Preetum Nakkiran · Behnam Neyshabur · Hanie Sedghi

The Role of Momentum Parameters in the Optimal Convergence of Adaptive Polyak's Heavy-ball Methods

Wei Tao · Sheng Long · Gaowei Wu · Qing Tao

Tilted Empirical Risk Minimization

Tian Li · Ahmad Beirami · Maziar Sanjabi · Virginia Smith

Undistillable: Making A Nasty Teacher That CANNOT teach students

Haoyu Ma · Tianlong Chen · Ting-Kuei Hu · Chenyu You · Xiaohui Xie · Zhangyang Wang

Unlearnable Examples: Making Personal Data Unexploitable

Hanxun Huang · Xingjun Ma · Sarah Erfani · James Bailey · Yisen Wang

UPDeT: Universal Multi-agent RL via Policy Decoupling with Transformers

Siyi Hu · Fengda Zhu · Xiaojun Chang · Xiaodan Liang

VA-RED$^2$: Video Adaptive Redundancy Reduction

Bowen Pan · Rameswar Panda · Camilo L Fosco · Chung-Ching Lin · Alex J Andonian · Yue Meng · Kate Saenko · Aude Oliva · Rogerio Feris

Variational Intrinsic Control Revisited

Taehwan Kwon

Very Deep VAEs Generalize Autoregressive Models and Can Outperform Them on Images

Rewon Child

WaveGrad: Estimating Gradients for Waveform Generation

Nanxin Chen · Yu Zhang · Heiga Zen · Ron Weiss · Mohammad Norouzi · William Chan

What are the Statistical Limits of Offline RL with Linear Function Approximation?

Ruosong Wang · Dean Foster · Sham M Kakade

Why resampling outperforms reweighting for correcting sampling bias with stochastic gradients

Jing An · Lexing Ying · Yuhua Zhu

Oral

Oral Session 3

4:00 AM - 7:16 AM

17 Events in this session

SMiRL: Surprise Minimizing Reinforcement Learning in Unstable Environments

Glen Berseth · Daniel Geng · Coline M Devin · Nicholas Rhinehart · Chelsea Finn · Dinesh Jayaraman · Sergey Levine

Contrastive Explanations for Reinforcement Learning via Embedded Self Predictions

Zhengxian Lin · Kin-Ho Lam · Alan Fern

Parrot: Data-Driven Behavioral Priors for Reinforcement Learning

Avi Singh · Huihan Liu · Gaoyue Zhou · Albert Yu · Nicholas Rhinehart · Sergey Levine

Structured Prediction as Translation between Augmented Natural Languages

Giovanni Paolini · Ben Athiwaratkun · Jason Krone · Jie Ma · Alessandro Achille · Rishita Anubhai · Cicero Nogueira dos Santos · Bing Xiang · Stefano Soatto

Mathematical Reasoning via Self-supervised Skip-tree Training

Markus Rabe · Dennis Lee · Kshitij Bansal · Christian Szegedy

Q&A

Improving Adversarial Robustness via Channel-wise Activation Suppressing

Yang Bai · Yuyuan Zeng · Yong Jiang · Shu-Tao Xia · Xingjun Ma · Yisen Wang

Fast Geometric Projections for Local Robustness Certification

Aymeric Fromherz · Klas Leino · Matt Fredrikson · Bryan Parno · Corina Pasareanu

Information Laundering for Model Privacy

Xinran Wang · Yu Xiang · Jun Gao · Jie Ding

Dataset Inference: Ownership Resolution in Machine Learning

Pratyush Maini · Mohammad Yaghini · Nicolas Papernot

HW-NAS-Bench: Hardware-Aware Neural Architecture Search Benchmark

Chaojian Li · Zhongzhi Yu · Yonggan Fu · Yongan Zhang · Yang Zhao · Haoran You · Qixuan Yu · Yue Wang · Cong Hao · Yingyan Lin

Q&A

How Neural Networks Extrapolate: From Feedforward to Graph Neural Networks

Keyulu Xu · Mozhi Zhang · Jingling Li · Simon Du · Ken-Ichi Kawarabayashi · Stefanie Jegelka

Graph Convolution with Low-rank Learnable Local Filters

Xiuyuan Cheng · Zichen Miao · Qiang Qiu

The Traveling Observer Model: Multi-task Learning Through Spatial Variable Embeddings

Elliot Meyerson · Risto Miikkulainen

Meta-GMVAE: Mixture of Gaussian VAE for Unsupervised Meta-Learning

Dong Bok Lee · Dongchan Min · Seanie Lee · Sung Ju Hwang

Q&A

Invited Talk

Geometric Deep Learning: the Erlangen Programme of ML

Michael Bronstein

9:00 AM - 10:00 AM

For nearly two millennia, the word "geometry" was synonymous with Euclidean geometry, as no other types of geometry existed. Euclid's monopoly came to an end in the 19th century when multiple examples of non-Euclidean geometries were constructed. However, these studies quickly diverged into disparate fields, with mathematicians debating the relations between different geometries and what defines one. A way out of this pickle was shown by Felix Klein in his Erlangen Programme, which proposed approaching geometry as the study of invariants or symmetries using the language of group theory. In the 20th century, these ideas have been fundamental in developing modern physics, culminating in the Standard Model.

The current state of deep learning somewhat resembles the situation in the field of geometry in the 19th century: On the one hand, in the past decade deep learning has brought a revolution in data science and made possible many tasks previously thought to be beyond reach -- including computer vision, playing Go, or protein folding. At the same time, we have a zoo of neural network architectures for various kinds of data, but few unifying principles. As in times past, it is difficult to understand the relations between different methods, inevitably resulting in the reinvention and re-branding of the same concepts.

Geometric Deep Learning aims to bring geometric unification to deep learning in the spirit of the Erlangen Programme. Such an endeavour serves a dual purpose: it provides a common mathematical framework to study the most successful neural network architectures, such as CNNs, RNNs, GNNs, and Transformers, and gives a constructive procedure to incorporate prior knowledge into neural networks and build future architectures in a principled way. In this talk, I will overview the mathematical principles underlying Geometric Deep Learning on grids, graphs, and manifolds, and show some of the exciting and groundbreaking applications of these methods in a broad range of domains.

(based on joint work with J. Bruna, T. Cohen, and P. Veličković)

Speaker Bio

Michael Bronstein is a professor at Imperial College London, where he holds the Chair in Machine Learning and Pattern Recognition, and Head of Graph Learning Research at Twitter. He also heads ML research in Project CETI, a TED Audacious Prize-winning collaboration aimed at understanding the communication of sperm whales. Michael received his PhD from the Technion in 2007. He has held visiting appointments at Stanford, MIT, and Harvard, and has also been affiliated with the Institute for Advanced Study at TUM (as a Rudolf Diesel Fellow, 2017-2019) and Harvard (as a Radcliffe fellow, 2017-2018). Michael is the recipient of the Royal Society Wolfson Research Merit Award, Royal Academy of Engineering Silver Medal, five ERC grants, two Google Faculty Research Awards, and two Amazon AWS ML Research Awards. He is a Member of the Academia Europaea, Fellow of IEEE, IAPR, BCS, and ELLIS, ACM Distinguished Speaker, and World Economic Forum Young Scientist. In addition to his academic career, Michael is a serial entrepreneur and founder of multiple startup companies, including Novafora, Invision (acquired by Intel in 2012), Videocites, and Fabula AI (acquired by Twitter in 2019). He has previously served as Principal Engineer at Intel Perceptual Computing and was one of the key developers of the Intel RealSense technology.

Poster

Poster Session 04

10:00 AM - 12:00 PM

67 Events in this session

A Distributional Approach to Controlled Text Generation

Muhammad Khalifa · Hady Elsahar · Marc Dymetman

Bayesian Context Aggregation for Neural Processes

Michael Volpp · Fabian Flürenbrock · Lukas Grossberger · Christian Daniel · Gerhard Neumann

Capturing Label Characteristics in VAEs

Tom Joy · Sebastian Schmon · Philip Torr · Siddharth N · Tom Rainforth

Class Normalization for (Continual)? Generalized Zero-Shot Learning

Ivan Skorokhodov · Mohamed Elhoseiny

Complex Query Answering with Neural Link Predictors

Erik Arakelyan · Daniel Daza · Pasquale Minervini · Michael Cochez

Contemplating Real-World Object Classification

Ali Borji

Disambiguating Symbolic Expressions in Informal Documents

Dennis Müller · Cezary Kaliszyk

Do 2D GANs Know 3D Shape? Unsupervised 3D Shape Reconstruction from 2D Image GANs

Xingang Pan · Bo Dai · Ziwei Liu · Chen Change Loy · Ping Luo

Effective Abstract Reasoning with Dual-Contrast Network

Tao Zhuo · Mohan Kankanhalli

Efficient Certified Defenses Against Patch Attacks on Image Classifiers

Jan Hendrik Metzen · Maksym Yatsura

Exemplary Natural Images Explain CNN Activations Better than State-of-the-Art Feature Visualization

Judith Borowski · Roland Zimmermann · Judith Schepers · Robert Geirhos · Thomas S Wallis · Matthias Bethge · Wieland Brendel

Generalization in data-driven models of primary visual cortex

Konstantin-Klemens Lurz · Mohammad Bashiri · Konstantin Willeke · Akshay Jagadish · Eric Wang · Edgar Walker · Santiago Cadena · Taliah Muhammad · Erick M Cobos · Andreas Tolias · Alexander S Ecker · Fabian Sinz

Generalized Multimodal ELBO

Thomas Sutter · Imant Daunhawer · Julia E Vogt

Group Equivariant Stand-Alone Self-Attention For Vision

David W. Romero · Jean-Baptiste Cordonnier

Identifying nonlinear dynamical systems with multiple time scales and long-range dependencies

Dominik Schmidt · Georgia Koppe · Zahra Monfared · Max Beutelspacher · Daniel Durstewitz

Large-width functional asymptotics for deep Gaussian neural networks

Daniele Bracale · Stefano Favaro · Sandra Fortini · Stefano Peluchetti

Lossless Compression of Structured Convolutional Models via Lifting

Gustav Sourek · Filip Zelezny · Ondrej Kuzelka

MiCE: Mixture of Contrastive Experts for Unsupervised Image Clustering

Tsung Wei Tsai · Chongxuan Li · Jun Zhu

PDE-Driven Spatiotemporal Disentanglement

Jérémie Dona · Jean-Yves Franceschi · Sylvain Lamprier · Patrick Gallinari

Policy-Driven Attack: Learning to Query for Hard-label Black-box Adversarial Examples

Ziang Yan · Yiwen Guo · Jian Liang · Changshui Zhang

Probing BERT in Hyperbolic Spaces

Boli Chen · Yao Fu · Guangwei Xu · Pengjun Xie · Chuanqi Tan · Mosha Chen · Liping Jing

Sample-Efficient Automated Deep Reinforcement Learning

Jörg Franke · Gregor Koehler · André Biedenkapp · Frank Hutter

Scaling the Convex Barrier with Active Sets

Alessandro De Palma · Harkirat Singh Behl · Rudy R Bunel · Philip Torr · M. Pawan Kumar

SkipW: Resource Adaptable RNN with Strict Upper Computational Limit

Tsiry Mayet · Anne Lambert · Pascal Le Guyadec · Francoise Le Bolzer · François Schnitzler

A Block Minifloat Representation for Training Deep Neural Networks

Sean Fox · Seyedramin Rasoulinezhad · Julian Faraone · David Boland · Philip Leong

Accurate Learning of Graph Representations with Graph Multiset Pooling

Jinheon Baek · Minki Kang · Sung Ju Hwang

Activation-level uncertainty in deep neural networks

Pablo Morales-Alvarez · Daniel Hernández-Lobato · Rafael Molina · José Miguel Hernández Lobato

AdaGCN: Adaboosting Graph Convolutional Networks into Deep Models

Ke Sun · Zhanxing Zhu · Zhouchen Lin

A Trainable Optimal Transport Embedding for Feature Aggregation and its Relationship to Attention

Grégoire Mialon · Dexiong Chen · Alexandre d'Aspremont · Julien Mairal

A Universal Representation Transformer Layer for Few-Shot Image Classification

Lu Liu · William Hamilton · Guodong Long · Jing Jiang · Hugo Larochelle

Auxiliary Learning by Implicit Differentiation

Aviv Navon · Idan Achituve · Haggai Maron · Gal Chechik · Ethan Fetaya

BOIL: Towards Representation Change for Few-shot Learning

Jaehoon Oh · Hyungjun Yoo · ChangHwan Kim · Se-Young Yun

Calibration tests beyond classification

David Widmann · Fredrik Lindsten · Dave Zachariah

Computational Separation Between Convolutional and Fully-Connected Networks

Eran Malach · Shai Shalev-Shwartz

Conformation-Guided Molecular Representation with Hamiltonian Neural Networks

Ziyao Li · Shuwen Yang · Guojie Song · Lingsheng Cai

Coping with Label Shift via Distributionally Robust Optimisation

Jingzhao Zhang · Aditya Krishna Menon · Andreas Veit · Srinadh Bhojanapalli · Sanjiv Kumar · Suvrit Sra

Deep Repulsive Clustering of Ordered Data Based on Order-Identity Decomposition

Seon-Ho Lee · Chang-Su Kim

Distributed Momentum for Byzantine-resilient Stochastic Gradient Descent

El Mahdi El Mhamdi · Rachid Guerraoui · Sébastien Rouault

FedMix: Approximation of Mixup under Mean Augmented Federated Learning

Tehrim Yoon · Sumin Shin · Sung Ju Hwang · Eunho Yang

Generalized Energy Based Models

Michael Arbel · Liang Zhou · Arthur Gretton

GraphCodeBERT: Pre-training Code Representations with Data Flow

Daya Guo · Shuo Ren · Shuai Lu · Zhangyin Feng · Duyu Tang · Shujie Liu · Long Zhou · Nan Duan · Alexey Svyatkovskiy · Shengyu Fu · Michele Tufano · Shao Kun Deng · Colin Clement · Dawn Drain · Neel Sundaresan · Jian Yin · Daxin Jiang · Ming Zhou

Group Equivariant Conditional Neural Processes

Makoto Kawano · Wataru Kumagai · Akiyoshi Sannai · Yusuke Iwasawa · Yutaka Matsuo

Hyperbolic Neural Networks++

Ryohei Shimizu · Yusuke Mukuta · Tatsuya Harada

IDF++: Analyzing and Improving Integer Discrete Flows for Lossless Compression

Rianne van den Berg · Alexey Gritsenko · Mostafa Dehghani · Casper Sønderby · Tim Salimans

Image GANs meet Differentiable Rendering for Inverse Graphics and Interpretable 3D Neural Rendering

Yuxuan Zhang · Wenzheng Chen · Huan Ling · Jun Gao · Yinan Zhang · Antonio Torralba · Sanja Fidler

Improving Transformation Invariance in Contrastive Representation Learning

Adam Foster · Rattana Pukdee · Tom Rainforth

Intraclass clustering: an implicit learning ability that regularizes DNNs

Simon Carbonnelle · Christophe De Vleeschouwer

Learning Accurate Entropy Model with Global Reference for Image Compression

Yichen Qian · Zhiyu Tan · Xiuyu Sun · Ming Lin · Dongyang Li · Zhenhong Sun · Li Hao · Rong Jin

Learning Better Structured Representations Using Low-rank Adaptive Label Smoothing

Asish Ghoshal · Xilun Chen · Sonal Gupta · Luke Zettlemoyer · Yashar Mehdad

Learning Incompressible Fluid Dynamics from Scratch - Towards Fast, Differentiable Fluid Models that Generalize

Nils Wandel · Michael Weinmann · Reinhard Klein

Learning Subgoal Representations with Slow Dynamics

Siyuan Li · Lulu Zheng · Jianhao Wang · Chongjie Zhang

Learning the Pareto Front with Hypernetworks

Aviv Navon · Aviv Shamsian · Ethan Fetaya · Gal Chechik

Model-based micro-data reinforcement learning: what are the crucial model properties and which model to choose?

Balázs Kégl · Gabriel Hurtado · Albert Thomas

Monte-Carlo Planning and Learning with Language Action Value Estimates

Youngsoo Jang · Seokin Seo · Jongmin Lee · Kee-Eung Kim

Multiscale Score Matching for Out-of-Distribution Detection

Ahsan Mahmood · Junier Oliva · Martin A Styner

Neurally Augmented ALISTA

Freya Behrens · Jonathan Sauder · Peter Jung

not-MIWAE: Deep Generative Modelling with Missing not at Random Data

Niels Ipsen · Pierre-Alexandre Mattei · Jes Frellsen

On Self-Supervised Image Representations for GAN Evaluation

Stanislav Morozov · Andrey Voynov · Artem Babenko

Practical Real Time Recurrent Learning with a Sparse Approximation

Jacob Menick · Erich Elsen · Utku Evci · Simon Osindero · Karen Simonyan · Alex Graves

Prediction and generalisation over directed actions by grid cells

Changmin Yu · Timothy Behrens · Neil Burgess

Prototypical Contrastive Learning of Unsupervised Representations

Junnan Li · Pan Zhou · Caiming Xiong · Steven Hoi

Rao-Blackwellizing the Straight-Through Gumbel-Softmax Gradient Estimator

Max B Paulus · Chris Maddison · Andreas Krause

Refining Deep Generative Models via Discriminator Gradient Flow

Abdul Fatir Ansari · Ming Liang Ang · Harold Soh

Risk-Averse Offline Reinforcement Learning

Núria Armengol Urpí · Sebastian Curi · Andreas Krause

Spatial Dependency Networks: Neural Layers for Improved Generative Image Modeling

Đorđe Miladinović · Aleksandar Stanić · Stefan Bauer · Jürgen Schmidhuber · Joachim M Buhmann

What they do when in doubt: a study of inductive biases in seq2seq learners

Kharitonov Eugene · Rahma Chaabouni

Winning the L2RPN Challenge: Power Grid Management via Semi-Markov Afterstate Actor-Critic

Deunsol Yoon · Sunghoon Hong · Byung-Jun Lee · Kee-Eung Kim

Oral

Oral Session 4

12:00 PM - 2:58 PM

16 Events in this session

End-to-end Adversarial Text-to-Speech

Jeff Donahue · Sander Dieleman · Mikolaj Binkowski · Erich Elsen · Karen Simonyan

Autoregressive Entity Retrieval

Nicola De Cao · Gautier Izacard · Sebastian Riedel · Fabio Petroni

Interpreting Graph Neural Networks for NLP With Differentiable Edge Masking

Michael Schlichtkrull · Nicola De Cao · Ivan Titov

Expressive Power of Invariant and Equivariant Graph Neural Networks

Waïss Azizian · Marc Lelarge

Gauge Equivariant Mesh CNNs: Anisotropic convolutions on geometric graphs

Pim De Haan · Maurice Weiler · Taco Cohen · Max Welling

Q&A

Rao-Blackwellizing the Straight-Through Gumbel-Softmax Gradient Estimator

Max B Paulus · Chris Maddison · Andreas Krause

Scalable Learning and MAP Inference for Nonsymmetric Determinantal Point Processes

Mike Gartrell · Insu Han · Elvis Dohmatob · Jennifer Gillenwater · Victor-Emmanuel Brunel

Multivariate Probabilistic Time Series Forecasting via Conditioned Normalizing Flows

Kashif Rasul · Abdul-Saboor Sheikh · Ingmar Schuster · Urs Bergmann · Roland Vollgraf

Noise against noise: stochastic label noise helps combat inherent label noise

Pengfei Chen · Guangyong Chen · Junjie Ye · Jingwei Zhao · Pheng-Ann Heng

Q&A

Mutual Information State Intrinsic Control

Rui Zhao · Yang Gao · Pieter Abbeel · Volker Tresp · Wei Xu

Learning Incompressible Fluid Dynamics from Scratch - Towards Fast, Differentiable Fluid Models that Generalize

Nils Wandel · Michael Weinmann · Reinhard Klein

Identifying nonlinear dynamical systems with multiple time scales and long-range dependencies

Dominik Schmidt · Georgia Koppe · Zahra Monfared · Max Beutelspacher · Daniel Durstewitz

Fidelity-based Deep Adiabatic Scheduling

Eli Ovits · Lior Wolf

Q&A

Invited Talk

AI in Finance: Scope and Examples

Manuela Veloso

5:00 PM - 6:00 PM

AI enables principled representation of knowledge, complex strategy optimization, learning from data, and support to human decision making. I will present examples and discuss the scope of AI in our research in the finance domain.

Speaker Bio

Manuela M. Veloso is the Head of J.P. Morgan AI Research, which pursues fundamental research in areas of core relevance to financial services, including data mining and cryptography, machine learning, explainability, and human-AI interaction. J.P. Morgan AI Research partners with applied data analytics teams across the firm as well as with leading academic institutions globally. Professor Veloso is on leave from Carnegie Mellon University as the Herbert A. Simon University Professor in the School of Computer Science, and the past Head of the Machine Learning Department. With her students, she has led research in AI, with a focus on robotics and machine learning, having concretely researched and developed a variety of autonomous robots, including teams of soccer robots and mobile service robots. Her robot soccer teams have been RoboCup world champions several times, and the CoBot mobile robots have autonomously navigated for more than 1,000 km in university buildings. Professor Veloso is the Past President of AAAI (the Association for the Advancement of Artificial Intelligence), and the co-founder, Trustee, and Past President of RoboCup. Professor Veloso has been recognized with multiple honors, including being a Fellow of the ACM, IEEE, AAAS, and AAAI. She is the recipient of several best paper awards, the Einstein Chair of the Chinese Academy of Science, the ACM/SIGART Autonomous Agents Research Award, an NSF Career Award, and the Allen Newell Medal for Excellence in Research. Professor Veloso earned Bachelor and Master of Science degrees in Electrical and Computer Engineering from Instituto Superior Tecnico in Lisbon, Portugal, a Master of Arts in Computer Science from Boston University, and a Master of Science and PhD in Computer Science from Carnegie Mellon University. See www.cs.cmu.edu/~mmv/Veloso.html for her scientific publications.

Poster

Poster Session 05

6:00 PM - 8:00 PM

79 Events in this session

Are wider nets better given the same number of parameters?

Anna Golubeva · Guy Gur-Ari · Behnam Neyshabur

DC3: A learning method for optimization with hard constraints

Priya Donti · David Rolnick · Zico Kolter

DINO: A Conditional Energy-Based GAN for Domain Translation

Konstantinos Vougioukas · Stavros Petridis · Maja Pantic

Direction Matters: On the Implicit Bias of Stochastic Gradient Descent with Moderate Learning Rate

Jingfeng Wu · Difan Zou · Vladimir Braverman · Quanquan Gu

Large Associative Memory Problem in Neurobiology and Machine Learning

Dmitry Krotov · John J Hopfield

Quantifying Differences in Reward Functions

Adam Gleave · Michael Dennis · Shane Legg · Stuart Russell · Jan Leike

Rank the Episodes: A Simple Approach for Exploration in Procedurally-Generated Environments

Daochen Zha · Wenye Ma · Lei Yuan · Xia Hu · Ji Liu

Recurrent Independent Mechanisms

Anirudh Goyal · Alex Lamb · Jordan Hoffmann · Shagun Sodhani · Sergey Levine · Yoshua Bengio · Bernhard Schoelkopf

Reducing the Computational Cost of Deep Generative Models with Binary Neural Networks

Thomas Bird · Friso Kingma · David Barber

Representation Learning via Invariant Causal Mechanisms

Jovana Mitrovic · Brian McWilliams · Jacob C Walker · Lars Buesing · Charles Blundell

Robust Pruning at Initialization

Soufiane Hayou · Jean-Francois Ton · Arnaud Doucet · Yee Whye Teh

Shape or Texture: Understanding Discriminative Features in CNNs

Md Amirul Islam · Matthew Kowal · Patrick Esser · Sen Jia · Björn Ommer · Kosta Derpanis · Neil Bruce

Sharper Generalization Bounds for Learning with Gradient-dominated Objective Functions

Yunwen Lei · Yiming Ying

Systematic generalisation with group invariant predictions

Faruk Ahmed · Yoshua Bengio · Harm van Seijen · Aaron Courville

Tent: Fully Test-Time Adaptation by Entropy Minimization

Dequan Wang · Evan Shelhamer · Shaoteng Liu · Bruno Olshausen · Trevor Darrell

Text Generation by Learning from Demonstrations

Richard Yuanzhe Pang · He He

The geometry of integration in text classification RNNs

Kyle Aitken · Vinay Ramasesh · Ankush Garg · Yuan Cao · David Sussillo · Niru Maheswaranathan

Towards Resolving the Implicit Bias of Gradient Descent for Matrix Factorization: Greedy Low-Rank Learning

Zhiyuan Li · Yuping Luo · Kaifeng Lyu

Unsupervised Object Keypoint Learning using Local Spatial Predictability

Anand Gopalakrishnan · Sjoerd van Steenkiste · Jürgen Schmidhuber

VAEBM: A Symbiosis between Variational Autoencoders and Energy-based Models

Zhisheng Xiao · Karsten Kreis · Jan Kautz · Arash Vahdat

Anatomy of Catastrophic Forgetting: Hidden Representations and Task Semantics

Vinay Ramasesh · Ethan Dyer · Maithra Raghu

Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval

Lee Xiong · Chenyan Xiong · Ye Li · Kwok-Fung Tang · Jialin Liu · Paul N Bennett · Junaid Ahmed · Arnold Overwijk

Auction Learning as a Two-Player Game

Jad Rahme · Samy Jelassi · S. M Weinberg

Characterizing signal propagation to close the performance gap in unnormalized ResNets

Andrew Brock · Soham De · Samuel Smith

Clairvoyance: A Pipeline Toolkit for Medical Time Series

Daniel Jarrett · Jinsung Yoon · Ioana Bica · Zhaozhi Qian · Ari Ercole · Mihaela van der Schaar

C-Learning: Horizon-Aware Cumulative Accessibility Estimation

Panteha Naderian · Gabriel Loaiza-Ganem · Harry Braviner · Anthony Caterini · Jesse C Cresswell · Tong Li · Animesh Garg

Data-Efficient Reinforcement Learning with Self-Predictive Representations

Max Schwarzer · Ankesh Anand · Rishab Goel · R Devon Hjelm · Aaron Courville · Philip Bachman

Decoupling Global and Local Representations via Invertible Generative Flows

Xuezhe Ma · Xiang Kong · Shanghang Zhang · Eduard H Hovy

DialoGraph: Incorporating Interpretable Strategy-Graph Networks into Negotiation Dialogues

Rishabh Joshi · Vidhisha Balachandran · Shikhar Vashishth · Alan Black · Yulia Tsvetkov

Discovering a set of policies for the worst case reward

Tom Zahavy · Andre Barreto · Daniel J Mankowitz · Shaobo Hou · Brendan O'Donoghue · Iurii Kemaev · Satinder Singh

Distance-Based Regularisation of Deep Networks for Fine-Tuning

Henry Gouk · Timothy Hospedales · Massimiliano Pontil

FairBatch: Batch Selection for Model Fairness

Yuji Roh · Kangwook Lee · Steven Whang · Changho Suh

Fair Mixup: Fairness via Interpolation

Ching-Yao Chuang · Youssef Mroueh

Getting a CLUE: A Method for Explaining Uncertainty Estimates

Javier Antorán · Umang Bhatt · Tameem Adel · Adrian Weller · José Miguel Hernández Lobato

Global optimality of softmax policy gradient with single hidden layer neural networks in the mean-field regime

Andrea Agazzi · Jianfeng Lu

How Benign is Benign Overfitting?

Amartya Sanyal · Puneet Dokania · Varun Kanade · Philip Torr

Interpreting Knowledge Graph Relation Representation from Word Embeddings

Carl Allen · Ivana Balazevic · Timothy Hospedales

Iterative Empirical Game Solving via Single Policy Best Response

Max Smith · Thomas Anthony · Michael Wellman

Learning a Latent Search Space for Routing Problems using Variational Autoencoders

André Hottung · Bhanu Bhandari · Kevin Tierney

Learning from Protein Structure with Geometric Vector Perceptrons

Bowen Jing · Stephan Eismann · Patricia Suriana · Raphael J Townshend · Ron Dror

Learning Neural Event Functions for Ordinary Differential Equations

Tian Qi Chen · Brandon Amos · Maximilian Nickel

Learning Parametrised Graph Shift Operators

George Dasoulas · Johannes Lutzeyer · Michalis Vazirgiannis

Learning Robust State Abstractions for Hidden-Parameter Block MDPs

Amy Zhang · Shagun Sodhani · Khimya Khetarpal · Joelle Pineau

Learning Value Functions in Deep Policy Gradients using Residual Variance

Yannis Flet-Berliac · Reda Ouhamma · Odalric-Ambrym Maillard · Philippe Preux

Mapping the Timescale Organization of Neural Language Models

Hsiang-Yun Sherry Chien · Jinhan Zhang · Christopher Honey

Meta-learning Symmetries by Reparameterization

Allan Zhou · Tom Knowles · Chelsea Finn

Nearest Neighbor Machine Translation

Urvashi Khandelwal · Angela Fan · Dan Jurafsky · Luke Zettlemoyer · Mike Lewis

NOVAS: Non-convex Optimization via Adaptive Stochastic Search for End-to-end Learning and Control

Ioannis Exarchos · Marcus A Pereira · Ziyi Wang · Evangelos Theodorou

On the Dynamics of Training Attention Models

Haoye Lu · Yongyi Mao · Amiya Nayak

On the mapping between Hopfield networks and Restricted Boltzmann Machines

Matthew Smart · Anton Zilman

On the Origin of Implicit Regularization in Stochastic Gradient Descent

Samuel Smith · Benoit Dherin · David Barrett · Soham De

On the Theory of Implicit Deep Learning: Global Convergence with Implicit Layers

Kenji Kawaguchi

Physics-aware, probabilistic model order reduction with guaranteed stability

Sebastian Kaltenbach · PS Koutsourelakis

Provable Rich Observation Reinforcement Learning with Combinatorial Latent States

Dipendra Kumar Misra · Qinghua Liu · Chi Jin · John Langford

Reinforcement Learning with Random Delays

Yann Bouteiller · Simon Ramstedt · Giovanni Beltrame · Chris J Pal · Jonathan Binas

Rethinking Attention with Performers

Krzysztof Choromanski · Valerii Likhosherstov · David Dohan · Xingyou Song · Georgiana-Andreea Gane · Tamas Sarlos · Peter Hawkins · Jared Q Davis · Afroz Mohiuddin · Lukasz Kaiser · David Belanger · Lucy J Colwell · Adrian Weller

Ringing ReLUs: Harmonic Distortion Analysis of Nonlinear Feedforward Networks

Christian Ali Mehmeti-Göpel · David Hartmann · Michael Wand

Scalable Bayesian Inverse Reinforcement Learning

Alex Chan · Mihaela van der Schaar

Self-Supervised Learning of Compressed Video Representations

Youngjae Yu · Sangho Lee · Gunhee Kim · Yale Song

Single-Timescale Actor-Critic Provably Finds Globally Optimal Policy

Zuyue Fu · Zhuoran Yang · Zhaoran Wang

SMiRL: Surprise Minimizing Reinforcement Learning in Unstable Environments

Glen Berseth · Daniel Geng · Coline M Devin · Nicholas Rhinehart · Chelsea Finn · Dinesh Jayaraman · Sergey Levine

SSD: A Unified Framework for Self-Supervised Outlier Detection

Vikash Sehwag · Mung Chiang · Prateek Mittal

Statistical inference for individual fairness

Subha Maity · Songkai Xue · Mikhail Yurochkin · Yuekai Sun

Supervised Contrastive Learning for Pre-trained Language Model Fine-tuning

Beliz Gunel · Jingfei Du · Alexis Conneau · Veselin Stoyanov

Support-set bottlenecks for video-text representation learning

Mandela Patrick · Po-Yao Huang · Yuki Asano · Florian Metze · Alexander G Hauptmann · Joao F. Henriques · Andrea Vedaldi

Taming GANs with Lookahead-Minmax

Tatjana Chavdarova · Matteo Pagliardini · Sebastian Stich · François Fleuret · Martin Jaggi

Teaching with Commentaries

Aniruddh Raghu · Maithra Raghu · Simon Kornblith · David Duvenaud · Geoffrey Hinton

Towards Faster and Stabilized GAN Training for High-fidelity Few-shot Image Synthesis

Bingchen Liu · Yizhe Zhu · Kunpeng Song · Ahmed Elgammal

Tradeoffs in Data Augmentation: An Empirical Study

Raphael Gontijo Lopes · Sylvia Smullin · Ekin Cubuk · Ethan Dyer

Training BatchNorm and Only BatchNorm: On the Expressive Power of Random Features in CNNs

Jonathan Frankle · David J Schwab · Ari Morcos

Transformer protein language models are unsupervised structure learners

Roshan Rao · Joshua Meier · Tom Sercu · Sergey Ovchinnikov · Alexander Rives

Transient Non-stationarity and Generalisation in Deep Reinforcement Learning

Maximilian Igl · Gregory Farquhar · Jelena Luketina · Wendelin Boehmer · Shimon Whiteson

UMEC: Unified model and embedding compression for efficient recommendation systems

Jiayi Shen · Haotao Wang · Shupeng Gui · Jianchao Tan · Zhangyang Wang · Ji Liu

Uncertainty-aware Active Learning for Optimal Bayesian Classifier

Guang Zhao · Edward Dougherty · Byung-Jun Yoon · Francis Alexander · Xiaoning Qian

Uncertainty Estimation in Autoregressive Structured Prediction

Andrey Malinin · Mark Gales

Understanding Over-parameterization in Generative Adversarial Networks

Yogesh Balaji · Mohammadmahdi Sajedi · Neha Kalibhat · Mucong Ding · Dominik Stöger · Mahdi Soltanolkotabi · Soheil Feizi

Unsupervised Representation Learning for Time Series with Temporal Neighborhood Coding

Sana Tonekaboni · Danny Eytan · Anna Goldenberg

VTNet: Visual Transformer Network for Object Goal Navigation

Heming Du · Xin Yu · Liang Zheng

Vulnerability-Aware Poisoning Mechanism for Online RL with Unknown Dynamics

Yanchao Sun · Da Huo · Furong Huang

Oral

Oral Session 5

8:00 PM - 10:56 PM

15 Events in this session

Iterated learning for emergent systematicity in VQA

Ankit Vani · Max Schwarzer · Yuchen Lu · Eeshan Dhekane · Aaron Courville

Learning Generalizable Visual Representations via Interactive Gameplay

Luca Weihs · Aniruddha Kembhavi · Kiana Ehsani · Sarah M Pratt · Winson Han · Alvaro Herrasti · Eric Kolve · Dustin Schwenk · Roozbeh Mottaghi · Ali Farhadi

How Does Mixup Help With Robustness and Generalization?

Linjun Zhang · Zhun Deng · Kenji Kawaguchi · Amirata Ghorbani · James Zou

Recurrent Independent Mechanisms

Anirudh Goyal · Alex Lamb · Jordan Hoffmann · Shagun Sodhani · Sergey Levine · Yoshua Bengio · Bernhard Schoelkopf

Q&A

Randomized Automatic Differentiation

Deniz Oktay · Nick McGreivy · Joshua Aduol · Alex Beatson · Ryan P Adams

Image GANs meet Differentiable Rendering for Inverse Graphics and Interpretable 3D Neural Rendering

Yuxuan Zhang · Wenzheng Chen · Huan Ling · Jun Gao · Yinan Zhang · Antonio Torralba · Sanja Fidler

Mind the Pad -- CNNs Can Develop Blind Spots

Bilal Alsallakh · Narine Kokhlikyan · Vivek Miglani · Jun Yuan · Orion Reblitz-Richardson

Implicit Convex Regularizers of CNN Architectures: Convex Optimization of Two- and Three-Layer Networks in Polynomial Time

Tolga Ergen · Mert Pilanci

Learning from Protein Structure with Geometric Vector Perceptrons

Bowen Jing · Stephan Eismann · Patricia Suriana · Raphael J Townshend · Ron Dror

Q&A

On the mapping between Hopfield networks and Restricted Boltzmann Machines

Matthew Smart · Anton Zilman

Learning-based Support Estimation in Sublinear Time

Talya Eden · Piotr Indyk · Shyam Narayanan · Ronitt Rubinfeld · Sandeep Silwal · Tal Wagner

Long-tail learning via logit adjustment

Aditya Krishna Menon · Sadeep Jayasumana · Ankit Singh Rawat · Himanshu Jain · Andreas Veit · Sanjiv Kumar

Q&A

Expo Talk Panel

Interpretability with skeptical and user-centric mind

Been Kim

11:00 PM - 12:00 AM

Interpretable machine learning has been a popular topic of study in the era of machine learning. But are we making progress? Are we heading in the right direction? In this talk, I start with a skeptical look back at the field's past work, before moving on to recent developments in more user-focused methods. The talk will finish with where we are heading and a number of open questions that we should think about.