1

Combining Reinforcement Learning and Constraint Programming for Combinatorial Optimization
Combinatorial optimization has found applications in numerous fields, from aerospace to transportation planning and economics. The goal …
A Weakly Supervised Consistency-based Learning Method for COVID-19 Segmentation in CT Images
Coronavirus Disease 2019 (COVID-19) has spread aggressively across the world causing an existential health crisis. Thus, having a …
Learning Data Augmentation with Online Bilevel Optimization for Image Classification
Data augmentation is a key practice in machine learning for improving generalization performance. However, finding the best data …
Overnet: Lightweight multi-scale super-resolution with overscaling network
Super-resolution (SR) has achieved great success due to the development of deep convolutional neural networks (CNNs). However, as the …
Adversarial Soft Advantage Fitting: Imitation Learning without Policy Optimization
Adversarial Imitation Learning alternates between learning a discriminator – which tells apart expert’s demonstrations from …
An empirical study of loss landscape geometry and evolution of the data-dependent Neural Tangent Kernel
In suitably initialized wide networks, small learning rates transform deep neural networks (DNNs) into neural tangent kernel (NTK) …
Differentiable Causal Discovery from Interventional Data
Learning a causal directed acyclic graph from data is a challenging task that involves solving a combinatorial problem for which the …
In search of robust measures of generalization
One of the principal scientific challenges in deep learning is explaining generalization, i.e., why the particular way the community …
Measuring Systematic Generalization in Neural Proof Generation with Transformers
We are interested in understanding how well Transformer language models (TLMs) can perform reasoning tasks when trained on knowledge …
Online Fast Adaptation and Knowledge Accumulation: a New Approach to Continual Learning
Continual learning studies agents that learn from streams of tasks without forgetting previous ones while adapting to new ones. Two …