Related papers: Interpolation and Regularization for Causal Learni…

Generalizable Information Theoretic Causal Representation

It is evidence that representation learning can improve model's performance over multiple downstream tasks in many real-world scenarios, such as image classification and recommender systems. Existing learning approaches rely on establishing…

Machine Learning · Computer Science 2022-02-18 Mengyue Yang , Xinyu Cai , Furui Liu , Xu Chen , Zhitang Chen , Jianye Hao , Jun Wang

Memorize to Generalize: on the Necessity of Interpolation in High Dimensional Linear Regression

We examine the necessity of interpolation in overparameterized models, that is, when achieving optimal predictive risk in machine learning problems requires (nearly) interpolating the training data. In particular, we consider simple…

Machine Learning · Statistics 2022-06-17 Chen Cheng , John Duchi , Rohith Kuditipudi

Regularizing Adversarial Imitation Learning Using Causal Invariance

Imitation learning methods are used to infer a policy in a Markov decision process from a dataset of expert demonstrations by minimizing a divergence measure between the empirical state occupancy measures of the expert and the policy. The…

Machine Learning · Computer Science 2023-08-21 Ivan Ovinnikov , Joachim M. Buhmann

Implicit Regularization Leads to Benign Overfitting for Sparse Linear Regression

In deep learning, often the training process finds an interpolator (a solution with 0 training loss), but the test loss is still low. This phenomenon, known as benign overfitting, is a major mystery that received a lot of recent attention.…

Machine Learning · Computer Science 2023-05-29 Mo Zhou , Rong Ge

Malign Overfitting: Interpolation Can Provably Preclude Invariance

Learned classifiers should often possess certain invariance properties meant to encourage fairness, robustness, or out-of-distribution generalization. However, multiple recent works empirically demonstrate that common invariance-inducing…

Machine Learning · Computer Science 2024-07-04 Yoav Wald , Gal Yona , Uri Shalit , Yair Carmon

Good Classifiers are Abundant in the Interpolating Regime

Within the machine learning community, the widely-used uniform convergence framework has been used to answer the question of how complex, over-parameterized models can generalize well to new data. This approach bounds the test error of the…

Machine Learning · Statistics 2021-03-05 Ryan Theisen , Jason M. Klusowski , Michael W. Mahoney

Discovering Reliable Causal Rules

We study the problem of deriving policies, or rules, that when enacted on a complex system, cause a desired outcome. Absent the ability to perform controlled experiments, such rules have to be inferred from past observations of the system's…

Machine Learning · Computer Science 2020-09-09 Kailash Budhathoki , Mario Boley , Jilles Vreeken

Specify Robust Causal Representation from Mixed Observations

Learning representations purely from observations concerns the problem of learning a low-dimensional, compact representation which is beneficial to prediction models. Under the hypothesis that the intrinsic latent factors follow some casual…

Machine Learning · Computer Science 2023-10-24 Mengyue Yang , Xinyu Cai , Furui Liu , Weinan Zhang , Jun Wang

An Introduction to Proximal Causal Learning

A standard assumption for causal inference from observational data is that one has measured a sufficiently rich set of covariates to ensure that within covariate strata, subjects are exchangeable across observed treatment values. Skepticism…

Methodology · Statistics 2020-09-24 Eric J Tchetgen Tchetgen , Andrew Ying , Yifan Cui , Xu Shi , Wang Miao

A kinetic-based regularization method for data science applications

We propose a physics-based regularization technique for function learning, inspired by statistical mechanics. By drawing an analogy between optimizing the parameters of an interpolator and minimizing the energy of a system, we introduce…

Machine Learning · Computer Science 2025-08-20 Abhisek Ganguly , Alessandro Gabbana , Vybhav Rao , Sauro Succi , Santosh Ansumali

A Statistical Theory of Regularization-Based Continual Learning

We provide a statistical analysis of regularization-based continual learning on a sequence of linear regression tasks, with emphasis on how different regularization terms affect the model performance. We first derive the convergence rate…

Machine Learning · Computer Science 2024-06-11 Xuyang Zhao , Huiyuan Wang , Weiran Huang , Wei Lin

Causal Regularization

I argue that regularizing terms in standard regression methods not only help against overfitting finite data, but sometimes also yield better causal models in the infinite sample regime. I first consider a multi-dimensional variable…

Machine Learning · Statistics 2019-07-01 Dominik Janzing

Causal Regularization

In application domains such as healthcare, we want accurate predictive models that are also causally interpretable. In pursuit of such models, we propose a causal regularizer to steer predictive models towards causally-interpretable…

Machine Learning · Computer Science 2017-02-24 Mohammad Taha Bahadori , Krzysztof Chalupka , Edward Choi , Robert Chen , Walter F. Stewart , Jimeng Sun

Understanding and Improving Interpolation in Autoencoders via an Adversarial Regularizer

Autoencoders provide a powerful framework for learning compressed representations by encoding all of the information needed to reconstruct a data point in a latent code. In some cases, autoencoders can "interpolate": By decoding the convex…

Machine Learning · Computer Science 2018-07-25 David Berthelot , Colin Raffel , Aurko Roy , Ian Goodfellow

Causal Imitation Learning with Unobserved Confounders

One of the common ways children learn is by mimicking adults. Imitation learning focuses on learning policies with suitable performance from demonstrations generated by an expert, with an unspecified performance measure, and unobserved…

Machine Learning · Computer Science 2022-08-15 Junzhe Zhang , Daniel Kumor , Elias Bareinboim

Deep Learning Generalization, Extrapolation, and Over-parameterization

We study the generalization of over-parameterized deep networks (for image classification) in relation to the convex hull of their training sets. Despite their great success, generalization of deep networks is considered a mystery. These…

Machine Learning · Computer Science 2022-03-22 Roozbeh Yousefzadeh

Combining Interventional and Observational Data Using Causal Reductions

Unobserved confounding is one of the main challenges when estimating causal effects. We propose a causal reduction method that, given a causal model, replaces an arbitrary number of possibly high-dimensional latent confounders with a single…

Machine Learning · Statistics 2023-02-24 Maximilian Ilse , Patrick Forré , Max Welling , Joris M. Mooij

Generalization in anti-causal learning

The ability to learn and act in novel situations is still a prerogative of animate intelligence, as current machine learning methods mostly fail when moving beyond the standard i.i.d. setting. What is the reason for this discrepancy? Most…

Machine Learning · Computer Science 2018-12-04 Niki Kilbertus , Giambattista Parascandolo , Bernhard Schölkopf

Regularizing towards Causal Invariance: Linear Models with Proxies

We propose a method for learning linear models whose predictive performance is robust to causal interventions on unobserved variables, when noisy proxies of those variables are available. Our approach takes the form of a regularization term…

Machine Learning · Computer Science 2021-06-29 Michael Oberst , Nikolaj Thams , Jonas Peters , David Sontag

Overfitting or perfect fitting? Risk bounds for classification and regression rules that interpolate

Many modern machine learning models are trained to achieve zero or near-zero training error in order to obtain near-optimal (but non-zero) test error. This phenomenon of strong generalization performance for "overfitted" / interpolated…

Machine Learning · Statistics 2018-10-29 Mikhail Belkin , Daniel Hsu , Partha Mitra