English
Related papers

Related papers: Predict then Interpolate: A Simple Algorithm to Le…

200 papers

Many modern machine learning models are trained to achieve zero or near-zero training error in order to obtain near-optimal (but non-zero) test error. This phenomenon of strong generalization performance for "overfitted" / interpolated…

Machine Learning · Statistics 2018-10-29 Mikhail Belkin , Daniel Hsu , Partha Mitra

Recent advances in deep learning have significantly elevated weather prediction models. However, these models often falter in real-world scenarios due to their sensitivity to spatial-temporal shifts. This issue is particularly acute in…

Machine Learning · Computer Science 2023-12-04 Lu Han , Xu-Yang Chen , Han-Jia Ye , De-Chuan Zhan

Learned classifiers should often possess certain invariance properties meant to encourage fairness, robustness, or out-of-distribution generalization. However, multiple recent works empirically demonstrate that common invariance-inducing…

Machine Learning · Computer Science 2024-07-04 Yoav Wald , Gal Yona , Uri Shalit , Yair Carmon

We introduce Interpolation Consistency Training (ICT), a simple and computation efficient algorithm for training Deep Neural Networks in the semi-supervised learning paradigm. ICT encourages the prediction at an interpolation of unlabeled…

Machine Learning · Statistics 2022-10-20 Vikas Verma , Kenji Kawaguchi , Alex Lamb , Juho Kannala , Arno Solin , Yoshua Bengio , David Lopez-Paz

A probability forecast or probabilistic classifier is reliable or calibrated if the predicted probabilities are matched by ex post observed frequencies, as examined visually in reliability diagrams. The classical binning and counting…

Methodology · Statistics 2021-08-26 Timo Dimitriadis , Tilmann Gneiting , Alexander I. Jordan

Machine learning can benefit from causal discovery for interpretation and from causal inference for generalization. In this line of research, a few invariant learning algorithms for out-of-distribution (OOD) generalization have been…

Machine Learning · Computer Science 2023-04-06 Borja Guerrero Santillan

Predictive coding networks are neuroscience-inspired models with roots in both Bayesian statistics and neuroscience. Training such models, however, is quite inefficient and unstable. In this work, we show how by simply changing the temporal…

Neural and Evolutionary Computing · Computer Science 2024-02-08 Tommaso Salvatori , Yuhang Song , Yordan Yordanov , Beren Millidge , Zhenghua Xu , Lei Sha , Cornelius Emde , Rafal Bogacz , Thomas Lukasiewicz

We study the problem of learning causal models from observational data through the lens of interpolation and its counterpart -- regularization. A large volume of recent theoretical, as well as empirical work, suggests that, in highly…

Machine Learning · Statistics 2022-02-21 Leena Chennuru Vankadara , Luca Rendsburg , Ulrike von Luxburg , Debarghya Ghoshdastidar

In several real world applications, machine learning models are deployed to make predictions on data whose distribution changes gradually along time, leading to a drift between the train and test distributions. Such models are often…

Machine Learning · Computer Science 2021-11-23 Anshul Nasery , Soumyadeep Thakur , Vihari Piratla , Abir De , Sunita Sarawagi

While unbiased machine learning models are essential for many applications, bias is a human-defined concept that can vary across tasks. Given only input-label pairs, algorithms may lack sufficient information to distinguish stable (causal)…

Machine Learning · Computer Science 2022-06-28 Yujia Bao , Shiyu Chang , Regina Barzilay

A critical need for industrial recommender systems is the ability to evaluate recommendation policies offline, before deploying them to production. Unfortunately, widely used off-policy evaluation methods either make strong assumptions…

Machine Learning · Computer Science 2022-10-19 Alexander Buchholz , Ben London , Giuseppe di Benedetto , Thorsten Joachims

Contextual optimization, also known as predict-then-optimize or prescriptive analytics, considers an optimization problem with the presence of covariates (context or side information). The goal is to learn a prediction model (from the…

Optimization and Control · Mathematics 2024-05-13 Chunlin Sun , Linyu Liu , Xiaocheng Li

The work aims to stabilize the unstable index-1 descriptor systems by Riccati-based feedback stabilization via a modified form of Iterative Rational Krylov Algorithm (IRKA), which is a bi-tangential interpolation-based technique. In the…

Optimization and Control · Mathematics 2021-03-31 Mahtab Uddin , M. Monir Uddin , M. A. H. Khan , Md. Motlubar Rahman

Within the machine learning community, the widely-used uniform convergence framework has been used to answer the question of how complex, over-parameterized models can generalize well to new data. This approach bounds the test error of the…

Machine Learning · Statistics 2021-03-05 Ryan Theisen , Jason M. Klusowski , Michael W. Mahoney

This paper gives a general interpretation of Linear Prediction (LP) by interpolation framework different from the perspective of statistics. This interpretation is proved to be useful by several following results, such as: The mechanism of…

Signal Processing · Electrical Eng. & Systems 2019-05-21 Changcun Huang

We propose a network architecture capable of reliably estimating uncertainty of regression based predictions without sacrificing accuracy. The current state-of-the-art uncertainty algorithms either fall short of achieving prediction…

Machine Learning · Computer Science 2022-02-22 Kinjal Patel , Steven Waslander

We examine the necessity of interpolation in overparameterized models, that is, when achieving optimal predictive risk in machine learning problems requires (nearly) interpolating the training data. In particular, we consider simple…

Machine Learning · Statistics 2022-06-17 Chen Cheng , John Duchi , Rohith Kuditipudi

Pursuing invariant prediction from heterogeneous environments opens the door to learning causality in a purely data-driven way and has several applications in causal discovery and robust transfer learning. However, existing methods such as…

Statistics Theory · Mathematics 2025-01-30 Yihong Gu , Cong Fang , Yang Xu , Zijian Guo , Jianqing Fan

Healthcare data often come from multiple sites in which the correlations between confounding variables can vary widely. If deep learning models exploit these unstable correlations, they might fail catastrophically in unseen sites. Although…

Machine Learning · Computer Science 2023-10-25 Minh Nguyen , Alan Q. Wang , Heejong Kim , Mert R. Sabuncu

Prediction intervals offer an effective tool for quantifying the uncertainty of loads in distribution systems. The traditional central PIs cannot adapt well to skewed distributions, and their offline training fashion is vulnerable to…

Applications · Statistics 2023-11-30 Yufan Zhang , Honglin Wen , Qiuwei Wu , Qian Ai
‹ Prev 1 2 3 10 Next ›