Related papers: Overparameterized Multiple Linear Regression as Hy…

Understanding Pathologies of Deep Heteroskedastic Regression

Deep, overparameterized regression models are notorious for their tendency to overfit. This problem is exacerbated in heteroskedastic models, which predict both mean and residual noise for each data point. At one extreme, these models fit…

Machine Learning · Statistics 2024-02-15 Eliot Wong-Toi, Alex Boyd, Vincent Fortuin, Stephan Mandt

Semiparametric Regression using Variational Approximations

Semiparametric regression offers a flexible framework for modeling non-linear relationships between a response and covariates. A prime example is the generalized additive model, where splines (say) are used to approximate non-linear functional…

Statistics Theory · Mathematics 2018-10-05 Francis K. C. Hui, Chong You, Han Lin Shang, Samuel Müller

Theoretical Insights into Overparameterized Models in Multi-Task and Replay-Based Continual Learning

Multi-task learning (MTL) is a machine learning paradigm that aims to improve the generalization performance of a model on multiple related tasks by training it simultaneously on those tasks. Unlike MTL, where the model has instant access…

Machine Learning · Computer Science 2025-03-21 Amin Banayeeanzade, Mahdi Soltanolkotabi, Mohammad Rostami

Employing an Adjusted Stability Measure for Multi-Criteria Model Fitting on Data Sets with Similar Features

Fitting models with high predictive accuracy that include all relevant but no irrelevant or redundant features is a challenging task on data sets with similar (e.g. highly correlated) features. We propose the approach of tuning the…

Machine Learning · Statistics 2022-03-23 Andrea Bommert, Jörg Rahnenführer, Michel Lang

Double Machine Learning for Partially Linear Mixed-Effects Models with Repeated Measurements

Traditionally, spline or kernel approaches in combination with parametric estimation are used to infer the linear coefficient (fixed effects) in a partially linear mixed-effects model for repeated measurements. Using machine learning…

Methodology · Statistics 2023-04-03 Corinne Emmenegger, Peter Bühlmann

Fitting very flexible models: Linear regression with large numbers of parameters

There are many uses for linear fitting; the context here is interpolation and denoising of data, as when you have calibration data and you want to fit a smooth, flexible function to those data. Or you want to fit a flexible function to…

Data Analysis, Statistics and Probability · Physics 2021-09-22 David W. Hogg, Soledad Villar

Overparameterized Linear Regression under Adversarial Attacks

We study the error of linear regression in the face of adversarial attacks. In this framework, an adversary changes the input to the regression model in order to maximize the prediction error. We provide bounds on the prediction error in…

Machine Learning · Statistics 2023-03-29 Antônio H. Ribeiro, Thomas B. Schön

A New Look at an Old Problem: A Universal Learning Approach to Linear Regression

Linear regression is a classical paradigm in statistics. A new look at it is provided via the lens of universal learning. In applying universal learning to linear regression the hypotheses class represents the label $y\in {\cal R}$ as a…

Machine Learning · Computer Science 2019-11-11 Koby Bibas, Yaniv Fogel, Meir Feder

Towards Data-Algorithm Dependent Generalization: a Case Study on Overparameterized Linear Regression

One of the major open problems in machine learning is to characterize generalization in the overparameterized regime, where most traditional generalization bounds become inconsistent even for overparameterized linear regression. In many…

Machine Learning · Computer Science 2023-11-22 Jing Xu, Jiaye Teng, Yang Yuan, Andrew Chi-Chih Yao

Theoretical Characterization of the Generalization Performance of Overfitted Meta-Learning

Meta-learning has arisen as a successful method for improving training performance by training over many similar tasks, especially with deep neural networks (DNNs). However, the theoretical understanding of when and why overparameterized…

Machine Learning · Computer Science 2023-04-11 Peizhong Ju, Yingbin Liang, Ness B. Shroff

Bayesian Inference for Consistent Predictions in Overparameterized Nonlinear Regression

The remarkable generalization performance of large-scale models has been challenging the conventional wisdom of the statistical learning theory. Although recent theoretical studies have shed light on this behavior in linear models and…

Machine Learning · Statistics 2024-06-18 Tomoya Wakayama

Uniform-in-Submodel Bounds for Linear Regression in a Model Free Framework

For the last two decades, high-dimensional data and methods have proliferated throughout the literature. Yet, the classical technique of linear regression has not lost its usefulness in applications. In fact, many high-dimensional…

Statistics Theory · Mathematics 2021-05-18 Arun Kumar Kuchibhotla, Lawrence D. Brown, Andreas Buja, Edward I. George, Linda Zhao

Nearly Minimal Over-Parametrization of Shallow Neural Networks

A recent line of work has shown that an overparametrized neural network can perfectly fit the training data, an otherwise often intractable nonconvex optimization problem. For (fully-connected) shallow networks, in the best case scenario,…

Machine Learning · Computer Science 2019-10-30 Armin Eftekhari, ChaeHwan Song, Volkan Cevher

Replica analysis of overfitting in generalized linear models

Nearly all statistical inference methods were developed for the regime where the number $N$ of data samples is much larger than the data dimension $p$. Inference protocols such as maximum likelihood (ML) or maximum a posteriori probability…

Disordered Systems and Neural Networks · Physics 2020-07-09 ACC Coolen, M Sheikh, A Mozeika, F Aguirre-Lopez, F Antenucci

Parameter Estimation of Nonlinearly Parameterized Regressions without Overparameterization nor Persistent Excitation: Application to System Identification and Adaptive Control

In this paper we propose a solution to the problem of parameter estimation of nonlinearly parameterized regressions--continuous or discrete time--and apply it for system identification and adaptive control. We restrict our attention to…

Optimization and Control · Mathematics 2019-10-18 Romeo Ortega, Vladislav Gromov, Emmanuel Nuño, Anton Pyrkin, Jose Guadalupe Romero

On Generalization of Adaptive Methods for Over-parameterized Linear Regression

Over-parameterization and adaptive methods have played a crucial role in the success of deep learning in the last decade. The widespread use of over-parameterization has forced us to rethink generalization by bringing forth new phenomena,…

Machine Learning · Statistics 2020-12-01 Vatsal Shah, Soumya Basu, Anastasios Kyrillidis, Sujay Sanghavi

General bound of overfitting for MLP regression models

Multilayer perceptrons (MLPs) with one hidden layer have long been used for non-linear regression. However, in some tasks, MLPs are too powerful as models, and a small mean square error (MSE) may be more due to overfitting…

Statistics Theory · Mathematics 2012-05-10 Joseph Rynkiewicz

MUSO: Achieving Exact Machine Unlearning in Over-Parameterized Regimes

Machine unlearning (MU) aims to make a well-trained model behave as if it had never been trained on specific data. In today's over-parameterized models, dominated by neural networks, a common approach is to manually relabel data and fine-tune…

Machine Learning · Computer Science 2025-07-21 Ruikai Yang, Mingzhen He, Zhengbao He, Youmei Qiu, Xiaolin Huang

A connection between the pattern classification problem and the General Linear Model for statistical inference

A connection between the General Linear Model (GLM) in combination with classical statistical inference and the machine learning (MLE)-based inference is described in this paper. Firstly, the estimation of the GLM parameters is expressed as…

Machine Learning · Statistics 2022-02-10 Juan Manuel Gorriz, SIPBA group, John Suckling

Learning Mixtures of Linear Regressions with Nearly Optimal Complexity

Mixtures of Linear Regressions (MLR) is an important mixture model with many applications. In this model, each observation is generated from one of the several unknown linear regression components, where the identity of the generated…

Machine Learning · Computer Science 2020-03-31 Yuanzhi Li, Yingyu Liang