Related papers: Pathological Regularization Regimes in Classificat…

Free Dynamics of Feature Learning Processes

Regression models usually tend to recover a noisy signal in the form of a combination of regressors, also called features in machine learning, themselves being the result of a learning process.The alignment of the prior covariance feature…

Statistical Mechanics · Physics 2023-01-25 Cyril Furtlehner

Optimal Regularization for Performative Learning

In performative learning, the data distribution reacts to the deployed model - for example, because strategic users adapt their features to game it - which creates a more complex dynamic than in classical supervised learning. One should…

Machine Learning · Computer Science 2025-10-15 Edwige Cyffers , Alireza Mirrokni , Marco Mondelli

Regularized Linear Regression for Binary Classification

Regularized linear regression is a promising approach for binary classification problems in which the training set has noisy labels since the regularization term can help to avoid interpolating the mislabeled data points. In this paper we…

Machine Learning · Computer Science 2023-11-07 Danil Akhtiamov , Reza Ghane , Babak Hassibi

Bayesian Sampling Bias Correction: Training with the Right Loss Function

We derive a family of loss functions to train models in the presence of sampling bias. Examples are when the prevalence of a pathology differs from its sampling rate in the training dataset, or when a machine learning practioner rebalances…

Machine Learning · Computer Science 2020-06-25 L. Le Folgoc , V. Baltatzis , A. Alansary , S. Desai , A. Devaraj , S. Ellis , O. E. Martinez Manzanera , F. Kanavati , A. Nair , J. Schnabel , B. Glocker

Multi-category Angle-based Classifier Refit

Classification is an important statistical learning tool. In real application, besides high prediction accuracy, it is often desirable to estimate class conditional probabilities for new observations. For traditional problems where the…

Statistics Theory · Mathematics 2025-03-18 Guo Xian Yau , Chong Zhang

Classification as Direction Recovery: Improved Guarantees via Scale Invariance

Modern algorithms for binary classification rely on an intermediate regression problem for computational tractability. In this paper, we establish a geometric distinction between classification and regression that allows risk in these two…

Machine Learning · Statistics 2022-05-19 Suhas Vijaykumar , Claire Lazar Reich

Iterative regularization in classification via hinge loss diagonal descent

Iterative regularization is a classic idea in regularization theory, that has recently become popular in machine learning. On the one hand, it allows to design efficient algorithms controlling at the same time numerical and statistical…

Machine Learning · Statistics 2024-10-10 Vassilis Apidopoulos , Tomaso Poggio , Lorenzo Rosasco , Silvia Villa

A Statistical Theory of Regularization-Based Continual Learning

We provide a statistical analysis of regularization-based continual learning on a sequence of linear regression tasks, with emphasis on how different regularization terms affect the model performance. We first derive the convergence rate…

Machine Learning · Computer Science 2024-06-11 Xuyang Zhao , Huiyuan Wang , Weiran Huang , Wei Lin

Model selection of polynomial kernel regression

Polynomial kernel regression is one of the standard and state-of-the-art learning strategies. However, as is well known, the choices of the degree of polynomial kernel and the regularization parameter are still open in the realm of model…

Machine Learning · Computer Science 2023-06-14 Shaobo Lin , Xingping Sun , Zongben Xu , Jinshan Zeng

Learned Regularization for Inverse Problems: Insights from a Spectral Model

In this chapter we provide a theoretically founded investigation of state-of-the-art learning approaches for inverse problems from the point of view of spectral reconstruction operators. We give an extended definition of regularization…

Numerical Analysis · Mathematics 2024-06-05 Martin Burger , Samira Kabri

On Inverse Problems, Parameter Estimation, and Domain Generalization

Signal restoration and inverse problems are key elements in most real-world data science applications. In the past decades, with the emergence of machine learning methods, inversion of measurements has become a popular step in almost all…

Information Theory · Computer Science 2026-04-21 Deborah Pereg

A semi-automatic method to guide the choice of ridge parameter in ridge regression

We consider the application of a popular penalised regression method, Ridge Regression, to data with very high dimensions and many more covariates than observations. Our motivation is the problem of out-of-sample prediction and the setting…

Applications · Statistics 2012-05-04 Erika Cule , Maria De Iorio

A Probabilistic Perspective on Model Collapse

In recent years, model collapse has become a critical issue in language model training, making it essential to understand the underlying mechanisms driving this phenomenon. In this paper, we investigate recursive parametric model training…

Machine Learning · Statistics 2025-05-23 Shirong Xu , Hengzhi He , Guang Cheng

Distribution-dependent Generalization Bounds for Tuning Linear Regression Across Tasks

Modern regression problems often involve high-dimensional data and a careful tuning of the regularization hyperparameters is crucial to avoid overly complex models that may overfit the training data while guaranteeing desirable properties…

Machine Learning · Computer Science 2026-04-08 Maria-Florina Balcan , Saumya Goyal , Dravyansh Sharma

A Linear Approach to Data Poisoning

Backdoor and data-poisoning attacks can flip predictions with tiny training corruptions, yet a sharp theory linking poisoning strength, overparameterization, and regularization is lacking. We analyze ridge least squares with an unpenalized…

Machine Learning · Statistics 2026-01-06 Donald Flynn , Diego Granziol

The Choice of Normalization Influences Shrinkage in Regularized Regression

Regularized models are often sensitive to the scales of the features in the data and it has therefore become standard practice to normalize (center and scale) the features before fitting the model. But there are many different ways to…

Machine Learning · Statistics 2025-07-04 Johan Larsson , Jonas Wallin

Asymptotics of Ridge (less) Regression under General Source Condition

We analyze the prediction error of ridge regression in an asymptotic regime where the sample size and dimension go to infinity at a proportional rate. In particular, we consider the role played by the structure of the true regression…

Statistics Theory · Mathematics 2021-03-09 Dominic Richards , Jaouad Mourtada , Lorenzo Rosasco

Scaling and renormalization in high-dimensional regression

From benign overfitting in overparameterized models to rich power-law scalings in performance, simple ridge regression displays surprising behaviors sometimes thought to be limited to deep neural networks. This balance of phenomenological…

Machine Learning · Statistics 2026-05-08 Alexander Atanasov , Jacob A. Zavatone-Veth , Cengiz Pehlevan

Does Regression Produce Representative Causal Rankings?

We examine the challenges in ranking multiple treatments based on their estimated effects when using linear regression or its popular double-machine-learning variant, the Partially Linear Model (PLM), in the presence of treatment effect…

Econometrics · Economics 2024-11-06 Apoorva Lal

Bias Correction for Regularized Regression and its Application in Learning with Streaming Data

We propose an approach to reduce the bias of ridge regression and regularization kernel network. When applied to a single data set the new algorithms have comparable learning performance with the original ones. When applied to incremental…

Machine Learning · Statistics 2016-03-17 Qiang Wu