Related papers: Information-Corrected Estimation: A Generalization…

Bayes and Biased Estimators Without Hyper-parameter Estimation: Comparable Performance to the Empirical-Bayes-Based Regularized Estimator

Regularized system identification has become a significant complement to more classical system identification. It has been numerically shown that kernel-based regularized estimators often perform better than the maximum likelihood estimator…

Machine Learning · Statistics 2025-03-18 Yue Ju , Bo Wahlberg , Håkan Hjalmarsson

A Statistical Theory of Regularization-Based Continual Learning

We provide a statistical analysis of regularization-based continual learning on a sequence of linear regression tasks, with emphasis on how different regularization terms affect the model performance. We first derive the convergence rate…

Machine Learning · Computer Science 2024-06-11 Xuyang Zhao , Huiyuan Wang , Weiran Huang , Wei Lin

Information-theoretic Generalization Analysis for Expected Calibration Error

While the expected calibration error (ECE), which employs binning, is widely adopted to evaluate the calibration performance of machine learning models, theoretical understanding of its estimation bias is limited. In this paper, we present…

Machine Learning · Computer Science 2025-05-27 Futoshi Futami , Masahiro Fujisawa

Machine-learning-informed parameter estimation improves the reliability of spinal cord diffusion MRI

Purpose: We address the challenge of inaccurate parameter estimation in diffusion MRI when the signal-to-noise ratio (SNR) is very low, as in the spinal cord. The accuracy of conventional maximum-likelihood estimation (MLE) depends highly…

Medical Physics · Physics 2023-01-31 Ting Gong , Francesco Grussu , Claudia A. M. Gandini Wheeler-Kingshott , Daniel C Alexander , Hui Zhang

Finite-Sample Risk Approximation and Risk-Consistent Tuning for Generalized Ridge Estimation in Nonlinear Models: Controlling Extreme Realizations

Maximum likelihood estimation in nonlinear models can exhibit substantial instability in finite samples when the data provide limited information about certain parameters. Such instability is driven by rare but extreme realizations of the…

Methodology · Statistics 2026-04-15 Masamune Iwasawa

The Risk of Machine Learning

Many applied settings in empirical economics involve simultaneous estimation of a large number of parameters. In particular, applied economists are often interested in estimating the effects of many-valued treatments (like teacher effects…

Machine Learning · Statistics 2017-04-03 Alberto Abadie , Maximilian Kasy

NICE: To Optimize In-Context Examples or Not?

Recent work shows that in-context learning and optimization of in-context examples (ICE) can significantly improve the accuracy of large language models (LLMs) on a wide range of tasks, leading to an apparent consensus that ICE optimization…

Computation and Language · Computer Science 2024-06-07 Pragya Srivastava , Satvik Golechha , Amit Deshpande , Amit Sharma

RAIN: Reinforcement Algorithms for Improving Numerical Weather and Climate Models

This study explores integrating reinforcement learning (RL) with idealised climate models to address key parameterisation challenges in climate science. Current climate models rely on complex mathematical parameterisations to represent…

Machine Learning · Computer Science 2025-04-17 Pritthijit Nath , Henry Moss , Emily Shuckburgh , Mark Webb

Meta-Learning with Generalized Ridge Regression: High-dimensional Asymptotics, Optimality and Hyper-covariance Estimation

Meta-learning involves training models on a variety of training tasks in a way that enables them to generalize well on new, unseen test tasks. In this work, we consider meta-learning within the framework of high-dimensional multivariate…

Statistics Theory · Mathematics 2024-04-01 Yanhao Jin , Krishnakumar Balasubramanian , Debashis Paul

Local Calibration: Metrics and Recalibration

Probabilistic classifiers output confidence scores along with their predictions, and these confidence scores should be calibrated, i.e., they should reflect the reliability of the prediction. Confidence scores that minimize standard metrics…

Machine Learning · Computer Science 2022-08-22 Rachel Luo , Aadyot Bhatnagar , Yu Bai , Shengjia Zhao , Huan Wang , Caiming Xiong , Silvio Savarese , Stefano Ermon , Edward Schmerling , Marco Pavone

Structure Learning in Inverse Ising Problems Using $\ell_2$-Regularized Linear Estimator

The inference performance of the pseudolikelihood method is discussed in the framework of the inverse Ising problem when the $\ell_2$-regularized (ridge) linear regression is adopted. This setup is introduced for theoretically investigating…

Disordered Systems and Neural Networks · Physics 2021-10-19 Xiangming Meng , Tomoyuki Obuchi , Yoshiyuki Kabashima

Class-wise Generalization Error: an Information-Theoretic Analysis

Existing generalization theories of supervised learning typically take a holistic approach and provide bounds for the expected generalization over the whole data distribution, which implicitly assumes that the model generalizes similarly…

Machine Learning · Computer Science 2024-01-08 Firas Laakom , Yuheng Bu , Moncef Gabbouj

Mutual Information Learned Regressor: an Information-theoretic Viewpoint of Training Regression Systems

As one of the central tasks in machine learning, regression finds lots of applications in different fields. An existing common practice for solving regression problems is the mean square error (MSE) minimization approach or its regularized…

Machine Learning · Statistics 2022-11-24 Jirong Yi , Qiaosheng Zhang , Zhen Chen , Qiao Liu , Wei Shao , Yusen He , Yaohua Wang

Understanding overfitting peaks in generalization error: Analytical risk curves for $l_2$ and $l_1$ penalized interpolation

Traditionally in regression one minimizes the number of fitting parameters or uses smoothing/regularization to trade training (TE) and generalization error (GE). Driving TE to zero by increasing fitting degrees of freedom (dof) is expected…

Machine Learning · Computer Science 2019-06-11 Partha P Mitra

When Does More Regularization Imply Fewer Degrees of Freedom? Sufficient Conditions and Counter Examples from Lasso and Ridge Regression

Regularization aims to improve prediction performance of a given statistical modeling approach by moving to a second approach which achieves worse training error but is expected to have fewer degrees of freedom, i.e., better agreement…

Statistics Theory · Mathematics 2013-11-13 Shachar Kaufman , Saharon Rosset

Generalization error in high-dimensional perceptrons: Approaching Bayes error with convex optimization

We consider a commonly studied supervised classification of a synthetic dataset whose labels are generated by feeding a one-layer neural network with random iid inputs. We study the generalization performances of standard classifiers in the…

Machine Learning · Statistics 2021-02-18 Benjamin Aubin , Florent Krzakala , Yue M. Lu , Lenka Zdeborová

Fundamental Limits of Ridge-Regularized Empirical Risk Minimization in High Dimensions

Empirical Risk Minimization (ERM) algorithms are widely used in a variety of estimation and prediction tasks in signal-processing and machine learning applications. Despite their popularity, a theory that explains their statistical…

Machine Learning · Statistics 2020-07-07 Hossein Taheri , Ramtin Pedarsani , Christos Thrampoulidis

Effective New Methods for Automated Parameter Selection in Regularized Inverse Problems

The choice of the parameter value for regularized inverse problems is critical to the results and remains a topic of interest. This article explores a criterion for selecting a good parameter value by maximizing the probability of the data,…

Numerical Analysis · Mathematics 2020-02-11 Toby Sanders , Rodrigo B. Platte , Robert D. Skeel

A Simple Correction Procedure for High-Dimensional Generalized Linear Models with Measurement Error

We consider high-dimensional generalized linear models when the covariates are contaminated by measurement error. Estimates from errors-in-variables regression models are well-known to be biased in traditional low-dimensional settings if…

Computation · Statistics 2020-01-06 Michael Byrd , Monnie McGee

Prediction Risk and Estimation Risk of the Ridgeless Least Squares Estimator under General Assumptions on Regression Errors

In recent years, there has been a significant growth in research focusing on minimum $\ell_2$ norm (ridgeless) interpolation least squares estimators. However, the majority of these analyses have been limited to an unrealistic regression…

Statistics Theory · Mathematics 2024-06-14 Sungyoon Lee , Sokbae Lee