Related papers: Asymptotic optimality of a cross-validatory predic…

Regression Model Selection Under General Conditions

Model selection criteria are one of the most important tools in statistics. Proofs showing a model selection criterion is asymptotically optimal are tailored to the type of model (linear regression, quantile regression, penalized…

Statistics Theory · Mathematics 2025-10-17 Amaze Lusompa

Model selection for estimation of causal parameters

A popular technique for selecting and tuning machine learning estimators is cross-validation. Cross-validation evaluates overall model fit, usually in terms of predictive accuracy. In causal inference, the optimal choice of estimator…

Methodology · Statistics 2021-07-07 Dominik Rothenhäusler

Local asymptotics of cross-validation in least-squares density estimation

In model selection, several types of cross-validation are commonly used and many variants have been introduced. While consistency of some of these methods has been proven, their rate of convergence to the oracle is generally still unknown.…

Statistics Theory · Mathematics 2021-06-21 Guillaume Maillard

On the Asymptotic Optimality of Cross-Validation based Hyper-parameter Estimators for Regularized Least Squares Regression Problems

The asymptotic optimality (a.o.) of various hyper-parameter estimators with different optimality criteria has been studied in the literature for regularized least squares regression problems. The estimators include e.g., the maximum…

Statistics Theory · Mathematics 2021-04-28 Biqiang Mu , Tianshi Chen , Lennart Ljung

Variable Selection for Linear Regression Imputation in Surveys

Survey sampling is concerned with the estimation of finite population parameters. In practice, survey data suffer from item nonresponse, which is commonly handled through imputation, i.e., replacing missing values with predicted values. As…

Methodology · Statistics 2026-03-06 Ziming An , Mehdi Dagdoug , David Haziza

Non-asymptotic model selection for linear non least-squares estimation in regression models and inverse problems

We propose to address the common problem of linear estimation in linear statistical models by using a model selection approach via penalization. Depending then on the framework in which the linear statistical model is considered namely the…

Statistics Theory · Mathematics 2009-09-11 Ikhlef Bechar

Asymptotic Accuracy of Distribution-Based Estimation for Latent Variables

Hierarchical statistical models are widely employed in information science and data engineering. The models consist of two types of variables: observable variables that represent the given data and latent variables for the unobservable…

Machine Learning · Statistics 2014-02-21 Keisuke Yamazaki

Cross-Validation, Risk Estimation, and Model Selection

Cross-validation is a popular non-parametric method for evaluating the accuracy of a predictive rule. The usefulness of cross-validation depends on the task we want to employ it for. In this note, I discuss a simple non-parametric setting,…

Methodology · Statistics 2019-09-27 Stefan Wager

Optimal model averaging for single-index models with divergent dimensions

This paper offers a new approach to address the model uncertainty in (potentially) divergent-dimensional single-index models (SIMs). We propose a model-averaging estimator based on cross-validation, which allows the dimension of covariates…

Methodology · Statistics 2022-06-14 Jiahui Zou , Wendun Wang , Xinyu Zhang , Guohua Zou

Model selection by cross-validation in an expectile linear regression

For linear models that may have asymmetric errors, we study variable selection by cross-validation. The data are split into training and validation sets, with the number of observations in the validation set much larger than in the training…

Methodology · Statistics 2026-01-16 Bilel Bousselmi , Gabriela Ciuperca

Model selection for the robust efficient signal processing observed with small L\'evy noise

We develop a new model selection method for the adaptive robust efficient nonparametric signal estimation observed with impulse noise which is defined by the general non Gaussian L\'evy processes. On the basis of the developed method, we…

Statistics Theory · Mathematics 2018-11-27 Slim Beltaief , Oleg Chernoyarov , Serguei Pergamenchtchikov

Penalized regression with multiple loss functions and selection by vote

This article considers a linear model in a high dimensional data scenario. We propose a process which uses multiple loss functions both to select relevant predictors and to estimate parameters, and study its asymptotic properties. Variable…

Methodology · Statistics 2020-07-01 Guorong Dai , Ursula U. Müller

Improving prediction accuracy by choosing resampling distribution via cross-validation

In a regression model, prediction is typically performed after model selection. The large variability in the model selection makes the prediction unstable. Thus, it is essential to reduce the variability in model selection and improve…

Computation · Statistics 2024-04-11 Wataru Yoshida , Kei Hirose

Precise Asymptotics for Linear Mixed Models with Crossed Random Effects

We obtain an asymptotic normality result that reveals the precise asymptotic behavior of the maximum likelihood estimators of parameters for a very general class of linear mixed models containing cross random effects. In achieving the…

Statistics Theory · Mathematics 2026-02-10 Jiming Jiang , Matt P. Wand , Swarnadip Ghosh

Estimating sufficient reductions of the predictors in abundant high-dimensional regressions

We study the asymptotic behavior of a class of methods for sufficient dimension reduction in high-dimension regressions, as the sample size and number of predictors grow in various alignments. It is demonstrated that these methods are…

Statistics Theory · Mathematics 2012-05-31 R. Dennis Cook , Liliana Forzani , Adam J. Rothman

On Statistical Efficiency in Learning

A central issue of many statistical learning problems is to select an appropriate model from a set of candidate models. Large models tend to inflate the variance (or overfitting), while small models tend to cause biases (or underfitting)…

Statistics Theory · Mathematics 2020-12-25 Jie Ding , Enmao Diao , Jiawei Zhou , Vahid Tarokh

Estimation and Test for Multidimensional Regression Models

This work is concerned with the estimation of multidimensional regression and the asymptotic behaviour of the test involved in selecting models. The main problem with such models is that we need to know the covariance matrix of the noise to…

Statistics Theory · Mathematics 2008-02-20 Joseph Rynkiewicz

Distributional regression with reject option

Selective prediction, where a model has the option to abstain from making a decision, is crucial for machine learning applications in which mistakes are costly. In this work, we focus on distributional regression and introduce a framework…

Statistics Theory · Mathematics 2025-04-01 Ahmed Zaoui , Clément Dombry

Sharp non-asymptotic oracle inequalities for nonparametric heteroscedastic regression models

An adaptive nonparametric estimation procedure is constructed for heteroscedastic regression when the noise variance depends on the unknown regression. A non-asymptotic upper bound for a quadratic risk (oracle inequality) is obtained

Statistics Theory · Mathematics 2010-02-09 Leonid Galtchouk , Serguei Pergamenchtchikov

Optimal model selection in density estimation

We build penalized least-squares estimators using the slope heuristic and resampling penalties. We prove oracle inequalities for the selected estimator with leading constant asymptotically equal to 1. We compare the practical performances…

Statistics Theory · Mathematics 2015-03-13 Matthieu Lerasle