Related papers: Linear regression model selection using p-values w…

Regression model selection via log-likelihood ratio and constrained minimum criterion

Although the log-likelihood is widely used in model selection, the log-likelihood ratio has had few applications in this area. We develop a log-likelihood ratio based method for selecting regression models by focusing on the set of models…

Methodology · Statistics 2021-09-28 Min Tsao

Consistency of Bayesian Linear Model Selection With a Growing Number of Parameters

Linear models with a growing number of parameters have been widely used in modern statistics. One important problem about this kind of model is the variable selection issue. Bayesian approaches, which provide a stochastic search of…

Statistics Theory · Mathematics 2012-02-03 Zuofeng Shang , Murray K. Clayton

Bayesian model choice and information criteria in sparse generalized linear models

We consider Bayesian model selection in generalized linear models that are high-dimensional, with the number of covariates p being large relative to the sample size n, but sparse in that the number of active covariates is small compared to…

Statistics Theory · Mathematics 2011-12-26 Rina Foygel , Mathias Drton

Statistical significance in high-dimensional linear models

We propose a method for constructing p-values for general hypotheses in a high-dimensional linear model. The hypotheses can be local for testing a single regression parameter or they may be more global involving several up to all…

Methodology · Statistics 2013-10-14 Peter Bühlmann

Selective Sequential Model Selection

Many model selection algorithms produce a path of fits specifying a sequence of increasingly complex models. Given such a sequence and the data used to produce them, we consider the problem of choosing the least complex model that is not…

Methodology · Statistics 2015-12-09 William Fithian , Jonathan Taylor , Robert Tibshirani , Ryan Tibshirani

A Minimum Message Length Criterion for Robust Linear Regression

This paper applies the minimum message length principle to inference of linear regression models with Student-t errors. A new criterion for variable selection and parameter estimation in Student-t regression is proposed. By exploiting…

Methodology · Statistics 2018-02-21 Chi Kuen Wong , Enes Makalic , Daniel F. Schmidt

Consistent Variable Selection for Functional Regression Models

The dual problem of testing the predictive significance of a particular covariate, and identification of the set of relevant covariates is common in applied research and methodological investigations. To study this problem in the context of…

Statistics Theory · Mathematics 2015-06-11 Julian A. A. Collazos , Adriano Z. Zambom

Model selection via Bayesian information capacity designs for generalised linear models

The first investigation is made of designs for screening experiments where the response variable is approximated by a generalised linear model. A Bayesian information capacity criterion is defined for the selection of designs that are…

Methodology · Statistics 2016-10-27 David C. Woods , James M. McGree , Susan M. Lewis

The Loss Rank Criterion for Variable Selection in Linear Regression Analysis

Lasso and other regularization procedures are attractive methods for variable selection, subject to a proper choice of shrinkage parameter. Given a set of potential subsets produced by a regularization algorithm, a consistent model…

Methodology · Statistics 2014-02-26 Minh-Ngoc Tran

Optimal subdata selection for linear model selection

If the assumed model does not accurately capture the underlying structure of the data, a statistical method is likely to yield sub-optimal results, and so model selection is crucial in order to conduct any statistical analysis. However, in…

Methodology · Statistics 2023-06-21 Vasilis Chasiotis , Dimitris Karlis

Linear Regression, Covariate Selection and the Failure of Modelling

It is argued that all model based approaches to the selection of covariates in linear regression have failed. This applies to frequentist approaches based on P-values and to Bayesian approaches although for different reasons. In the first…

Methodology · Statistics 2022-02-23 Laurie Davies

Three Approaches to Probability Model Selection

This paper compares three approaches to the problem of selecting among probability models to fit data (1) use of statistical criteria such as Akaike's information criterion and Schwarz's "Bayesian information criterion," (2) maximization of…

Methodology · Statistics 2016-11-04 William B. Poland , Ross D. Shachter

Robust model selection using likelihood as data

Model selection is a central task in statistics, but standard methods are not robust in misspecified settings where the true data-generating process (DGP) is not in the set of candidate models. The key limitation is that existing methods --…

Methodology · Statistics 2026-03-10 Jongwoo Choi , Neil A. Spencer , Jeffrey W. Miller

Selection by Prediction with Conformal p-values

Decision making or scientific discovery pipelines such as job hiring and drug discovery often involve multiple stages: before any resource-intensive step, there is often an initial screening that uses predictions from a machine learning…

Methodology · Statistics 2023-05-30 Ying Jin , Emmanuel J. Candès

A constrained minimum criterion for model selection

We propose a hypothesis test based model selection criterion for the best subset selection of sparse linear models. We show it is consistent in that the probability of its choosing the true model approaches one and the parameter values of…

Methodology · Statistics 2020-11-17 Min Tsao

A Robust Consistent Information Criterion for Model Selection based on Empirical Likelihood

Conventional likelihood-based information criteria for model selection rely on the distribution assumption of data. However, for complex data that are increasingly available in many scientific fields, the specification of their underlying…

Methodology · Statistics 2020-06-25 Chixiang Chen , Ming Wang , Rongling Wu , Runze Li

Consistent Bayesian Information Criterion Based on a Mixture Prior for Possibly High-Dimensional Multivariate Linear Regression Models

In the problem of selecting variables in a multivariate linear regression model, we derive new Bayesian information criteria based on a prior mixing a smooth distribution and a delta distribution. Each of them can be interpreted as a fusion…

Statistics Theory · Mathematics 2022-09-29 Haruki Kono , Tatsuya Kubokawa

Model Selection with the Loss Rank Principle

A key issue in statistics and machine learning is to automatically select the "right" model complexity, e.g., the number of neighbors to be averaged over in k nearest neighbor (kNN) regression or the polynomial degree in regression with…

Machine Learning · Computer Science 2010-10-04 Marcus Hutter , Minh-Ngoc Tran

Large-scale Nonlinear Variable Selection via Kernel Random Features

We propose a new method for input variable selection in nonlinear regression. The method is embedded into a kernel regression machine that can model general nonlinear functions, not being a priori limited to additive models. This is the…

Machine Learning · Computer Science 2018-09-05 Magda Gregorová , Jason Ramapuram , Alexandros Kalousis , Stéphane Marchand-Maillet

Regression Model Selection Under General Conditions

Model selection criteria are one of the most important tools in statistics. Proofs showing a model selection criterion is asymptotically optimal are tailored to the type of model (linear regression, quantile regression, penalized…

Statistics Theory · Mathematics 2025-10-17 Amaze Lusompa