Related papers: Robust model selection using likelihood as data

A Robust Consistent Information Criterion for Model Selection based on Empirical Likelihood

Conventional likelihood-based information criteria for model selection rely on the distribution assumption of data. However, for complex data that are increasingly available in many scientific fields, the specification of their underlying…

Methodology · Statistics 2020-06-25 Chixiang Chen , Ming Wang , Rongling Wu , Runze Li

Robust Data-Driven Decisions Under Model Uncertainty

When sample data are governed by an unknown sequence of independent but possibly non-identical distributions, the data-generating process (DGP) in general cannot be perfectly identified from the data. For making decisions facing such…

Theoretical Economics · Economics 2022-05-11 Xiaoyu Cheng

Model averaging approaches to data subset selection

Model averaging is a useful and robust method for dealing with model uncertainty in statistical analysis. Often, it is useful to consider data subset selection at the same time, in which model selection criteria are used to compare models…

Methodology · Statistics 2023-10-26 Ethan T. Neil , Jacob W. Sitison

Improving Robust Decisions with Data

A decision-maker faces uncertainty governed by a data-generating process (DGP), which is only known to belong to a set of sequences of independent but possibly non-identical distributions. A robust decision maximizes the expected payoff…

Theoretical Economics · Economics 2026-02-12 Xiaoyu Cheng

Robust and consistent model evaluation criteria in high-dimensional regression

Most of the regularization methods such as the LASSO have one (or more) regularization parameter(s), and to select the value of the regularization parameter is essentially equal to select a model. Thus, to obtain a model suitable for the…

Methodology · Statistics 2025-11-07 Sumito Kurata , Kei Hirose

A model robust sub-sampling approach for Generalised Linear Models in Big data settings

In today's modern era of Big data, computationally efficient and scalable methods are needed to support timely insights and informed decision making. One such method is sub-sampling, where a subset of the Big data is analysed and used as…

Methodology · Statistics 2022-09-07 Amalan Mahendran , Helen Thompson , James M. McGree

Comparing and weighting imperfect models using D-probabilities

We propose a new approach for assigning weights to models using a divergence-based method ({\em D-probabilities}), relying on evaluating parametric models relative to a nonparametric Bayesian reference using Kullback-Leibler divergence.…

Methodology · Statistics 2019-04-30 Meng Li , David B. Dunson

Robust Clustering with Normal Mixture Models: A Pseudo $\beta$-Likelihood Approach

As in other estimation scenarios, likelihood based estimation in the normal mixture set-up is highly non-robust against model misspecification and presence of outliers (apart from being an ill-posed optimization problem). A robust…

Methodology · Statistics 2023-12-20 Soumya Chakraborty , Ayanendranath Basu , Abhik Ghosh

Robust Fitting for Generalized Additive Models for Location, Scale and Shape

The validity of estimation and smoothing parameter selection for the wide class of generalized additive models for location, scale and shape (GAMLSS) relies on the correct specification of a likelihood function. Deviations from such…

Methodology · Statistics 2019-11-14 William H. Aeberhard , Eva Cantoni , Giampiero Marra , Rosalba Radice

Regression model selection via log-likelihood ratio and constrained minimum criterion

Although the log-likelihood is widely used in model selection, the log-likelihood ratio has had few applications in this area. We develop a log-likelihood ratio based method for selecting regression models by focusing on the set of models…

Methodology · Statistics 2021-09-28 Min Tsao

Robust variable selection for model-based learning in presence of adulteration

The problem of identifying the most discriminating features when performing supervised learning has been extensively investigated. In particular, several methods for variable selection in model-based classification have been proposed.…

Applications · Statistics 2020-12-16 Andrea Cappozzo , Francesca Greselin , Thomas Brendan Murphy

Robust Probabilistic Modeling with Bayesian Data Reweighting

Probabilistic models analyze data by relying on a set of assumptions. Data that exhibit deviations from these assumptions can undermine inference and prediction quality. Robust models offer protection against mismatch between a model's…

Machine Learning · Statistics 2018-06-20 Yixin Wang , Alp Kucukelbir , David M. Blei

Linear regression model selection using p-values when the model dimension grows

We consider a new criterion-based approach to model selection in linear regression. Properties of selection criteria based on p-values of a likelihood ratio statistic are studied for families of linear regression models. We prove that such…

Statistics Theory · Mathematics 2012-05-21 Piotr Pokarowski , Jan Mielniczuk , Paweł Teisseyre

Statistical Classification via Robust Hypothesis Testing

In this letter, we consider multiple statistical classification problem where a sequence of n independent and identically distributed observations, that are generated by one of M discrete sources, need to be classified. The source…

Information Theory · Computer Science 2021-08-31 Hüseyin Afşer

Bias Correction of Learned Generative Models using Likelihood-Free Importance Weighting

A learned generative model often produces biased statistics relative to the underlying data distribution. A standard technique to correct this bias is importance sampling, where samples from the model are weighted by the likelihood ratio…

Machine Learning · Statistics 2019-11-05 Aditya Grover , Jiaming Song , Alekh Agarwal , Kenneth Tran , Ashish Kapoor , Eric Horvitz , Stefano Ermon

Selection Criterion for Log-Linear Models Using Statistical Learning Theory

Log-linear models are a well-established method for describing statistical dependencies among a set of n random variables. The observed frequencies of the n-tuples are explained by a joint probability such that its logarithm is a sum of…

Statistics Theory · Mathematics 2007-06-13 Daniel Herrmann , Dominik Janzing

Robust Estimation in Generalised Linear Models : The Density Power Divergence Approach

The generalised linear model (GLM) is a very important tool for analysing real data in biology, sociology, agriculture, engineering and many other application domain where the relationship between the response and explanatory variables may…

Methodology · Statistics 2016-07-04 Abhik Ghosh , Ayanendranath Basu

Robust Variable Selection Criteria for the Penalized Regression

We propose a robust variable selection procedure using a divergence based M-estimator combined with a penalty function. It produces robust estimates of the regression parameters and simultaneously selects the important explanatory…

Methodology · Statistics 2020-01-01 Abhijit Mandal , Samiran Ghosh

Robust Variable Selection in High-dimensional Nonparametric Additive Model

Additive models belong to the class of structured nonparametric regression models that do not suffer from the curse of dimensionality. Finding the additive components that are nonzero when the true model is assumed to be sparse is an…

Methodology · Statistics 2025-05-08 Suneel Babu Chatla , Abhijit Mandal

Model Selection for independent not identically distributed observations based on R\'enyi's pseudodistances

Model selection criteria are rules used to select the best statistical model among a set of candidate models, striking a trade-off between goodness of fit and model complexity. Most popular model selection criteria measure the goodness of…

Statistics Theory · Mathematics 2023-04-13 Angel Felipe , Maria Jaenada , Pedro Miranda , Leandro Pardo