Related papers: Model Selection Confidence Sets by Likelihood Rati…

Model selection confidence sets for time series models with applications to electricity load data

This paper studies the Model Selection Confidence Set (MSCS) methodology for univariate time series models involving autoregressive and moving average components, and applies it to study model selection uncertainty in the Italian…

Econometrics · Economics 2026-02-19 Piersilvio De Bortoli, Davide Ferrari, Francesco Ravazzolo, Luca Rossini

Confidence set for mixture order selection

A fundamental challenge in approximating an unknown density using finite Gaussian mixture models is selecting the number of mixture components, also known as order. Traditional approaches choose a single best model using information…

Methodology · Statistics 2025-06-25 Alessandro Casa, Davide Ferrari

Model Class Selection

Classical model selection seeks to find a single model within a particular class that optimizes some pre-specified criteria, such as maximizing a likelihood or minimizing a risk. More recently, there has been an increased interest in model…

Methodology · Statistics 2025-11-17 Ryan Cecil, Lucas Mentch

Sequential model confidence sets

In most prediction and estimation situations, scientists consider various statistical models for the same problem, and naturally want to select amongst the best. Hansen et al. (2011) provide a powerful solution to this problem by the…

Methodology · Statistics 2026-01-23 Sebastian Arnold, Georgios Gavrilopoulos, Benedikt Schulz, Johanna Ziegel

Selection Confidence Sets for Equally Weighted Portfolios

Given a universe of N assets, investors often form equally weighted portfolios (EWPs) by selecting subsets of assets. EWPs are simple, robust, and competitive out-of-sample, yet the uncertainty about which subset truly performs best is…

Portfolio Management · Quantitative Finance 2025-10-20 Davide Ferrari, Alessandro Fulci, Sandra Paterlini

Model Confidence Bounds for Variable Selection

In this article, we introduce the concept of model confidence bounds (MCB) for variable selection in the context of nested models. Similarly to the endpoints in the familiar confidence interval for parameter estimation, the MCB identifies…

Methodology · Statistics 2018-07-27 Yang Li, Yuetian Luo, Davide Ferrari, Xiaonan Hu, Yichen Qin

Monte Carlo Confidence Sets for Identified Sets

In complicated/nonlinear parametric models, it is generally hard to know whether the model parameters are point identified. We provide computationally attractive procedures to construct confidence sets (CSs) for identified sets of full…

Methodology · Statistics 2022-06-06 Xiaohong Chen, Timothy Christensen, Elie Tamer

Provable Model Provenance Set for Large Language Models

The growing prevalence of unauthorized model usage and misattribution has increased the need for reliable model provenance analysis. However, existing methods largely rely on heuristic fingerprint-matching rules that lack provable error…

Machine Learning · Computer Science 2026-02-03 Xiaoqi Qiu, Hao Zeng, Zhiyu Hou, Hongxin Wei

Likelihood Ratio Confidence Sets for Sequential Decision Making

Certifiable, adaptive uncertainty estimates for unknown quantities are an essential ingredient of sequential decision-making algorithms. Standard approaches rely on problem-dependent concentration results and are limited to a specific…

Machine Learning · Computer Science 2023-11-09 Nicolas Emmenegger, Mojmír Mutný, Andreas Krause

Calibrated Selective Classification

Selective classification allows models to abstain from making predictions (e.g., say "I don't know") when in doubt in order to obtain better effective accuracy. While typical selective models can be effective at producing more accurate…

Machine Learning · Computer Science 2024-06-24 Adam Fisch, Tommi Jaakkola, Regina Barzilay

Exact Multivariate Tests - A New Effective Principle of Controlled Model Choice

High-dimensional tests are applied to find relevant sets of variables and relevant models. If variables are selected by analyzing the sums of products matrices and a corresponding mean-value test is performed, there is the danger that the…

Methodology · Statistics 2012-02-10 Juergen Laeuter, Maciej Rosolowski, Ekkehard Glimm

Sparse maximum likelihood estimation for regression models

For regression model selection via maximum likelihood estimation, we adopt a vector representation of candidate models and study the likelihood ratio confidence region for the regression parameter vector of a full model. We show that when…

Statistics Theory · Mathematics 2024-04-09 Min Tsao

From Model Choice to Model Belief: Establishing a New Measure for LLM-Based Research

Large language models (LLMs) are increasingly used to simulate human behavior, but common practices to use LLM-generated data are inefficient. Treating an LLM's output ("model choice") as a single data point underutilizes the information…

Artificial Intelligence · Computer Science 2025-12-30 Hongshen Sun, Juanjuan Zhang

Sample-Cluster-Select: A new framework to obtain diverse approximate solutions of combinatorial optimization problems

When solving real-world problems, practitioners often hesitate to implement solutions obtained from mathematical models, especially for important decisions. This hesitation stems from practitioners' lack of trust in optimization models and…

Optimization and Control · Mathematics 2025-07-01 Susumu Hashimoto, Takeaki Uno

Conditional Method Confidence Set

This paper proposes a Conditional Method Confidence Set (CMCS), which selects the best subset of forecasting methods with equal predictive ability conditional on a specific economic regime. The test resembles the Model Confidence…

Econometrics · Economics 2025-05-28 Lukas Bauer, Ekaterina Kazak

Cross-Validation with Confidence

Cross-validation is one of the most popular model selection methods in statistics and machine learning. Despite its wide applicability, traditional cross validation methods tend to select overfitting models, due to the ignorance of the…

Methodology · Statistics 2017-12-25 Jing Lei

Test Selection for Deep Learning Systems

Testing of deep learning models is challenging due to the excessive number and complexity of computations involved. As a result, test data selection is performed manually and in an ad hoc way. This raises the question of how we can…

Machine Learning · Computer Science 2019-05-01 Wei Ma, Mike Papadakis, Anestis Tsakmalis, Maxime Cordy, Yves Le Traon

Robust model selection using likelihood as data

Model selection is a central task in statistics, but standard methods are not robust in misspecified settings where the true data-generating process (DGP) is not in the set of candidate models. The key limitation is that existing methods --…

Methodology · Statistics 2026-03-10 Jongwoo Choi, Neil A. Spencer, Jeffrey W. Miller

A Probabilistic Framework for LLM-Based Model Discovery

Automated methods for discovering mechanistic simulator models from observational data offer a promising path toward accelerating scientific progress. Such methods often take the form of agentic-style iterative workflows that repeatedly…

Machine Learning · Computer Science 2026-02-23 Stefan Wahl, Raphaela Schenk, Ali Farnoud, Jakob H. Macke, Daniel Gedon

Multi-Perspective Consistency Enhances Confidence Estimation in Large Language Models

In the deployment of large language models (LLMs), accurate confidence estimation is critical for assessing the credibility of model predictions. However, existing methods often fail to overcome the issue of overconfidence on incorrect…

Computation and Language · Computer Science 2024-02-20 Pei Wang, Yejie Wang, Muxi Diao, Keqing He, Guanting Dong, Weiran Xu