Related papers: Model Selection over Partially Ordered Sets

Formal and Informal Model Selection with Incomplete Data

Model selection and assessment with incomplete data pose challenges in addition to the ones encountered with complete data. There are two main reasons for this. First, many models describe characteristics of the complete data, in spite of…

Methodology · Statistics 2008-08-28 Geert Verbeke , Geert Molenberghs , Caroline Beunckens

Consensus Tree Estimation with False Discovery Rate Control via Partially Ordered Sets

Connected acyclic graphs (trees) are data objects that hierarchically organize categories. Collections of trees arise in a diverse variety of fields, including evolutionary biology, public health, machine learning, social sciences and…

Methodology · Statistics 2025-12-01 Maria Alejandra Valdez Cabrera , Amy D Willis , Armeen Taeb

Causally Correct Partial Models for Reinforcement Learning

In reinforcement learning, we can learn a model of future observations and rewards, and use it to plan the agent's next actions. However, jointly modeling future observations can be computationally expensive or even intractable if the…

Machine Learning · Computer Science 2020-02-10 Danilo J. Rezende , Ivo Danihelka , George Papamakarios , Nan Rosemary Ke , Ray Jiang , Theophane Weber , Karol Gregor , Hamza Merzic , Fabio Viola , Jane Wang , Jovana Mitrovic , Frederic Besse , Ioannis Antonoglou , Lars Buesing

Efficient estimation and correction of selection-induced bias with order statistics

Model selection aims to identify a sufficiently well performing model that is possibly simpler than the most complex model among a pool of candidates. However, the decision-making process itself can inadvertently introduce non-negligible…

Methodology · Statistics 2024-08-08 Yann McLatchie , Aki Vehtari

Calibrated Selective Classification

Selective classification allows models to abstain from making predictions (e.g., say "I don't know") when in doubt in order to obtain better effective accuracy. While typical selective models can be effective at producing more accurate…

Machine Learning · Computer Science 2024-06-24 Adam Fisch , Tommi Jaakkola , Regina Barzilay

A Note on Bayesian Model Selection for Discrete Data Using Proper Scoring Rules

We consider the problem of choosing between parametric models for a discrete observable, taking a Bayesian approach in which the within-model prior distributions are allowed to be improper. In order to avoid the ambiguity in the marginal…

Statistics Theory · Mathematics 2020-04-28 A. Philip Dawid , Monica Musio , Silvia Columbu

Learning with Proper Partial Labels

Partial-label learning is a kind of weakly-supervised learning with inexact labels, where for each training example, we are given a set of candidate labels instead of only one true label. Recently, various approaches on partial-label…

Machine Learning · Computer Science 2022-08-30 Zhenguo Wu , Jiaqi Lv , Masashi Sugiyama

Variable selection using pseudo-variables

Penalized regression has become a standard tool for model building across a wide range of application domains. Common practice is to tune the amount of penalization to tradeoff bias and variance or to optimize some other measure of…

Methodology · Statistics 2018-04-05 Wenhao Hu , Eric Laber , Leonard Stefanski

Partial Order on the set of Boolean Regulatory Functions

Logical models have been successfully used to describe regulatory and signaling networks without requiring quantitative data. However, existing data is insufficient to adequately define a unique model, rendering the parametrization of a…

Discrete Mathematics · Computer Science 2019-01-24 José E. R. Cury , Pedro T. Monteiro , Claudine Chaouiya

Variable selection in measurement error models

Measurement error data or errors-in-variable data have been collected in many studies. Natural criterion functions are often unavailable for general functional measurement error models due to the lack of information on the distribution of…

Statistics Theory · Mathematics 2010-02-24 Yanyuan Ma , Runze Li

Stability Selection for Structured Variable Selection

In variable or graph selection problems, finding a right-sized model or controlling the number of false positives is notoriously difficult. Recently, a meta-algorithm called Stability Selection was proposed that can provide reliable…

Machine Learning · Statistics 2017-12-14 George Philipp , Seunghak Lee , Eric P. Xing

What Does It Take to Build a Performant Selective Classifier?

Selective classifiers improve model reliability by abstaining on inputs the model deems uncertain. However, few practical approaches achieve the gold-standard performance of a perfect-ordering oracle that accepts examples exactly in order…

Machine Learning · Computer Science 2025-10-27 Stephan Rabanser , Nicolas Papernot

Boolean modeling of collective effects in complex networks

Complex systems are often modeled as Boolean networks in attempts to capture their logical structure and reveal its dynamical consequences. Approximating the dynamics of continuous variables by discrete values and Boolean logic gates may,…

Molecular Networks · Quantitative Biology 2013-05-29 Johannes Norrell , Joshua E. S. Socolar

Multiple Testing and Error Control in Gaussian Graphical Model Selection

Graphical models provide a framework for exploration of multivariate dependence patterns. The connection between graph and statistical model is made by identifying the vertices of the graph with the observed variables and translating the…

Statistics Theory · Mathematics 2008-02-08 Mathias Drton , Michael D. Perlman

Bootstrapping multiple systems estimates to account for model selection

Multiple systems estimation using a Poisson loglinear model is a standard approach to quantifying hidden populations where data sources are based on lists of known cases. Information criteria are often used for selecting between the large…

Methodology · Statistics 2023-11-23 Bernard W. Silverman , Lax Chan , Kyle Vincent

Causal learning with sufficient statistics: an information bottleneck approach

The inference of causal relationships using observational data from partially observed multivariate systems with hidden variables is a fundamental question in many scientific domains. Methods extracting causal information from conditional…

Machine Learning · Statistics 2020-10-13 Daniel Chicharro , Michel Besserve , Stefano Panzeri

Bayesian Model Selection Based on Proper Scoring Rules

Bayesian model selection with improper priors is not well-defined because of the dependence of the marginal likelihood on the arbitrary scaling constants of the within-model prior densities. We show how this problem can be evaded by…

Statistics Theory · Mathematics 2020-04-28 A. Philip Dawid , Monica Musio

Modeling and Correcting Bias in Sequential Evaluation

We consider the problem of sequential evaluation, in which an evaluator observes candidates in a sequence and assigns scores to these candidates in an online, irrevocable fashion. Motivated by the psychology literature that has studied…

Machine Learning · Statistics 2023-11-20 Jingyan Wang , Ashwin Pananjady

Partitioned Learned Bloom Filter

Bloom filters are space-efficient probabilistic data structures that are used to test whether an element is a member of a set, and may return false positives. Recently, variations referred to as learned Bloom filters were developed that can…

Data Structures and Algorithms · Computer Science 2020-10-06 Kapil Vaidya , Eric Knorr , Tim Kraska , Michael Mitzenmacher

Beyond the Selected Completely At Random Assumption for Learning from Positive and Unlabeled Data

Most positive and unlabeled data is subject to selection biases. The labeled examples can, for example, be selected from the positive set because they are easier to obtain or more obviously positive. This paper investigates how learning can…

Machine Learning · Computer Science 2019-07-01 Jessa Bekker , Pieter Robberechts , Jesse Davis