Related papers: Probability-Based Estimation

Minimizing Uncertainty in Prevalence Estimates

Estimating prevalence, the fraction of a population with a certain medical condition, is fundamental to epidemiology. Traditional methods rely on classification of test samples taken at random from a population. Such approaches to…

Methodology · Statistics 2022-03-25 Paul Patrone, Anthony Kearsley

Evaluating probabilities without model risk

This article presents methods for estimating extreme probabilities, beyond the range of the observations. These methods are model-free and applicable to almost any sample size. They are grounded in order statistics theory and have a wide…

Applications · Statistics 2025-04-03 Joan del Castillo, Pedro Puig

Exploring the Difficulty of Estimating Win Probability: A Simulation Study

Estimating win probability is one of the classic modeling tasks of sports analytics. Many widely used win probability estimators use machine learning to fit the relationship between a binary win/loss outcome variable and certain game-state…

Methodology · Statistics 2025-08-21 Ryan S. Brill, Ronald Yurko, Abraham J. Wyner

Pulling back the curtain: the road from statistical estimand to machine-learning based estimator for epidemiologists (no wizard required)

Epidemiologists increasingly use causal inference methods that rely on machine learning, as these approaches can relax unnecessary model specification assumptions. While deriving and studying asymptotic properties of such estimators is a…

Methodology · Statistics 2025-02-11 Audrey Renson, Lina Montoya, Dana E. Goin, Iván Díaz, Rachael K. Ross

Valid inferential models for prediction in supervised learning problems

Prediction, where observed data is used to quantify uncertainty about a future observation, is a fundamental problem in statistics. Prediction sets with coverage probability guarantees are a common solution, but these do not provide…

Statistics Theory · Mathematics 2022-11-22 Leonardo Cella, Ryan Martin

Efficient simulation for dependent rare events with applications to extremes

We consider the general problem of estimating probabilities which arise as a union of dependent events. We propose a flexible series of estimators for such probabilities, and describe variance reduction schemes applied to the proposed…

Probability · Mathematics 2016-10-10 Lars Nørvang Andersen, Patrick J. Laub, Leonardo Rojas-Nandayapa

Overcoming Selection Bias in Statistical Studies With Amortized Bayesian Inference

Selection bias arises when the probability that an observation enters a dataset depends on variables related to the quantities of interest, leading to systematic distortions in estimation and uncertainty quantification. For example, in…

Machine Learning · Statistics 2026-04-21 Jonas Arruda, Sophie Chervet, Paula Staudt, Andreas Wieser, Michael Hoelscher, Isabelle Sermet-Gaudelus, Nadine Binder, Lulla Opatowski, Jan Hasenauer

Knowledge from Probability

We give a probabilistic analysis of inductive knowledge and belief and explore its predictions concerning knowledge about the future, about laws of nature, and about the values of inexactly measured quantities. The analysis combines a…

Logic in Computer Science · Computer Science 2021-06-23 Jeremy Goodman, Bernhard Salow

Bias in parametric estimation: reduction and useful side-effects

The bias of an estimator is defined as the difference of its expected value from the parameter to be estimated, where the expectation is with respect to the model. Loosely speaking, small bias reflects the desire that if an experiment is…

Methodology · Statistics 2018-02-16 Ioannis Kosmidis

Tests and estimation strategies associated to some loss functions

We consider the problem of estimating the joint distribution of $n$ independent random variables. Our approach is based on a family of candidate probabilities that we shall call a model and which is chosen to either contain the true…

Statistics Theory · Mathematics 2021-06-01 Yannick Baraud

Reducing bias and variance in quantile estimates with an exponential model

Percentiles and, more generally, quantiles are commonly used in various contexts to summarize data. For most distributions, there is exactly one quantile that is unbiased. For distributions like the Gaussian that have the same mean and…

Methodology · Statistics 2022-01-11 Rohit Pandey

Influence Functions for Machine Learning: Nonparametric Estimators for Entropies, Divergences and Mutual Informations

We propose and analyze estimators for statistical functionals of one or more distributions under nonparametric assumptions. Our estimators are based on the theory of influence functions, which appear in the semiparametric statistics…

Machine Learning · Statistics 2015-07-21 Kirthevasan Kandasamy, Akshay Krishnamurthy, Barnabas Poczos, Larry Wasserman, James M. Robins

Estimating the number of classes

Estimating the unknown number of classes in a population has numerous important applications. In a Poisson mixture model, the problem is reduced to estimating the odds that a class is undetected in a sample. The discontinuity of the odds…

Statistics Theory · Mathematics 2007-08-22 Chang Xuan Mao, Bruce G. Lindsay

Evaluating probability forecasts

Probability forecasts of events are routinely used in climate predictions, in forecasting default probabilities on bank loans or in estimating the probability of a patient's positive response to treatment. Scoring rules have long been used…

Statistics Theory · Mathematics 2012-02-24 Tze Leung Lai, Shulamith T. Gross, David Bo Shen

A Theory of Uncertainty Variables for State Estimation and Inference

We develop a new framework of uncertainty variables to model uncertainty. An uncertainty variable is characterized by an uncertainty set, in which its realization is bound to lie, while the conditional uncertainty is characterized by a set…

Machine Learning · Statistics 2019-12-10 Rajat Talak, Sertac Karaman, Eytan Modiano

Non-probabilistic odds and forecasting with imperfect models

Probability forecasts are intended to account for the uncertainties inherent in forecasting. It is suggested that from an end-user's point of view probability is not necessarily sufficient to reflect uncertainties that are not simply the…

Statistics Theory · Mathematics 2015-01-22 Kevin Judd

Expertises : procédures statistiques d'aide à la décision

In this study, we introduce a new approach to statistical decision theory. Without using a loss function, we select good decision rules to choose between two hypotheses. We call them "experts". They are globally unbiased but also…

Statistics Theory · Mathematics 2007-06-13 Guy Morel

Non-deterministic inference using random set models: theory, approximation, and sampling method

A random set is a generalisation of a random variable, i.e. a set-valued random variable. The random set theory allows a unification of other uncertainty descriptions such as interval variable, mass belief function in Dempster-Shafer theory…

Numerical Analysis · Mathematics 2018-11-27 Truong-Vinh Hoang, Hermann G. Matthies

Bayesian Inference for Non-Parametric Extreme Value Theory

Statistical inference for extreme values of random events is difficult in practice due to low sample sizes and inaccurate models for the studied rare events. If prior knowledge for extreme values is available, Bayesian statistics can be…

Methodology · Statistics 2022-05-18 Tobias Kallehauge

A Note on High-Probability versus In-Expectation Guarantees of Generalization Bounds in Machine Learning

Statistical machine learning theory often tries to give generalization guarantees of machine learning models. Those models naturally underlie some fluctuation, as they are based on a data sample. If we were unlucky, and gathered a sample…

Machine Learning · Computer Science 2022-11-21 Alexander Mey