Related papers: A rigorous lower confidence bound for the expectat…

Time-uniform confidence bands for the CDF under nonstationarity

Estimation of the complete distribution of a random variable is a useful primitive for both manual and automated decision making. This problem has received extensive attention in the i.i.d. setting, but the arbitrary data dependent setting…

Machine Learning · Statistics 2023-03-01 Paul Mineiro, Steven R. Howard

Upper bounds on the minimum coverage probability of confidence intervals in regression after variable selection

We consider a linear regression model, with the parameter of interest a specified linear combination of the regression parameter vector. We suppose that, as a first step, a data-based model selection (e.g. by preliminary hypothesis tests or…

Statistics Theory · Mathematics 2011-09-27 Paul Kabaila, Khageswor Giri

Finite sample valid confidence sets of mode

Estimating the mode of a unimodal distribution is a classical problem in statistics. Although there are several approaches for point-estimation of mode in the literature, very little has been explored about the interval-estimation of mode.…

Statistics Theory · Mathematics 2025-04-01 Manit Paul, Arun Kumar Kuchibhotla

Is distribution-free inference possible for binary regression?

For a regression problem with a binary label response, we examine the problem of constructing confidence intervals for the label probability conditional on the features. In a setting where we do not have any information about the underlying…

Statistics Theory · Mathematics 2020-10-09 Rina Foygel Barber

The distribution of a linear predictor after model selection: Unconditional finite-sample distributions and asymptotic approximations

We analyze the (unconditional) distribution of a linear predictor that is constructed after a data-driven model selection step in a linear regression model. First, we derive the exact finite-sample cumulative distribution function (cdf) of…

Statistics Theory · Mathematics 2008-12-02 Hannes Leeb

Confidence regions for the multinomial parameter with small sample size

Consider the observation of n iid realizations of an experiment with d>1 possible outcomes, which corresponds to a single observation of a multinomial distribution M(n,p) where p is an unknown discrete distribution on {1,...,d}. In many…

Computation · Statistics 2010-06-15 Djalil Chafai, Didier Concordet

Optimal Confidence Regions for the Multinomial Parameter

Construction of tight confidence regions and intervals is central to statistical inference and decision making. This paper develops new theory showing minimum average volume confidence regions for categorical data. More precisely, consider…

Machine Learning · Statistics 2021-02-01 Matthew L. Malloy, Ardhendu Tripathy, Robert D. Nowak

The limits of distribution-free conditional predictive inference

We consider the problem of distribution-free predictive inference, with the goal of producing predictive coverage guarantees that hold conditionally rather than marginally. Existing methods such as conformal prediction offer marginal…

Statistics Theory · Mathematics 2020-04-16 Rina Foygel Barber, Emmanuel J. Candès, Aaditya Ramdas, Ryan J. Tibshirani

A New Confidence Interval for the Mean of a Bounded Random Variable

We present a new method for constructing a confidence interval for the mean of a bounded random variable from samples of the random variable. We conjecture that the confidence interval has guaranteed coverage, i.e., that it contains the…

Statistics Theory · Mathematics 2020-11-05 Erik Learned-Miller, Philip S. Thomas

A Probabilistic Upper Bound on Differential Entropy

A novel, non-trivial, probabilistic upper bound on the entropy of an unknown one-dimensional distribution, given the support of the distribution and a sample from that distribution, is presented. No knowledge beyond the support of the…

Information Theory · Computer Science 2007-07-13 Joseph DeStefano, Erik Learned-Miller

Confidence Intervals for Low-Dimensional Parameters in High-Dimensional Linear Models

The purpose of this paper is to propose methodologies for statistical inference of low-dimensional parameters with high-dimensional data. We focus on constructing confidence intervals for individual coefficients and linear combinations of…

Methodology · Statistics 2012-11-05 Cun-Hui Zhang, Stephanie S. Zhang

On approximate robust confidence distributions

A confidence distribution is a complete tool for making frequentist inference for a parameter of interest $\psi$ based on an assumed parametric model. Indeed, it allows to reach point estimates, to assess their precision, to set up tests…

Methodology · Statistics 2022-12-20 Elena Bortolato, Laura Ventura

Confidence Regions for Parameters of Negative Binomial Distribution

We describe a general method for the construction of a confidence region for the two parameters of the Negative Binomial Distribution. This is achieved by expanding the sampling distribution of Method-of-Moments estimators, using the…

Statistics Theory · Mathematics 2016-12-28 Emmanuel Nkingi, Jan Vrbik

Reliable Programmatic Weak Supervision with Confidence Intervals for Label Probabilities

The accurate labeling of datasets is often both costly and time-consuming. Given an unlabeled dataset, programmatic weak supervision obtains probabilistic predictions for the labels by leveraging multiple weak labeling functions (LFs) that…

Machine Learning · Statistics 2025-08-07 Verónica Álvarez, Santiago Mazuelas, Steven An, Sanjoy Dasgupta

High Probability Lower Bounds for the Total Variation Distance

The statistics and machine learning communities have recently seen a growing interest in classification-based approaches to two-sample testing. The outcome of a classification-based two-sample test remains a rejection decision, which is not…

Statistics Theory · Mathematics 2022-11-15 Loris Michel, Jeffrey Näf, Nicolai Meinshausen

Distribution-Free Conditional Median Inference

We consider the problem of constructing confidence intervals for the median of a response $Y \in \mathbb{R}$ conditional on features $X \in \mathbb{R}^d$ in a situation where we are not willing to make any assumption whatsoever on the…

Statistics Theory · Mathematics 2021-09-07 Dhruv Medarametla, Emmanuel J. Candès

Tight Bounds on the Binomial CDF, and the Minimum of i.i.d Binomials, in terms of KL-Divergence

We provide finite sample upper and lower bounds on the Binomial tail probability which are a direct application of Sanov's theorem. We then use these to obtain high probability upper and lower bounds on the minimum of i.i.d. Binomial random…

Probability · Mathematics 2025-02-27 Xiaohan Zhu, Mesrob I. Ohannessian, Nathan Srebro

Data-driven Approximation of Distributionally Robust Chance Constraints using Bayesian Credible Intervals

The non-convexity and intractability of distributionally robust chance constraints make them challenging to cope with. From a data-driven perspective, we propose formulating it as a robust optimization problem to ensure that the…

Optimization and Control · Mathematics 2023-06-23 Zhiping Chen, Wentao Ma, Bingbing Ji

Confidence intervals with a priori parameter bounds

We review the methods of constructing confidence intervals that account for a priori information about one-sided constraints on the parameter being estimated. We show that the so-called method of sensitivity limit yields a correct solution…

Data Analysis, Statistics and Probability · Physics 2015-05-20 A. V. Lokhov, F. V. Tkachov

Robust Validation: Confident Predictions Even When Distributions Shift

While the traditional viewpoint in machine learning and statistics assumes training and testing samples come from the same population, practice belies this fiction. One strategy -- coming from robust statistics and optimization -- is thus…

Machine Learning · Statistics 2024-07-08 Maxime Cauchois, Suyash Gupta, Alnur Ali, John C. Duchi