Related papers: The Threshold Breakdown Point

Yet another breakdown point notion: EFSBP - illustrated at scale-shape models

The breakdown point in its different variants is one of the central notions to quantify the global robustness of a procedure. We propose a simple supplementary variant which is useful in situations where we have no obvious or only partial…

Methodology · Statistics 2015-03-17 Peter Ruckdeschel , Nataliya Horbenko

The Robustness of Estimator Composition

We formalize notions of robustness for composite estimators via the notion of a breakdown point. A composite estimator successively applies two (or more) estimators: on data decomposed into disjoint parts, it applies the first estimator on…

Machine Learning · Computer Science 2016-09-06 Pingfan Tang , Jeff M. Phillips

Robustness to missing data: breakdown point analysis

Missing data is pervasive in econometric applications, and rarely is it plausible that the data are missing (completely) at random. This paper proposes a methodology for studying the robustness of results drawn from incomplete datasets.…

Econometrics · Economics 2025-12-29 Daniel Ober-Reynolds

Bias robustness of depth estimators in multivariate settings

The concept of statistical depth extends the notions of the median and quantiles to other statistical models. These procedures aim to formalize the idea of identifying deeply embedded fits to a model that are less influenced by…

Statistics Theory · Mathematics 2026-05-11 Jorge G. Adrover , Marcelo Ruiz

Asymptotic Breakdown Point Analysis for a General Class of Minimum Divergence Estimators

Robust inference based on the minimization of statistical divergences has proved to be a useful alternative to classical techniques based on maximum likelihood and related methods. Basu et al. (1998) introduced the density power divergence…

Statistics Theory · Mathematics 2025-02-17 Subhrajyoty Roy , Abir Sarkar , Abhik Ghosh , Ayanendranath Basu

Asymptotic breakdown point analysis of the minimum density power divergence estimator under independent non-homogeneous setups

The minimum density power divergence estimator (MDPDE) has gained significant attention in the literature of robust inference due to its strong robustness properties and high asymptotic efficiency; it is relatively easy to compute and can…

Statistics Theory · Mathematics 2025-09-16 Suryasis Jana , Subhrajyoty Roy , Ayanendranath Basu , Abhik Ghosh

Breakdown points for maximum likelihood estimators of location-scale mixtures

ML-estimation based on mixtures of Normal distributions is a widely used tool for cluster analysis. However, a single outlier can make the parameter estimation of at least one of the mixture components break down. Among others, the…

Statistics Theory · Mathematics 2007-06-13 Christian Hennig

Template Matching and Change Point Detection by M-estimation

We consider the fundamental problem of matching a template to a signal. We do so by M-estimation, which encompasses procedures that are robust to gross errors (i.e., outliers). Using standard results from empirical process theory, we derive…

Statistics Theory · Mathematics 2020-09-10 Ery Arias-Castro , Lin Zheng

Trimming Stability Selection increases variable selection robustness

Contamination can severely distort an estimator unless the estimation procedure is suitably robust. This is a well-known issue and has been addressed in Robust Statistics, however, the relation of contamination and distorted variable…

Statistics Theory · Mathematics 2022-07-15 Tino Werner

Sample Complexity Bounds for Robust Mean Estimation with Mean-Shift Contamination

We study the basic task of mean estimation in the presence of mean-shift contamination. In the mean-shift contamination model, an adversary is allowed to replace a small constant fraction of the clean samples by samples drawn from…

Machine Learning · Computer Science 2026-02-27 Ilias Diakonikolas , Giannis Iakovidis , Daniel M. Kane , Sihan Liu

The broken sample problem revisited: Proof of a conjecture by Bai-Hsing and high-dimensional extensions

We revisit the classical broken sample problem: Two samples of i.i.d. data points $\mathbf{X}=\{X_1,\cdots, X_n\}$ and $\mathbf{Y}=\{Y_1,\cdots,Y_m\}$ are observed without correspondence with $m\leq n$. Under the null hypothesis,…

Statistics Theory · Mathematics 2025-03-20 Simiao Jiao , Yihong Wu , Jiaming Xu

Breakdown Properties of the M-Estimators of Multivariate Scatter

The M-estimators of multivariate scatter are known to have breakdown points no greater than 1/(p+1), where p is the dimension of the data. In high dimension, the breakdown points are usually considered to be disappointingly low. This paper…

Statistics Theory · Mathematics 2014-06-20 David E. Tyler

Exact Computation of Minimum Sample Size for Estimation of Binomial Parameters

It is a common contention that it is an ``impossible mission'' to exactly determine the minimum sample size for the estimation of a binomial parameter with prescribed margin of error and confidence level. In this paper, we investigate such…

Statistics Theory · Mathematics 2007-08-02 Xinjia Chen

High finite-sample efficiency and robustness based on distance-constrained maximum likelihood

Good robust estimators can be tuned to combine a high breakdown point and a specified asymptotic efficiency at a central model. This happens in regression with MM- and tau-estimators among others. However, the finite-sample efficiency of…

Statistics Theory · Mathematics 2013-11-21 Ricardo Maronna , Víctor Yohai

The exact amount of t-ness that the normal model can tolerate

Suppose that the normal model is used for data $Y_1,\ldots,Y_n$, but that the true distribution is a t-distribution with location and scale parameters $\xi$ and $\sigma$ and $m$ degrees of freedom. The normal model corresponds to…

Methodology · Statistics 2026-03-31 Nils Lid Hjort

An Automatic Finite-Sample Robustness Metric: When Can Dropping a Little Data Make a Big Difference?

Study samples often differ from the target populations of inference and policy decisions in non-random ways. Researchers typically believe that such departures from random sampling -- due to changes in the population over time and space, or…

Methodology · Statistics 2023-07-20 Tamara Broderick , Ryan Giordano , Rachael Meager

A change-point problem and inference for segment signals

We address the problem of detection and estimation of one or two change-points in the mean of a series of random variables. We use the formalism of set estimation in regression: To each point of a design is attached a binary label that…

Statistics Theory · Mathematics 2018-09-07 Victor-Emmanuel Brunel

Truncating the Exponential with a Uniform Distribution

For a sample of Exponentially distributed durations we aim at point estimation and a confidence interval for its parameter. A duration is only observed if it has ended within a certain time interval, determined by a Uniform distribution.…

Methodology · Statistics 2021-10-19 Rafael Weißbach , Dominik Wied

The Sample Complexity of Lossless Data Compression

A new framework is introduced for examining and evaluating the fundamental limits of lossless data compression, that emphasizes genuinely non-asymptotic results. The {\em sample complexity} of compressing a given source is defined as the…

Information Theory · Computer Science 2026-04-16 Terence Viaud , Ioannis Kontoyiannis

Adversarially robust change point detection

Change point detection is becoming increasingly popular in many application areas. On one hand, most of the theoretically-justified methods are investigated in an ideal setting without model violations, or merely robust against identical…

Methodology · Statistics 2021-10-26 Mengchu Li , Yi Yu