Statistics Theory - Scifaro

Distributional regression with reject option

Selective prediction, where a model has the option to abstain from making a decision, is crucial for machine learning applications in which mistakes are costly. In this work, we focus on distributional regression and introduce a framework…

Statistics Theory · Mathematics 2025-04-01 Ahmed Zaoui, Clément Dombry

Finite sample valid confidence sets of mode

Estimating the mode of a unimodal distribution is a classical problem in statistics. Although there are several approaches for point-estimation of mode in the literature, very little has been explored about the interval-estimation of mode.…

Statistics Theory · Mathematics 2025-04-01 Manit Paul, Arun Kumar Kuchibhotla

On Finite Time Span Estimators of Parameters for Ornstein-Uhlenbeck Processes

We study the bias and the mean-squared error of the maximum likelihood estimators (MLE) of parameters associated with a two-parameter mean-reverting process for a finite time $T$. Using the likelihood ratio process, we derive the…

Statistics Theory · Mathematics 2025-04-01 Jun S. Han, Nino Kordzakhia

Tracy-Widom, Gaussian, and Bootstrap: Approximations for Leading Eigenvalues in High-Dimensional PCA

Under certain conditions, the largest eigenvalue of a sample covariance matrix undergoes a well-known phase transition when the sample size $n$ and data dimension $p$ diverge proportionally. In the subcritical regime, this eigenvalue has…

Statistics Theory · Mathematics 2025-04-01 Nina Dörnemann, Miles E. Lopes

General reproducing properties in RKHS with application to derivative and integral operators

In this paper, we consider the reproducing property in Reproducing Kernel Hilbert Spaces (RKHS). We establish a reproducing property for the closure of the class of combinations of composition operators under minimal conditions. This allows…

Statistics Theory · Mathematics 2025-04-01 Fatima-Zahrae El-Boukkouri, Josselin Garnier, Olivier Roustant

On universal inference in Gaussian mixture models

A recent line of work provides new statistical tools based on game theory and achieves safe anytime-valid inference without assuming regularity conditions. In particular, the framework of universal inference proposed by Wasserman, Ramdas…

Statistics Theory · Mathematics 2025-04-01 Hongjian Shi, Mathias Drton

Unsupervised domain adaptation under hidden confounding

We introduce a new predictive mechanism that operates in the presence of hidden confounding across distributionally diverse data sources while ensuring consistent estimation of causal parameters, despite their recognized suboptimality for…

Statistics Theory · Mathematics 2025-04-01 Carlos García Meixide, David Ríos Insua

Asymptotic theory for Bayesian inference and prediction: from the ordinary to a conditional Peaks-Over-Threshold method

The Peaks Over Threshold (POT) method is the most popular statistical method for the analysis of univariate extremes. Even though there is a rich applied literature on Bayesian inference for the POT, the asymptotic theory for such proposals…

Statistics Theory · Mathematics 2025-04-01 Clément Dombry, Simone A. Padoan, Stefano Rizzelli

Phase-type frailty models: A flexible approach to modeling unobserved heterogeneity in survival analysis

Frailty models are essential tools in survival analysis for addressing unobserved heterogeneity and random effects in the data. These models incorporate a random effect, the frailty, which is assumed to impact the hazard rate…

Statistics Theory · Mathematics 2025-04-01 Jorge Yslas

Asymptotic Behavior of Principal Component Projections for Multivariate Extremes

The extremal dependence structure of a regularly varying $d$-dimensional random vector can be described by its angular measure. The standard nonparametric estimator of this measure is the empirical measure of the observed angles of the $k$…

Statistics Theory · Mathematics 2025-03-31 Holger Drees

The maximum likelihood type estimator of SDEs with fractional Brownian motion under small noise asymptotics in the rough case

We study the problem of parametric estimation for a continuously observed stochastic differential equation driven by fractional Brownian motion. Under some assumptions on drift and diffusion coefficients, we construct maximum likelihood…

Statistics Theory · Mathematics 2025-03-31 Shohei Nakajima

Adversarially Robust Topological Inference

The distance function to a compact set plays a crucial role in the paradigm of topological data analysis. In particular, the sublevel sets of the distance function are used in the computation of persistent homology, a backbone of the…

Statistics Theory · Mathematics 2025-03-31 Siddharth Vishwanath, Bharath K. Sriperumbudur, Kenji Fukumizu, Satoshi Kuriki

Use of copula functions in error assessment due to deviation from dependence assumption

In this paper, we analyze the relative errors in various reliability measures due to the tacit assumption that the components of an $n$-component series system or a parallel system work independently, where the components…

Statistics Theory · Mathematics 2025-03-28 Subarna Bhattacharjee, Aninda Kumar Nanda, Subhashree Patra

Use of stochastic orders and statistical dependence in error analysis for multi-component system

In this paper, we analyze the relative errors that arise in various reliability measures due to the tacit assumption that the components of an $n$-component series system or a parallel system work independently…

Statistics Theory · Mathematics 2025-03-28 Subarna Bhattacharjee, Aninda Kumar Nanda, Subhashree Patra

Variable selection via thresholding

Variable selection is an important step in many modern statistical inference procedures. In the regression setting, when estimators cannot shrink irrelevant signals to zero, covariates without relationships to the response often…

Statistics Theory · Mathematics 2025-03-28 Ka Long Keith Ho, Hien Duy Nguyen

On statistical and causal models associated with acyclic directed mixed graphs

Causal models in statistics are often described using acyclic directed mixed graphs (ADMGs), which contain directed and bidirected edges and no directed cycles. This article surveys various interpretations of ADMGs, discusses their…

Statistics Theory · Mathematics 2025-03-28 Qingyuan Zhao

Clustered Switchback Designs for Experimentation Under Spatio-temporal Interference

We consider experimentation in the presence of non-stationarity, inter-unit (spatial) interference, and carry-over effects (temporal interference), where we wish to estimate the global average treatment effect (GATE), the difference between…

Statistics Theory · Mathematics 2025-03-28 Su Jia, Nathan Kallus, Christina Lee Yu

Revisiting general source condition in learning over a Hilbert space

In Learning Theory, the smoothness assumption on the target function (known as source condition) is a key factor in establishing theoretical convergence rates for an estimator. The existing general form of the source condition, as discussed…

Statistics Theory · Mathematics 2025-03-27 Naveen Gupta, S. Sivananthan

Nonparametric MLE for Gaussian Location Mixtures: Certified Computation and Generic Behavior

We study the nonparametric maximum likelihood estimator $\widehat{\pi}$ for Gaussian location mixtures in one dimension. It has been known since Lindsay (1983) that given an $n$-point dataset, this estimator always returns a mixture with…

Statistics Theory · Mathematics 2025-03-27 Yury Polyanskiy, Mark Sellke

Functional structural equation models with out-of-sample guarantees

Statistical learning methods typically assume that the training and test data originate from the same distribution, enabling effective risk minimization. However, real-world applications frequently involve distributional shifts, leading to…

Statistics Theory · Mathematics 2025-03-27 Philip Kennerberg, Ernst C. Wit