统计理论 — Scifaro

Interpolation of functionals of stochastic sequences with stationary increments from observations with noise

The problem of optimal estimation of linear functional ${{A}_{N}}\xi =\sum\limits_{k=0}^{N}{a(k)\xi (k)}\,$ depending on the unknown values of a stochastic sequence $\xi (m)$ with stationary $n$-th increments from observations of the…

统计理论 · 数学 2025-10-28 Maksym Luz , Mykhailo Moklyachuk

Confidence Sets for Multidimensional Scaling

We develop a formal statistical framework for classical multidimensional scaling (CMDS) applied to noisy dissimilarity data. We establish distributional convergence results for the embeddings produced by CMDS for various noise models, which…

统计理论 · 数学 2025-10-28 Siddharth Vishwanath , Ery Arias-Castro

Metric Entropy and Minimax Risk of Ellipsoids with an Application to Pinsker's Theorem

We study how large an $\ell^2$ ellipsoid is by introducing type-$\tau$ integrals that capture the average decay of its semi-axes. These integrals turn out to be closely related to standard complexity measures: we show that the metric…

统计理论 · 数学 2025-10-28 Thomas Allard

Sequential monitoring for distributional changepoint using degenerate U-statistics

We investigate the online detection of changepoints in the distribution of a sequence of observations using degenerate U-statistic-type processes. We study weighted versions of: an ordinary, CUSUM-type scheme, a Page-CUSUM-type scheme, and…

统计理论 · 数学 2025-10-28 Cooper Boniece , Lajos Horvath , Lorenzo Trapani

On hypoellipticity of degenerate operators in testing and detection problems

We study a class of degenerate diffusion generators that arise in sequential testing and quickest detection problems with partial information. The observation process is driven by $k$ independent Brownian motions, while the hidden state…

统计理论 · 数学 2025-10-28 Erhan Bayraktar , Yuqiong Wang

Optimal Nuisance Function Tuning for Estimating a Doubly Robust Functional under Proportional Asymptotics

In this paper, we explore the asymptotically optimal tuning parameter choice in ridge regression for estimating nuisance functions of a statistical functional that has recently gained prominence in conditional independence testing and…

统计理论 · 数学 2025-10-28 Sean McGrath , Debarghya Mukherjee , Rajarshi Mukherjee , Zixiao Jolene Wang

Uniform central limit theorems for non-stationary processes via relative weak convergence

Statistical inference for non-stationary data is hindered by the failure of classical central limit theorems (CLTs), not least because there is no fixed Gaussian limit to converge to. To resolve this, we introduce relative weak convergence,…

统计理论 · 数学 2025-10-28 Nicolai Palm , Thomas Nagler

On the Contractivity of Stochastic Interpolation Flow

We investigate stochastic interpolation, a recently introduced framework for high dimensional sampling which bears many similarities to diffusion modeling. Stochastic interpolation generates a data sample by first randomly initializing a…

统计理论 · 数学 2025-10-28 Mara Daniels

Ask for More Than Bayes Optimal: A Theory of Indecisions for Classification

Selective classification is a powerful tool for automated decision-making in high-risk scenarios, allowing classifiers to act only when confident and abstain when uncertainty is high. Given a target accuracy, our goal is to minimize…

统计理论 · 数学 2025-10-28 Mohamed Ndaoud , Peter Radchenko , Bradley Rava

Higher criticism for rare and weak non-proportional hazard deviations in survival analysis

We propose a method for comparing survival data based on the higher criticism of p-values obtained from multiple exact hypergeometric tests. The method accommodates non-informative right-censorship and is sensitive to hazard differences in…

统计理论 · 数学 2025-10-28 Alon Kipnis , Ben Galili , Zohar Yakhini

Thresholded Lasso for high dimensional variable selection

Given $n$ noisy samples with $p$ dimensions, where $n \ll p$, we show that the multi-step thresholding procedure based on the Lasso -- we call it the {\it Thresholded Lasso}, can accurately estimate a sparse vector $\beta \in {\mathbb R}^p$…

统计理论 · 数学 2025-10-28 Shuheng Zhou

Graphical Finite Population Sampling

This paper introduces an innovative and intuitive finite population sampling method that has been developed using a unique graphical framework. In this approach, first-order inclusion probabilities are represented as bars on a…

统计理论 · 数学 2025-10-28 Bardia Panahbehagh

Weighted residual empirical processes, martingale transformations, and model specification tests for regressions with diverging number of parameters

This paper explores hypothesis testing for the parametric forms of the mean and variance functions in regression models under diverging-dimension settings. To mitigate the curse of dimensionality, we introduce weighted residual empirical…

统计理论 · 数学 2025-10-28 Falong Tan , Xu Guo , Lixing Zhu

Scale estimation and rate-unbiasedness for Gaussian processes under smoothness misspecification

Gaussian process regression is used throughout statistics and machine learning for prediction and uncertainty quantification. A Gaussian process is specified by its mean and covariance functions. Many covariance functions, including…

统计理论 · 数学 2025-10-28 Toni Karvonen , François Bachoc

R\'enyi entropy for multivariate controlled autoregressive moving average systems

R\'enyi entropy is an important measure in the context of information theory as a generalization of Shannon entropy. This information measure was often used for uncertainty quantification of dynamical behaviour of stochastic processes. In…

统计理论 · 数学 2025-10-28 Salah H. Abid , Uday J. Quaez , Javier E. Contreras-Reyescor

Conditional Forecasts and Proper Scoring Rules for Reliable and Accurate Performative Predictions

Performative predictions are forecasts which influence the outcomes they aim to predict, undermining the existence of correct forecasts and standard methods of elicitation and estimation. We show that conditioning forecasts on covariates…

统计理论 · 数学 2025-10-27 Philip Boeken , Onno Zoeter , Joris M. Mooij

Kriging measure-valued data with sparse observations: application to nuclear safety studies

This work addresses the interpolation of probability measures within a spatial statistics framework. We develop a Kriging approach in the Wasserstein space, leveraging the quantile function representation of the one-dimensional Wasserstein…

统计理论 · 数学 2025-10-27 Florian Gossard , François Bachoc , Jean Baccou , Thibaut Le Gouic , Jacques Liandrat , Tony Glantz

A Geometric Analysis of PCA

What property of the data distribution determines the excess risk of principal component analysis? In this paper, we provide a precise answer to this question. We establish a central limit theorem for the error of the principal subspace…

统计理论 · 数学 2025-10-27 Ayoub El Hanchi , Murat Erdogdu , Chris Maddison

PAC-Bayes Bounds on Variational Tempered Posteriors for Markov Models

Datasets displaying temporal dependencies abound in science and engineering applications, with Markov models representing a simplified and popular view of the temporal dependence structure. In this paper, we consider Bayesian settings that…

统计理论 · 数学 2025-10-27 Imon Banerjee , Vinayak A. Rao , Harsha Honnappa

Bootstrap Consistency for Empirical Likelihood in Density Ratio Models

We establish the validity of bootstrap methods for empirical likelihood (EL) inference under the density ratio model (DRM). In particular, we prove that the bootstrap maximum EL estimators share the same limiting distribution as their…

统计理论 · 数学 2025-10-24 Weiwei Zhuang , Weiqi Yang , Jiahua Chen