统计理论 — Scifaro

Differentially Private Two-Stage Empirical Risk Minimization with Applications to Individualized Treatment Rule

Differential privacy provides a formal framework for releasing statistical estimators that limit how much any single observation can influence the output, by injecting calibrated random noise. We study differentially private estimation in…

统计理论 · 数学 2026-05-26 Joowon Lee , Guanhua Chen

Double Local-to-Unity: Inference under Nearly Nonstationary Volatility

This article develops a moderate-deviation limit theory for autoregressive models with jointly persistent mean and volatility dynamics. The autoregressive coefficient is allowed to drift toward unity slower than the classical 1/n rate,…

统计理论 · 数学 2026-05-26 Abir Sarkar , Martin T. Wells

Gaussian Approximation for High-Dimensional Second-Order $U$- and $V$-statistics with Size-Dependent Kernels under i.n.i.d. Sampling

We develop Gaussian approximations for high-dimensional vectors formed by second-order $U$- and $V$-statistics whose kernels depend on sample size under independent but not identically distributed (i.n.i.d.) sampling. Our results hold…

统计理论 · 数学 2026-05-26 Shunsuke Imai

A robust and scalable estimation for high-dimensional volatility models

This paper introduces a robust and computationally efficient estimation framework for high-dimensional volatility models in the BEKK-ARCH class. The proposed approach employs data truncation to ensure robustness against heavy-tailed…

统计理论 · 数学 2026-05-26 Kejun Chen , Yuchang Lin , Qianqian Zhu

Estimation of Population Linear Spectral Statistics by Marchenko--Pastur Inversion

A new method of estimating population linear spectral statistics from high-dimensional data is introduced. When the dimension $d$ grows with the sample size $n$ such that $\frac{d}{n} \to c>0$, the proposed method is the first with proven…

统计理论 · 数学 2026-05-26 Ben Deitmar

Asymptotic e-processes

We investigate the concept of an asymptotic e-process, which is a doubly-indexed stochastic process $(E_{m,n})_{m,n\in\mathbb{N}}$ that possesses, asymptotically for an approximation index $m\to\infty$, the properties of an e-process along…

统计理论 · 数学 2026-05-25 Pierre-François Massiani , Sebastian Schulze , Mattes Mollenhauer

The Integer-valued Moving-Average Random Field

An integer-valued moving average (INMA) model for count random fields is proposed and investigated. Closed-form expressions are derived for both its marginal distribution and spatial dependence structure, for arbitrary model order and also…

统计理论 · 数学 2026-05-25 Angelika Silbernagel , Christian H. Weiß

Causal inference via implied interventions

In the context of having an instrumental variable, the standard practice in causal inference begins by targeting an effect of interest and proceeds by formulating assumptions enabling its identification. We turn this around by adhering to…

统计理论 · 数学 2026-05-25 Carlos García Meixide , Mark J. van der Laan

Measures of association for approximating copulas

This paper studies closed-form expressions for multiple association measures of copulas commonly used for approximation purposes, including Bernstein, shuffle--of--min, checkerboard and check--min copulas. In particular, closed-form…

统计理论 · 数学 2026-05-25 Marcus Rockel

The feasibility of multi-graph alignment: a Bayesian approach

We establish thresholds for the feasibility of random multi-graph alignment in two models. In the Gaussian model, we demonstrate an "all-or-nothing" phenomenon: above a critical threshold, exact alignment is achievable with high…

统计理论 · 数学 2026-05-25 Louis Vassaux , Laurent Massoulié

A Circular Chatterjee's Correlation Coefficient

Chatterjee's rank correlation is a directed measure of association designed to detect whether one variable can be predicted as a function of another. While the original coefficient is naturally defined for real-valued data, circular data…

统计理论 · 数学 2026-05-22 Sourav Majumdar

Robust Statistical Estimators with Bounded Empirical Sensitivity

We introduce a new measure of robustness for statistical estimators, which we call \emph{empirical sensitivity}. An estimator $\hat \theta$ has bounded empirical sensitivity if, with high probability over a dataset $X = (X_1, \dots, X_n)…

统计理论 · 数学 2026-05-22 Valentio Iverson , Gautam Kamath , Argyris Mouzakis , Adam Smith

Batch learning equals online learning in Bayesian supervised learning

In this paper we study Bayesian supervised learning models proposed by L\^e in \cite{Le2025}. We show the existence of Bayesian inversions on universal Bayesian supervised learning models $(\mathcal{P}(\mathcal{Y})^{\mathcal{X}}, \mu,…

统计理论 · 数学 2026-05-22 Hông Vân Lê

Fast Wasserstein rates for estimating probability distributions of probabilistic graphical models

Using i.i.d. data to estimate a high-dimensional distribution in Wasserstein distance is a fundamental instance of the curse of dimensionality. We explore how structural knowledge about the data-generating process which gives rise to the…

统计理论 · 数学 2026-05-22 Daniel Bartl , Stephan Eckstein

Likelihood landscape of binary latent model on a tree

We investigate the optimization landscape of maximum likelihood estimation (MLE) for the Cavender-Farris-Neyman (CFN) model, a two-state latent tree model fundamental to statistical phylogenetics and the ferromagnetic Ising model. Although…

统计理论 · 数学 2026-05-22 David Clancy , Hanbaek Lyu , Sebastien Roch

Honest Inference for Stochastic Optimization

This manuscript studies a general approach to construct confidence sets for the solution of stochastic optimization, rendering empirical risk minimization as special cases. Statistical inference for stochastic optimization poses significant…

统计理论 · 数学 2026-05-22 Kenta Takatsu , Arun Kumar Kuchibhotla

Data driven extreme value distribution estimation: Derivation of the Mean Integrated Squared Error, optimal bandwidth selection and stability conditions

We introduce the data driven extreme value distribution (DDEVD) estimator, a kernel-based method for estimating extreme value distributions from data. We derive its mean integrated squared error (MISE) in detail, use it to compute the…

统计理论 · 数学 2026-05-21 Michael Sandbichler , Tobias Hell

$L^2$ over Wasserstein: Statistical Analysis for Optimal Transport

Optimal transport provides an inherently geometric and highly structured framework for studying spaces of probability measures, supplying a rich theoretical toolkit for contemporary statistics, machine learning, and generative modelling. In…

统计理论 · 数学 2026-05-21 Riccardo Passeggeri , Rohan M. Shenoy , Pengcheng Ye

Linear Functional Testing with General Loadings in Sparse Regression: Separation Rates and Computational Barriers

We study the problem of testing $H_0: \xi^\top\beta=t_0$ in high-dimensional sparse linear regression with Gaussian random design and unknown design covariance. The loading vector $\xi$ is arbitrary, and the exact sparsity level $k$ is…

统计理论 · 数学 2026-05-21 Jie Xie , Dongming Huang

Revisiting the Misspecified Cram\'er-Rao Bound

Estimation under model misspecification arises in many signal processing problems, where the assumed observation model deviates from the true data-generating mechanism due to errors or simplifications. The misspecified Cram\'er-Rao bound…

统计理论 · 数学 2026-05-21 Malaak Khatib , Nadav Harel , Joseph Tabrikian , Tirza Routtenberg