统计理论 — Scifaro

Minimax unbiased estimation for finite populations with bounded outcomes

We study design-unbiased estimation of the finite-population total $\sum_{i=1}^N y_i$ when each outcome satisfies known bounds $y_i\in[a_i,b_i]$. For any sampling design with inclusion probabilities $\pi_i>0$, we prove a sharp lower bound…

统计理论 · 数学 2026-05-21 P. M. Aronow , Patrick Lopatto

Kernel Density Estimation under $C^{1,1}$ Regularity: AMISE, Weak Curvature, and Plug-in Bandwidths

Classical kernel density estimation usually derives the AMISE and optimal bandwidth from a pointwise Taylor expansion, which requires twice continuous differentiability. This assumption is stronger than necessary and excludes natural…

统计理论 · 数学 2026-05-21 Alireza Kabgani , Elaheh Lotfian

Finite-Sample Bounds for Expected Signature Estimation under Weak Dependence

The expected signature uniquely determines the law of a random rough path under a moment-growth condition, yet finite-sample bounds for estimating it from a single long dependent trajectory have been lacking. We study a stationary…

统计理论 · 数学 2026-05-21 Bryson Schenck

Variational Optimality of F\"ollmer Processes in Generative Diffusions

We construct and analyze generative diffusions that transport a point mass to a prescribed target distribution over a finite time horizon using the stochastic interpolant framework. The drift is expressed as a conditional expectation that…

统计理论 · 数学 2026-05-21 Yifan Chen , Eric Vanden-Eijnden

An improved central limit theorem for the empirical sliced Wasserstein distance

Wasserstein distances are widely used in modern data analysis but pose significant computational and statistical challenges in high dimensions. The sliced Wasserstein distance alleviates these challenges by leveraging one-dimensional…

统计理论 · 数学 2026-05-21 David Rodríguez-Vítores , Eustasio del Barrio , Jean-Michel Loubes

Drift estimation for rough processes under small noise asymptotic : trajectory fitting method

We consider a process $X^\ve$ that solves a stochastic Volterra equation with an unknown parameter $\theta^\star$ in the drift function. The Volterra kernel is singular, and includes as an example, $K\_0(u)=c u^{\alpha-1/2} \id{u>0}$ with…

统计理论 · 数学 2026-05-21 Arnaud Gloter , Nakahiro Yoshida

Limit theorems of matching estimators with a fixed number of matches

This paper re-examines the limit theorems of Abadie and Imbens for nearest-neighbor matching estimators of average treatment effects with a fixed number of matches. We establish, for the first time, a non-normalized central limit theorem…

统计理论 · 数学 2026-05-21 Songliang Chen , Fang Han

A Goodness-of-Fit Test for Independent Component Models in High Dimensions

Independent component (IC) models are a standard tool for representing multivariate data in statistics, signal processing, and machine learning. Despite the extensive use of IC models, much less attention has been given to goodness-of-fit…

统计理论 · 数学 2026-05-20 Mingshuo Liu , Siyao Wang , Miles E. Lopes

Error Bounds for Importance Sampling with Estimated Proposal Distributions

Importance sampling with data-driven proposal distributions is widely used in practice. A common workflow first generates an auxiliary sample of size $N$ from an approximation of the target distribution, constructs a density estimate $\hat…

统计理论 · 数学 2026-05-20 Cathrine Aeckerle-Willems , Ilja Klebanov , Simon Weissmann

Uniform projection designs under the stratified $L_2$-discrepancy

This paper studies a uniform projection criterion for space-filling designs under the stratified $L_2$-discrepancy. The criterion, denoted by $\Phi_{SD}$, is the average squared stratified $L_2$-discrepancy over all two-dimensional…

统计理论 · 数学 2026-05-20 Sixu Liu , Yaping Wang

Influence as soft sparsity: Estimation of monotone functions on $\{0,1\}^d$

We study the problem of estimating a monotone function $f:\{0,1\}^d\to[0,1]$ from noisy observations at uniformly random vertices of the Boolean hypercube. As a measure of complexity for the target~$f$, we use the total $L^1$-influence…

统计理论 · 数学 2026-05-20 Gérard Biau

Optimal Spectral Algorithms for Correlated Two-view Models in High Dimensions

We study high-dimensional inference in correlated two-view models, focusing on spectral methods for strong detection and weak recovery. We introduce a general framework, motivated by a TAP type heuristic from statistical physics, that…

统计理论 · 数学 2026-05-20 Hang Du , Henry Hu , Saba Lepsveridze

Inference Functionals and Observation Operators for Distributional Statistical Models

This paper generalises inference functions (Godambe, 1960) to distributional statistical models, in which each probability measure is represented by a distribution--kernel pair $(T_\theta, \varphi) \in \mathcal S'(\mathbb R) \times \mathcal…

统计理论 · 数学 2026-05-20 R. Labouriau

Minimax optimal submatrix detection: Sharp non-asymptotic rates

Given an observation $\mathbf Y \in \mathbb{R}^{d_1\times d_2}$ from the model $\mathbf Y = \mathbf X + \mathbf E$ where $\mathbf X$ is constant and $\mathbf E$ has i.i.d. $N(0,1)$ entries, we consider the problem of detecting a planted…

统计理论 · 数学 2026-05-20 Parker Knight , Julien Chhor

Entropic Strict Minimum Message Length and Its Connections to PAC-Bayes and NML

We introduce entropic strict minimum message length (SMML), a risk-sensitive generalization of strict minimum message length coding. The proposed criterion replaces expected two-part codelength under the prior predictive distribution with…

统计理论 · 数学 2026-05-20 Enes Makalic , Daniel F. Schmidt

Asymptotic properties of the multivariate Sz\'{a}sz-Mirakyan estimator for cumulative distribution functions on the nonnegative orthant

The asymptotic properties of multivariate Sz\'{a}sz-Mirakyan estimators for cumulative distribution functions (cdf) supported on the nonnegative orthant are investigated. Explicit bias and variance expansions are derived on compact subsets…

统计理论 · 数学 2026-05-20 Guanjie Lyu , Frédéric Ouimet , Cindy Feng

Drift estimation for rough processes under small noise asymptotic : QMLE approach

We consider a process $X^\ve$ solution of a stochastic Volterra equation with an unknown parameter $\theta^\star$ in the drift function. The Volterra kernel is singular near zero, exhibiting a behavior comparable to $K\_0(u)=cu^{\alpha-1}…

统计理论 · 数学 2026-05-20 Arnaud Gloter , Nakahiro Yoshida

The exact region and an inequality between Chatterjee's and Spearman's rank correlations

The rank correlation \xi(X,Y), recently established by Sourav Chatterjee and already popular in the statistics literature, takes values in [0,1], where 0 characterizes independence of X and Y, and 1 characterizes perfect dependence of Y on…

统计理论 · 数学 2026-05-20 Jonathan Ansari , Marcus Rockel

High-dimensional analysis of ridge regression for non-identically distributed data with a variance profile

High-dimensional linear regression has been thoroughly studied in the context of independent and identically distributed data. We propose to investigate high-dimensional regression models for independent but non-identically distributed…

统计理论 · 数学 2026-05-20 Jérémie Bigot , Issa-Mbenard Dabo , Camille Male

Sharp variance estimator and causal bootstrap in stratified randomized experiments

Randomized experiments are the gold standard for estimating treatment effects, and randomization serves as a reasoned basis for inference. In widely used stratified randomized experiments, randomization-based finite-population asymptotic…

统计理论 · 数学 2026-05-20 Haoyang Yu , Ke Zhu , Hanzhong Liu