Statistics Theory (Scifaro)

Decoupling and randomization for double-indexed permutation statistics

This paper introduces a version of decoupling and randomization to establish concentration inequalities for double-indexed permutation statistics. The results yield, among other applications, a new combinatorial Hanson-Wright inequality and…

Statistics Theory · Mathematics 2026-03-23 Mingxuan Zou, Jingfan Xu, Peng Ding, Fang Han

Sample complexity for divergence regularized optimal transport with radial cost

We prove a new sample complexity result for divergence regularized optimal transport. Our bound holds for probability measures on $\mathbb{R}^d$ with exponential tail decay and for radial cost functions that satisfy a local Lipschitz…

Statistics Theory · Mathematics 2026-03-23 Ruiyu Han, Johannes Wiesel

Adversarial Estimation of Assortment Probabilities under Independence Structure

We consider the problem of estimating assortment probabilities, which is common in operations management applications, including product bundling, advertising, etc. Existing approaches typically model each assortment as a category and apply…

Statistics Theory · Mathematics 2026-03-23 Alexandre Belloni, Yan Chen, Matthew Harding

On the relation between likelihood ratios and p-values for testing success probabilities of Bernoulli trials

It is well known that there is no direct one-to-one relation between $p$-values and likelihood ratios or Bayes factors, since their relation crucially involves the sample size $n$. We investigate their (asymptotic) relation in a…

Statistics Theory · Mathematics 2026-03-23 Wouter Kager, Ronald Meester

A genuine test for hyperuniformity

We introduce a rigorous and sensitive significance test for hyperuniformity that yields reliable results even from a single sample. Our approach is based on a detailed analysis of the empirical Fourier transform of a stationary point…

Statistics Theory · Mathematics 2026-03-23 Michael A. Klatt, Günter Last, Norbert Henze

Finite-sample bounds for multi-output system identification

This paper presents uniform-in-time finite-sample bounds for regularized linear regression with vector-valued outputs and conditionally zero-mean subgaussian noise. By revisiting classical self-normalized martingale arguments, we obtain…

Statistics Theory · Mathematics 2026-03-20 Léo Simpson, Katrin Baumgärtner, Johannes Köhler, Moritz Diehl

Sometimes nonparametrics beat parametrics, even when the model is right

A basic issue in both teaching of and practice of statistics is the interplay between modelling assumptions and inference performance. The general message conveyed is that stronger assumptions lead to better statistical performance of the…

Statistics Theory · Mathematics 2026-03-20 Morten Byholt, Nils Lid Hjort

Approximation by mixtures of multivariate Erlang distributions

We prove that finite multivariate Erlang mixture densities with a common rate parameter are dense in the class of probability densities on $\mathbb{R}_{+}^{d}$ that belong to $L^{p}$, for every dimension $d\in\mathbb{N}$ and every $1\le…

Statistics Theory · Mathematics 2026-03-20 Hien Duy Nguyen

The minimax optimal convergence rate of posterior density in the weighted orthogonal polynomials

We investigate Bayesian nonparametric density estimation via orthogonal polynomial expansions in weighted Sobolev spaces. A core challenge is establishing minimax optimal posterior convergence rates, especially for densities on unbounded…

Statistics Theory · Mathematics 2026-03-20 Yiqi Luo, Xue Luo

Minimax Optimal Estimation of Mean and Covariance Functions with Spectral Regularization

Estimation of the mean and covariance functions is a fundamental problem in functional data analysis, particularly for discretely observed functional data. In this work, we study a regularization-based framework for estimating the mean and…

Statistics Theory · Mathematics 2026-03-20 Naveen Gupta, Bharath K Sriperumbudur

Highly Adaptive Empirical Risk Minimization with Principal Components

The Highly Adaptive Lasso (HAL) delivers unprecedented guarantees in nonparametric minimum loss estimation under minimal smoothness assumptions, such as dimension-free minimax optimal rates. However, the practical use of HAL has been…

Statistics Theory · Mathematics 2026-03-20 Carlos García Meixide, Mingxun Wang, Alejandro Schuler, Mark J. van der Laan

$K$-means with learned metrics

We study the Fréchet $k$-means of a metric measure space when both the measure and the distance are unknown and have to be estimated. We prove a general result that states that the $k$-means are continuous with respect to the measured…

Statistics Theory · Mathematics 2026-03-20 Pablo Groisman, Matthieu Jonckheere, Jordan Serres, Mariela Sued

The Pivotal Information Criterion

The Bayesian and Akaike information criteria aim at finding a good balance between under- and over-fitting. They are extensively used every day by practitioners. Yet we contend they suffer from at least two afflictions: their penalty…

Statistics Theory · Mathematics 2026-03-20 Sylvain Sardy, Maxime van Cutsem, Sara van de Geer

Optimal rates for density and mode estimation with expand-and-sparsify representations

Expand-and-sparsify representations are a class of theoretical models that capture sparse representation phenomena observed in the sensory systems of many animals. At a high level, these representations map an input $x \in \mathbb{R}^d$ to…

Statistics Theory · Mathematics 2026-03-20 Kaushik Sinha, Christopher Tosh

Bayesian Prediction under Moment Conditioning

Prediction is a central task of statistics and machine learning, yet many inferential settings provide only partial information, typically in the form of moment constraints or estimating equations. We develop a finite, fully Bayesian…

Statistics Theory · Mathematics 2026-03-20 Nicholas G. Polson, Daniel Zantedeschi

Identifiability and Estimation in Continuous Lyapunov Models

Cross-sectional observations from a dynamical system can be modeled via steady-state distributions of Markov processes. The major challenge is then to determine whether the process parameters can be identified and estimated from the…

Statistics Theory · Mathematics 2026-03-19 Cecilie Olesen Recke, Niels Richard Hansen

Tessellation Localized Transfer learning for nonparametric regression

Transfer learning aims to improve performance on a target task by leveraging information from related source tasks. We propose a nonparametric regression transfer learning framework that explicitly models heterogeneity in the source-target…

Statistics Theory · Mathematics 2026-03-19 Hélène Halconruy, Benjamin Bobbia, Paul Lejamtel

The Honest Truth About Causal Trees: Accuracy Limits for Heterogeneous Treatment Effect Estimation

Recursive decision trees are widely used to estimate heterogeneous causal treatment effects in experimental and observational studies. These methods are typically implemented using CART-type recursive partitioning and are often viewed as…

Statistics Theory · Mathematics 2026-03-19 Matias D. Cattaneo, Jason M. Klusowski, Ruiqi Rae Yu

Identifiability of VAR(1) model in a stationary setting

We consider a classical First-order Vector AutoRegressive (VAR(1)) model, where we interpret the autoregressive interaction matrix as influence relationships among the components of the VAR(1) process that can be encoded by a weighted…

Statistics Theory · Mathematics 2026-03-19 Bixuan Liu

On Separability of Covariance in Multiway Data Analysis

Multiway data analysis aims to uncover patterns in data structured as multi-indexed arrays, with multiway covariance playing a crucial role in many applications. However, the high dimensionality of multiway covariance presents significant…

Statistics Theory · Mathematics 2026-03-19 Dogyoon Song, Alfred O. Hero