统计理论 — Scifaro

Effective regions and kernels in continuous sparse regularisation, with application to sketched mixtures

This paper advances the general theory of continuous sparse regularisation on measures with the Beurling-LASSO (BLASSO). This TV-regularised convex program on the space of measures allows to recover a sparse measure using a noisy…

统计理论 · 数学 2025-10-15 Yohann De Castro , Rémi Gribonval , Nicolas Jouvin

Gaussian Approximation for High-Dimensional $U$-statistics with Size-Dependent Kernels

Motivated by small bandwidth asymptotics for kernel-based semiparametric estimators in econometrics, this paper establishes Gaussian approximation results for high-dimensional fixed-order $U$-statistics whose kernels depend on the sample…

统计理论 · 数学 2025-10-15 Shunsuke Imai , Yuta Koike

Power comparison of sequential testing by betting procedures

In this paper, we derive power guarantees of some sequential tests for bounded mean under general alternatives. We focus on testing procedures using nonnegative supermartingales which are anytime valid and consider alternatives which…

统计理论 · 数学 2025-10-15 Amaury Durand , Olivier Wintenberger

Characterizing extremal dependence on a hyperplane

In this paper, we characterize the extremal dependence of $d$ asymptotically dependent variables by a class of random vectors on the $(d-1)$-dimensional hyperplane perpendicular to the diagonal vector $\mathbf1=(1,\ldots,1)$. This…

统计理论 · 数学 2025-10-15 Phyllis Wan

Identifiability and Falsifiability: Two Challenges for Bayesian Model Expansion

We study the identifiability of parameters and falsifiability of predictions under the process of model expansion in a Bayesian setting. Identifiability is represented by the closeness of the posterior to the prior distribution and…

统计理论 · 数学 2025-10-15 Collin Cademartori

Simultaneous Frequentist Calibration of Confidence Regions for Multiple Functionals in Constrained Inverse Problems

Many scientific analyses require simultaneous comparison of multiple functionals of an unknown signal at once, calling for multidimensional confidence regions with guaranteed simultaneous frequentist under structural constraints (e.g.,…

统计理论 · 数学 2025-10-14 Pau Batlle , Pratik Patil , Michael Stanley , Javier Ruiz Lupon , Houman Owhadi , Mikael Kuusela

TWIN: Two window inspection for online change point detection

We propose a new class of sequential change point tests, both for changes in the mean parameter and in the overall distribution function. The methodology builds on a two-window inspection scheme (TWIN), which aggregates data into symmetric…

统计理论 · 数学 2025-10-14 Patrick Bastian , Tim Kutta

Geometric Ergodicity of Gibbs Algorithms for a Normal Model With a Global-Local Shrinkage Prior

We consider Gibbs samplers for a normal linear regression model with a global-local shrinkage prior and show that they produce geometrically ergodic Markov chains. First, under the horseshoe local prior and a three-parameter beta global…

统计理论 · 数学 2025-10-14 Yasuyuki Hamura

Uniformly most powerful tests in linear models

In the multiple regression model we prove that the coefficient t-test for a variable of interest is uniformly most powerful unbiased, with the other parameters considered nuisance. The proof is based on the theory of tests with…

统计理论 · 数学 2025-10-14 Razvan G. Romanescu

Universality of estimators for high-dimensional linear models with block dependency

We study the universality property of estimators for high-dimensional linear models, which implies that the distribution of estimators is independent of whether the covariates follow a Gaussian distribution. Recent developments in…

统计理论 · 数学 2025-10-14 Toshiki Tsuda , Masaaki Imaizumi

Robust mean change point testing in high-dimensional data with heavy tails

We study mean change point testing problems for high-dimensional data, with exponentially- or polynomially-decaying tails. In each case, depending on the $\ell_0$-norm of the mean change vector, we separately consider dense and sparse…

统计理论 · 数学 2025-10-14 Mengchu Li , Yudong Chen , Tengyao Wang , Yi Yu

MLE convergence speed to information projection of exponential family: Criterion for model dimension and sample size -- complete proof version--

For a parametric model of distributions, the closest distribution in the model to the true distribution located outside the model is considered. Measuring the closeness between two distributions with the Kullback-Leibler (K-L) divergence,…

统计理论 · 数学 2025-10-14 Yo Sheena

Generalized Taylor's Law for Dependent and Heterogeneous Heavy-Tailed Data

Taylor's law, also known as fluctuation scaling in physics and the power-law variance function in statistics, is an empirical pattern widely observed across fields including ecology, physics, finance, and epidemiology. It states that the…

统计理论 · 数学 2025-10-13 Pok Him Cheng , Joel E. Cohen , Hok Kan Ling , Sheung Chi Phillip Yam

Online and Offline Robust Multivariate Linear Regression

We consider the robust estimation of the parameters of multivariate Gaussian linear regression models. To this aim we consider robust version of the usual (Mahalanobis) least-square criterion, with or without Ridge regularization. We…

统计理论 · 数学 2025-10-13 Antoine Godichon-Baggioni , Stephane S. Robin , Laure Sansonnet

Hypothesis testing on invariant subspaces of non-diagonalizable matrices with applications to network statistics

We generalise the inference procedure for eigenvectors of symmetrizable matrices of Tyler (1981) to that of invariant and singular subspaces of non-diagonalizable matrices. Wald tests for invariant vectors and $t$-tests for their individual…

统计理论 · 数学 2025-10-13 Jérôme R. Simons

Computational and statistical lower bounds for low-rank estimation under general inhomogeneous noise

Recent work has generalized several results concerning the well-understood spiked Wigner matrix model of a low-rank signal matrix corrupted by additive i.i.d. Gaussian noise to the inhomogeneous case, where the noise has a variance profile.…

统计理论 · 数学 2025-10-10 Debsurya De , Dmitriy Kunisky

Navigating Sparsities in High-Dimensional Linear Contextual Bandits

High-dimensional linear contextual bandit problems remain a significant challenge due to the curse of dimensionality. Existing methods typically consider either the model parameters to be sparse or the eigenvalues of context covariance…

统计理论 · 数学 2025-10-10 Rui Zhao , Zihan Chen , Zemin Zheng

Adaptive Thresholds for Monitoring and Screening in Imbalanced Samples: Optimality and Boosting Sensitivity

Suppose (standardized) measurements or statistics are monitored to raise an alarm when a threshold is exceeded. Often, the underlying population is heterogenous with respect to important discrete variables and thus samples may consist of…

统计理论 · 数学 2025-10-10 Ansgar Steland

Beyond independent component analysis: identifiability and algorithms

Independent Component Analysis (ICA) is a classical method for recovering latent variables with useful identifiability properties. For independent variables, cumulant tensors are diagonal; relaxing independence yields tensors whose zero…

统计理论 · 数学 2025-10-10 Alvaro Ribot , Anna Seigal , Piotr Zwiernik

Conditional distributions for the nested Dirichlet process via sequential imputation

We consider an array of random variables, taking values in a complete and separable metric space, that exhibits a kind of symmetry which we call row exchangeability. Given such an array, a natural model for Bayesian nonparametric inference…

统计理论 · 数学 2025-10-10 Evan Donald , Jason Swanson