统计理论 — Scifaro

PCA for Point Processes

We introduce a novel statistical framework for the analysis of replicated point processes that allows for the study of point pattern variability at a population level. By treating point process realizations as random measures, we adopt a…

统计理论 · 数学 2025-11-05 Franck Picard , Vincent Rivoirard , Angelina Roche , Victor Panaretos

Learning extremal graphical structures in high dimensions

Extremal graphical models encode the conditional independence structure of multivariate extremes. Key statistics for learning extremal graphical structures are empirical extremal variograms, for which we prove non-asymptotic concentration…

统计理论 · 数学 2025-11-05 Sebastian Engelke , Michaël Lalancette , Stanislav Volgushev

Theoretical analysis of phase-rectified signal averaging (PRSA) algorithm

Phase-rectified signal averaging (PRSA) is a widely used algorithm to analyze nonstationary biomedical time series. The method operates by identifying hinge points in the time series according to prescribed rules, extracting segments…

统计理论 · 数学 2025-11-04 Jiro Akahori , Joseph Najnudel , Hau-Tieng Wu , Ju-Yi Yen

Consistent estimation in subcritical birth-and-death processes

We investigate parameter estimation in subcritical continuous-time birth-and-death processes with multiple births. We show that the classical maximum likelihood estimators for the model parameters, based on the continuous observation of a…

统计理论 · 数学 2025-11-04 Sophie Hautphenne , Emma Horton

Stochastic comparisons of finite mixtures with general exponentiated location-scale distributed components

In this paper, we study stochastic ordering results between two finite mixtures with single and multiple outliers, assuming subpopulations follow general exponentiated location-scale distributions. For single-outlier mixtures, several…

统计理论 · 数学 2025-11-04 Raju Bhakta , Kaushik Gupta , Ghobad Saadat Kia , Suchandan Kayal

An LRD spectral test for irregularly discretely observed contaminated functional time series in manifolds

A statistical hypothesis test for long range dependence (LRD) in functional time series in manifolds has been formulated in Ruiz-Medina and Crujeiras (2025) in the spectral domain for fully observed functional data. The asymptotic Gaussian…

统计理论 · 数学 2025-11-04 M. D. Ruiz-Medina , R. M. Crujeiras

On Kernels and Covariance Structures in Hilbert Space Gaussian Processes

Motivated by practical applications, I present a novel and comprehensive framework for operator-valued positive definite kernels. This framework is applied to both operator theory and stochastic processes. The first application focuses on…

统计理论 · 数学 2025-11-04 Saeed Hashemi Sababe

Adaptive Algorithms for Infinitely Many-Armed Bandits: A Unified Framework

We consider a bandit problem where the buget is smaller than the number of arms, which may be infinite. In this regime, the usual objective in the literature is to minimize simple regret. To analyze broad classes of distributions with…

统计理论 · 数学 2025-11-04 Emmanuel Pilliat

Dynamical mean-field analysis of adaptive Langevin diffusions: Replica-symmetric fixed point and empirical Bayes

In many applications of statistical estimation via sampling, one may wish to sample from a high-dimensional target distribution that is adaptively evolving to the samples already seen. We study an example of such dynamics, given by a…

统计理论 · 数学 2025-11-04 Zhou Fan , Justin Ko , Bruno Loureiro , Yue M. Lu , Yandi Shen

Dynamical mean-field analysis of adaptive Langevin diffusions: Propagation-of-chaos and convergence of the linear response

Motivated by an application to empirical Bayes learning in high-dimensional regression, we study a class of Langevin diffusions in a system with random disorder, where the drift coefficient is driven by a parameter that continuously adapts…

统计理论 · 数学 2025-11-04 Zhou Fan , Justin Ko , Bruno Loureiro , Yue M. Lu , Yandi Shen

Testing Random Effects for Binomial Data

In modern scientific research, small-scale studies with limited participants are increasingly common. However, interpreting individual outcomes can be challenging, making it standard practice to combine data across studies using random…

统计理论 · 数学 2025-11-04 Lucas Kania , Larry Wasserman , Sivaraman Balakrishnan

On the breakdown point of transport-based quantiles

Recent work has used optimal transport ideas to generalize the notion of (center-outward) quantiles to dimension $d\geq 2$. We study the robustness properties of these transport-based quantiles by deriving their breakdown point, roughly,…

统计理论 · 数学 2025-11-04 Marco Avella-Medina , Alberto González-Sanz

Conditional uncorrelation equals independence

We show that the stochastic independence of real-valued random variables is equivalent to the conditional uncorrelation, where the conditioning takes place over the Cartesian products of intervals. Next, we express the mutual independence…

统计理论 · 数学 2025-11-04 Dawid Tarłowski

Curse of Dimensionality on Persistence Diagrams

The stability of persistent homology has led to wide applications of the persistence diagram as a trusted topological descriptor in the presence of noise. However, with the increasing demand for high-dimension and low-sample-size data…

统计理论 · 数学 2025-11-04 Yasuaki Hiraoka , Yusuke Imoto , Shu Kanazawa , Enhao Liu

Separation rates for the detection of synchronization of interacting point processes in a mean field frame. Application to neuroscience

Permutation tests have been proposed by Albert et al. (2015) to detect dependence between point processes, modeling in particular spike trains, that is the time occurrences of action potentials emitted by neurons. Our present work focuses…

统计理论 · 数学 2025-11-04 Josué Tchouanti , Éva Löcherbach , Patricia Reynaud-Bouret , Etienne Tanré

On the Variance, Admissibility, and Stability of Empirical Risk Minimization

It is well known that Empirical Risk Minimization (ERM) may attain minimax suboptimal rates in terms of the mean squared error (Birg\'e and Massart, 1993). In this paper, we prove that, under relatively mild assumptions, the suboptimality…

统计理论 · 数学 2025-11-04 Gil Kur , Eli Putterman , Alexander Rakhlin

A smooth transition from Wishart to GOE

It is well known that an $n \times n$ Wishart matrix with $d$ degrees of freedom is close to the appropriately centered and scaled Gaussian Orthogonal Ensemble (GOE) if $d$ is large enough. Recent work of Bubeck, Ding, Eldan, and Racz, and…

统计理论 · 数学 2025-11-04 Miklos Z. Racz , Jacob Richey

Advanced Distribution Theory for Significance in Scale Space

Smoothing methods find signals in noisy data. A challenge for Statistical inference is the choice of smoothing parameter. SiZer addressed this challenge in one-dimension by detecting significant slopes across multiple scales, but was not a…

统计理论 · 数学 2025-11-03 Rui Liu , Jan Hannig , J. S. Marron

Testing and estimation in orthosymmetric Gaussian sequence model

We study the Gaussian sequence model, i.e. $X \sim N(\mathbf{\theta}, I_\infty)$, where $\mathbf{\theta} \in \Gamma \subset \ell_2$ is assumed to be convex and compact. We show that goodness-of-fit testing sample complexity is lower bounded…

统计理论 · 数学 2025-11-03 Zeyu Jia , Yury Polyanskiy

Adversarially robust clustering with optimality guarantees

We consider the problem of clustering data points coming from sub-Gaussian mixtures. Existing methods that provably achieve the optimal mislabeling error, such as the Lloyd algorithm, are usually vulnerable to outliers. In contrast,…

统计理论 · 数学 2025-11-03 Soham Jana , Kun Yang , Sanjeev Kulkarni