统计理论 — Scifaro

Clustering risk in Non-parametric Hidden Markov and I.I.D. Models

We conduct an in-depth analysis of the Bayes risk of clustering in the context of Hidden Markov and i.i.d. models. In both settings, we identify the situations where this risk is comparable to the Bayes risk of classification and those…

统计理论 · 数学 2025-05-28 Elisabeth Gassiat , Ibrahim Kaddouri , Zacharie Naulet

Gaussian Process Methods for Covariate-Based Intensity Estimation

We study nonparametric Bayesian inference for the intensity function of a covariate-driven point process. We extend recent results from the literature, showing that a wide class of Gaussian priors, combined with flexible link functions,…

统计理论 · 数学 2025-05-27 Patric Dolmeta , Matteo Giordano

Existence of the solution to the graphical lasso

The graphical lasso (glasso) is an $l_1$ penalised likelihood estimator for a Gaussian precision matrix. A benefit of the glasso is that it exists even when the sample covariance matrix is not positive definite but only positive…

统计理论 · 数学 2025-05-27 Jack Storror Carter

On a retarded stochastic system with discrete diffusion modeling life tables

This work proposes a method for modeling and forecasting mortality rates. It constitutes an improvement over previous studies by incorporating both the historical evolution of the mortality phenomenon and its random behavior. In the first…

统计理论 · 数学 2025-05-27 Tomás Caraballo , Francisco Morillas , José Valero

Weighted Tail Random Variable: A Novel Framework with Stochastic Properties and Applications

This paper introduces a novel framework to construct the probability density function (PDF) of non-negative continuous random variables. The proposed framework uses two functions: one is the survival function (SF) of a non-negative…

统计理论 · 数学 2025-05-27 Sarikul Islam , Nitin Gupta

Sampling from Binary Quadratic Distributions via Stochastic Localization

Sampling from binary quadratic distributions (BQDs) is a fundamental but challenging problem in discrete optimization and probabilistic inference. Previous work established theoretical guarantees for stochastic localization (SL) in…

统计理论 · 数学 2025-05-27 Chenguang Wang , Kaiyuan Cui , Weichen Zhao , Tianshu Yu

Nonparametric estimation of sliced inverse regression by the $ k$-nearest neighbors kernel method

We investigate nonparametric estimation of sliced inverse regression (SIR) via the $k$-nearest neighbors approach with a kernel. An estimator of the covariance matrix of the conditional expectation of the explanatory random vector given the…

统计理论 · 数学 2025-05-27 Luran Bengono Mintogo , Emmanuel de Dieu Nkou , Guy Martial Nkiet

Distributional Limit Theory for Optimal Transport

Optimal Transport (OT) is a resource allocation problem with applications in biology, data science, economics and statistics, among others. In some of the applications, practitioners have access to samples which approximate the continuous…

统计理论 · 数学 2025-05-27 Eustasio del Barrio , Alberto González-Sanz , Jean-Michel Loubes , David Rodríguez-Vítores

Optimal community detection in dense bipartite graphs

We consider the problem of detecting a community of densely connected vertices in a high-dimensional bipartite graph of size $n_1 \times n_2$. Under the null hypothesis, the observed graph is drawn from a bipartite Erd\H{o}s-Renyi…

统计理论 · 数学 2025-05-27 Julien Chhor , Parker Knight

Posterior Consistency in Parametric Models via a Tighter Notion of Identifiability

We study Bayesian posterior consistency in parametric density models with proper priors, challenging the perception that the problem is settled. Classical results established consistency via MLE convergence under regularity and…

统计理论 · 数学 2025-05-27 Nicola Bariletto , Bernardo Flores , Stephen G. Walker

Nonparametric estimation of the multivariate Spearman's footrule: a further discussion

In this paper, we propose two new estimators of the multivariate rank correlation coefficient Spearman's footrule which are based on two general estimators for Average Orthant Dependence measures. We compare the new proposals with a…

统计理论 · 数学 2025-05-27 Ana Pérez , Mercedes Prieto-Alaiz , Fernando Chamizo , Eckhard Liebscher , Manuel Úbeda-Flores

Phase transitions for the existence of unregularized M-estimators in single index models

This paper studies phase transitions for the existence of unregularized M-estimators under proportional asymptotics where the sample size $n$ and feature dimension $p$ grow proportionally with $n/p \to \delta \in (1, \infty)$. We study the…

统计理论 · 数学 2025-05-27 Takuya Koriyama , Pierre C. Bellec

The Sample Complexity of Simple Binary Hypothesis Testing

The sample complexity of simple binary hypothesis testing is the smallest number of i.i.d.\ samples required to distinguish between two distributions $p$ and $q$ in either: (i) the prior-free setting, with type-I error at most $\alpha$ and…

统计理论 · 数学 2025-05-27 Ankit Pensia , Varun Jog , Po-Ling Loh

Spectral clustering algorithm for the allometric extension model

The spectral clustering algorithm is often used as a binary clustering method for unclassified data by applying the principal component analysis. To study theoretical properties of the algorithm, the assumption of conditional…

统计理论 · 数学 2025-05-27 Kohei Kawamoto , Yuichi Goto , Koji Tsukuda

Perturbation Analysis of Randomized SVD and its Applications to Statistics

Randomized singular value decomposition (RSVD) is a class of computationally efficient algorithms for computing the truncated SVD of large data matrices. Given an $m \times n$ matrix $\widehat{{\mathbf M}}$, the prototypical RSVD algorithm…

统计理论 · 数学 2025-05-27 Yichi Zhang , Minh Tang

Optimal Decision Rules for Composite Binary Hypothesis Testing under Neyman-Pearson Framework

The composite binary hypothesis testing problem within the Neyman-Pearson framework is considered. The goal is to maximize the expectation of a nonlinear function of the detection probability, integrated with respect to a given probability…

统计理论 · 数学 2025-05-26 Yanglei Song , Berkan Dulek , Sinan Gezici

Minimax Rate-Optimal Algorithms for High-Dimensional Stochastic Linear Bandits

We study the stochastic linear bandit problem with multiple arms over $T$ rounds, where the covariate dimension $d$ may exceed $T$, but each arm-specific parameter vector is $s$-sparse. We begin by analyzing the sequential estimation…

统计理论 · 数学 2025-05-26 Jingyu Liu , Yanglei Song

On Fisher Consistency of Surrogate Losses for Optimal Dynamic Treatment Regimes with Multiple Categorical Treatments per Stage

Patients with chronic diseases often receive treatments at multiple time points, or stages. Our goal is to learn the optimal dynamic treatment regime (DTR) from longitudinal patient data. When both the number of stages and the number of…

统计理论 · 数学 2025-05-26 Nilanjana Laha , Nilson Chapagain , Victoria Cicherski , Aaron Sonabend-W

Statistical depth and support medians for fuzzy data

Statistical depth functions order the elements of a space with respect to their centrality in a probability distribution or dataset. Since many depth functions are maximized in the real line by the median, they provide a natural approach to…

统计理论 · 数学 2025-05-26 Luis González-De La Fuente , Alicia Nieto-Reyes , Pedro Terán

Lower Complexity Adaptation for Empirical Entropic Optimal Transport

Entropic optimal transport (EOT) presents an effective and computationally viable alternative to unregularized optimal transport (OT), offering diverse applications for large-scale data analysis. In this work, we derive novel statistical…

统计理论 · 数学 2025-05-26 Michel Groppe , Shayan Hundrieser