统计理论 — Scifaro

Estimation after selection from bivariate normal population using LINEX loss function

Let $\pi_1$ and $\pi_2$ be two independent populations, where the population $\pi_i$ follows a bivariate normal distribution with unknown mean vector $\boldsymbol{\theta}^{(i)}$ and common known variance-covariance matrix $\Sigma$, $i=1,2$.…

统计理论 · 数学 2024-08-29 Mohd. Arshad , Omer Abdalghani , Kalu Ram Meena

Multicomponent stress strength reliability estimation for Pareto distribution based on upper record values

In this article, inferences about the multicomponent stress strength reliability are drawn under the assumption that strength and stress follow independent Pareto distribution with different shapes $(\alpha_1,\alpha_2)$ and common scale…

统计理论 · 数学 2024-08-29 Qazi Azhad Jamal , Mohd. Arshad , Nancy Khandelwal

Persistence Diagram Estimation : Beyond Plug-in Approaches

Persistent homology is a tool from Topological Data Analysis (TDA) used to summarize the topology underlying data. It can be conveniently represented through persistence diagrams. Observing a noisy signal, common strategies to infer its…

统计理论 · 数学 2024-08-28 Hugo Henneuse

Rational Maximum Likelihood Estimators of Kronecker Covariance Matrices

As is the case for many curved exponential families, the computation of maximum likelihood estimates in a multivariate normal model with a Kronecker covariance structure is typically carried out with an iterative algorithm, specifically, a…

统计理论 · 数学 2024-08-28 Mathias Drton , Alexandros Grosdos , Andrew McCormack

Learning Topic Hierarchies by Tree-Directed Latent Variable Models

We study a parametric family of latent variable models, namely topic models, equipped with a hierarchical structure among the topic variables. Such models may be viewed as a finite mixture of the latent Dirichlet allocation (LDA) induced…

统计理论 · 数学 2024-08-27 Sunrit Chakraborty , Rayleigh Lei , XuanLong Nguyen

Anti-Concentration Inequalities for the Difference of Maxima of Gaussian Random Vectors

We derive novel anti-concentration bounds for the difference between the maximal values of two Gaussian random vectors across various settings. Our bounds are dimension-free, scaling with the dimension of the Gaussian vectors only through…

统计理论 · 数学 2024-08-27 Alexandre Belloni , Ethan X. Fang , Shuting Shen

Estimating Lagged (Cross-)Covariance Operators of $L^p$-$m$-approximable Processes in Cartesian Product Hilbert Spaces

Estimating parameters of functional ARMA, GARCH and invertible processes requires estimating lagged covariance and cross-covariance operators of Cartesian product Hilbert space-valued processes. Asymptotic results have been derived in…

统计理论 · 数学 2024-08-27 Sebastian Kühnert

Spectral Recovery in the Labeled SBM

We consider the problem of exact community recovery in the Labeled Stochastic Block Model (LSBM) with $k$ communities, where each pair of vertices is associated with a label from the set $\{0,1, \dots, L\}$. A pair of vertices from…

统计理论 · 数学 2024-08-26 Julia Gaudio , Heming Liu

Real Log Canonical Thresholds at Non-singular Points

Recent advances have clarified theoretical learning accuracy in Bayesian inference, revealing that the asymptotic behavior of metrics such as generalization loss and free energy, assessing predictive accuracy, is dictated by a rational…

统计理论 · 数学 2024-08-26 Yuki Kurumadani

Rates of strong uniform consistency for the $k$-nearest neighbors kernel estimators of density and regression function

We adress the problem of consistency of the $k$-nearest neighbors kernel estimators of the density and the regression function in the multivariate case. We get the rates of strong uniform consistency on the whole space $\mathbb{R}^p$ for…

统计理论 · 数学 2024-08-26 Luran Bengono Mintogo , Emmanuel de Dieu Nkou , Guy Martial Nkiet

Generalized Estimation and Information

This paper extends the idea of a generalized estimator for a scalar parameter (Vos, 2022) to multi-dimensional parameters both with and without nuisance parameters. The title reflects the fact that generalized estimators provide more than…

统计理论 · 数学 2024-08-26 Paul Vos , Qiang Wu

Monge-Kantorovich superquantiles and expected shortfalls with applications to multivariate risk measurements

We propose center-outward superquantile and expected shortfall functions, with applications to multivariate risk measurements, extending the standard notion of value at risk and conditional value at risk from the real line to…

统计理论 · 数学 2024-08-26 Bernard Bercu , Jeremie Bigot , Gauthier Thurin

Posterior Sampling in High Dimension via Diffusion Processes

Sampling from the posterior is a key technical problem in Bayesian statistics. Rigorous guarantees are difficult to obtain for Markov Chain Monte Carlo algorithms of common use. In this paper, we study an alternative class of algorithms…

统计理论 · 数学 2024-08-26 Andrea Montanari , Yuchen Wu

Split Conformal Prediction and Non-Exchangeable Data

Split conformal prediction (CP) is arguably the most popular CP method for uncertainty quantification, enjoying both academic interest and widespread deployment. However, the original theoretical analysis of split CP makes the crucial…

统计理论 · 数学 2024-08-26 Roberto I. Oliveira , Paulo Orenstein , Thiago Ramos , João Vitor Romano

Some improved Gaussian correlation inequalities for symmetrical n-rectangles extended to some multivariate gamma distributions and some further probability inequalities

The Gaussian correlation inequality (GCI) for symmetrical n-rectangles is improved if the absolute components have a joint cumulative distribution (cdf) which is MTP2 (multivariate totally positive of order 2). Inequalities of the here…

统计理论 · 数学 2024-08-26 Thomas Royen

Factor Adjusted Spectral Clustering for Mixture Models

This paper studies a factor modeling-based approach for clustering high-dimensional data generated from a mixture of strongly correlated variables. Statistical modeling with correlated structures pervades modern applications in economics,…

统计理论 · 数学 2024-08-23 Shange Tang , Soham Jana , Jianqing Fan

Statistical inference on kurtosis of elliptical distributions

Multivariate elliptically-contoured distributions are widely used for modeling correlated and non-Gaussian data. In this work, we study the kurtosis of the elliptical model, which is an important parameter in many statistical analysis.…

统计理论 · 数学 2024-08-23 Bowen Zhou , Peirong Xu , Cheng Wang

Small Sample Behavior of Wasserstein Projections, Connections to Empirical Likelihood, and Other Applications

The empirical Wasserstein projection (WP) distance quantifies the Wasserstein distance from the empirical distribution to a set of probability measures satisfying given expectation constraints. The WP is a powerful tool because it mitigates…

统计理论 · 数学 2024-08-22 Sirui Lin , Jose Blanchet , Peter Glynn , Viet Anh Nguyen

Is Cross-Validation the Gold Standard to Evaluate Model Performance?

Cross-Validation (CV) is the default choice for evaluating the performance of machine learning models. Despite its wide usage, their statistical benefits have remained half-understood, especially in challenging nonparametric regimes. In…

统计理论 · 数学 2024-08-22 Garud Iyengar , Henry Lam , Tianyu Wang

General Inferential Limits Under Differential and Pufferfish Privacy

Differential privacy (DP) is a class of mathematical standards for assessing the privacy provided by a data-release mechanism. This work concerns two important flavors of DP that are related yet conceptually distinct: pure…

统计理论 · 数学 2024-08-22 James Bailie , Ruobin Gong