统计理论 — Scifaro

Exact Distribution of the Noncentral Complex Roy's Largest Root Statistic via Pieri's Formula

In this study, we derive the exact distribution and moment of the noncentral complex Roy's largest root statistic, expressed as a product of complex zonal polynomials. We show that the linearization coefficients arising from the product of…

统计理论 · 数学 2025-07-30 Koki Shimizu , Hiroki Hashiguchi

False discovery rate control with compound p-values

In the setting of multiple testing, compound p-values generalize p-values by asking for superuniformity to hold only \emph{on average} across all true nulls. We study the properties of the Benjamini--Hochberg procedure applied to compound…

统计理论 · 数学 2025-07-30 Rina Foygel Barber , Richard J Samworth

A Generalized Cram\'er-Rao Bound Using Information Geometry

In information geometry, statistical models are considered as differentiable manifolds, where each probability distribution represents a unique point on the manifold. A Riemannian metric can be systematically obtained from a divergence…

统计理论 · 数学 2025-07-29 Satyajit Dhadumia , M. Ashok Kumar

A global Lipschitz stability perspective for understanding approximate approaches in Bayesian sequential learning

We establish a general, non-asymptotic error analysis framework for understanding the effects of incremental approximations made by practical approaches for Bayesian sequential learning (BSL) on their long-term inference performance. Our…

统计理论 · 数学 2025-07-29 Liliang Wang , Alex A. Gorodetsky

Uniform inference in linear mixed models

We provide finite-sample distribution approximations, that are uniform in the parameter, for inference in linear mixed models. Focus is on variances and covariances of random effects in cases where existing theory fails because their…

统计理论 · 数学 2025-07-29 Karl Oskar Ekvall , Matteo Bottai

State evolution beyond first-order methods I: Rigorous predictions and finite-sample guarantees

We develop a toolbox for exact analysis of iterative algorithms on a class of high-dimensional nonconvex optimization problems with random data. While prior work has shown that low-dimensional statistics of (generalized) first-order methods…

统计理论 · 数学 2025-07-29 Michael Celentano , Chen Cheng , Ashwin Pananjady , Kabir Aladin Verchand

Tree-structured Ising models under mean parameterization

We assess advantages of expressing tree-structured Ising models via their mean parameterization rather than their commonly chosen canonical parameterization. This includes fixedness of marginal distributions, often convenient for dependence…

统计理论 · 数学 2025-07-29 Benjamin Côté , Hélène Cossette , Etienne Marceau

Signal detection from spiked noise via asymmetrization

The signal plus noise model $H=S+Y$ is a fundamental model in signal detection when a low rank signal $S$ is polluted by noise $Y$. In the high-dimensional setting, one often uses the leading singular values and corresponding singular…

统计理论 · 数学 2025-07-29 Zhigang Bao , Kha Man Cheong , Jaehun Lee , Yuji Li

Early Stopping for Regression Trees

We develop early stopping rules for growing regression tree estimators. The fully data-driven stopping rule is based on monitoring the global residual norm. The best-first search and the breadth-first search algorithms together with linear…

统计理论 · 数学 2025-07-29 Ratmir Miftachov , Markus Reiß

Symmetric Perceptrons, Number Partitioning and Lattices

The symmetric binary perceptron ($\mathrm{SBP}_{\kappa}$) problem with parameter $\kappa : \mathbb{R}_{\geq1} \to [0,1]$ is an average-case search problem defined as follows: given a random Gaussian matrix $\mathbf{A} \sim…

统计理论 · 数学 2025-07-29 Neekon Vafa , Vinod Vaikuntanathan

Iterative Collaborative Filtering for Sparse Matrix Estimation

We consider sparse matrix estimation where the goal is to estimate an $n\times n$ matrix from noisy observations of a small subset of its entries. We analyze the estimation error of the popularly utilized collaborative filtering algorithm…

统计理论 · 数学 2025-07-29 Christian Borgs , Jennifer Chayes , Devavrat Shah , Christina Lee Yu

A Better Linear Unbiased Estimator for Averages over Discrete Structures

Given an i.i.d. sample drawn from some probability distribution on a finite set, the best (in the sense of least variance) linear unbiased estimator (BLUE) of the average of any quantity with respect to that distribution is the sample…

统计理论 · 数学 2025-07-28 Bastiaan J. Braams

On unbiased estimators for functions of the rate parameter of the exponential distribution

In this paper, we explicitly derive unbiased estimators for various functions of the rate parameter of the exponential distribution in the absence of a location parameter, including powers of the rate parameter, the $q$th quantile, the…

统计理论 · 数学 2025-07-28 Roberto Vila , Eduardo Yoshio Nakano

On determinantal point processes with nonsymmetric kernels

Determinantal point processes (DPPs for short) are a class of repulsive point processes. They have found some statistical applications to model spatial point pattern datasets with repulsion between close points. In the case of DPPs on…

统计理论 · 数学 2025-07-28 Poinas Arnaud

Average partial effect estimation using double machine learning

Single-parameter summaries of variable effects in regression settings are desirable for ease of interpretation. However (partially) linear models for example, which would deliver these, may fit poorly to the data. On the other hand, an…

统计理论 · 数学 2025-07-28 Harvey Klyne , Rajen D. Shah

An extended latent factor framework for ill-posed linear regression

In many applications, particularly in the natural sciences, the available high-dimensional set of features may contain variables that are not correlated with the response under consideration. Such irrelevant features can, in certain cases,…

统计理论 · 数学 2025-07-28 Gianluca Finocchio , Tatyana Krivobokova

Trek-Based Parameter Identification for Linear Causal Models With Arbitrarily Structured Latent Variables

We develop a criterion to certify whether causal effects are identifiable in linear structural equation models with latent variables. Linear structural equation models correspond to directed graphs whose nodes represent the random variables…

统计理论 · 数学 2025-07-25 Nils Sturma , Mathias Drton

Robust and Smooth Estimation of the Extreme Tail Index via Weighted Minimum Density Power Divergence

By introducing a weight function into the density power divergence, we develop a new class of robust and smooth estimators for the tail index of Pareto-type distributions, offering improved efficiency in the presence of outliers. These…

统计理论 · 数学 2025-07-25 Saida Mancer , Abdelhakim Necir , Djamel Meraghni

Unbiased estimation in one-parameter exponential families for the inverse of the natural parameter with extensions

For one-parameter continuous exponential families, we identify an unbiased estimator of the inverse of the natural parameter $\theta$ for cases where $\theta > 0$, extending an earlier result of \cite{voinov1985unbiased} applicable to a…

统计理论 · 数学 2025-07-25 Pankaj Bhagwat , Eric Marchand

Frequentist Asymptotics of Variational Laplace

Variational inference is a general framework to obtain approximations to the posterior distribution in a Bayesian context. In essence, variational inference entails an optimization over a given family of probability distributions to choose…

统计理论 · 数学 2025-07-24 Janis Keck