Statistics Theory - Scifaro

Black-Box Model Confidence Sets Using Cross-Validation with High-Dimensional Gaussian Comparison

We derive high-dimensional Gaussian comparison results for the standard $V$-fold cross-validated risk estimates. Our results combine a recent stability-based argument for the low-dimensional central limit theorem of cross-validation with…

Statistics Theory · Mathematics 2023-11-15 Nicholas Kissel, Jing Lei

Optimal Estimation of Large-Dimensional Nonlinear Factor Models

This paper studies optimal estimation of large-dimensional nonlinear factor models. The key challenge is that the observed variables are possibly nonlinear functions of some latent variables where the functional forms are left unspecified.…

Statistics Theory · Mathematics 2023-11-14 Yingjie Feng

The Bessel function expression of characteristic function

The purpose of the present paper is to give unified expressions to the characteristic functions of all elliptical and related distributions. Those distributions including the multivariate elliptical symmetric distributions and some…

Statistics Theory · Mathematics 2023-11-14 Chuancun Yin, Hua Dong

Asymptotic in a class of network models with an increasing sub-Gamma degree sequence

For the differential privacy under the sub-Gamma noise, we derive the asymptotic properties of a class of network models with binary values with a general link function. In this paper, we release the degree sequences of the binary networks…

Statistics Theory · Mathematics 2023-11-14 Jing Luo, Haoyu Wei, Xiaoyu Lei, Jiaxin Guo

Limiting distributions of graph-based test statistics on sparse and dense graphs

Two-sample tests utilizing a similarity graph on observations are useful for high-dimensional and non-Euclidean data due to their flexibility and good performance under a wide range of alternatives. Existing works mainly focused on sparse…

Statistics Theory · Mathematics 2023-11-14 Yejiong Zhu, Hao Chen

A central limit theorem for the Benjamini-Hochberg false discovery proportion under a factor model

The Benjamini-Hochberg (BH) procedure remains widely popular despite having limited theoretical guarantees in the commonly encountered scenario of correlated test statistics. Of particular concern is the possibility that the method could…

Statistics Theory · Mathematics 2023-11-14 Dan M. Kluger, Art B. Owen

Objective Bayesian Analysis for the Differential Entropy of the Gamma Distribution

The present paper introduces a fully objective Bayesian analysis to obtain the posterior distribution of an entropy measure. Notably, we consider the gamma distribution, which describes many natural phenomena in physics, engineering, and…

Statistics Theory · Mathematics 2023-11-14 Eduardo Ramos, Osafu A. Egbon, Pedro L. Ramos, Francisco A. Rodrigues, Francisco Louzada

A new goodness of fit test for normal distribution based on Stein's characterization

In this paper, we develop a simple non-parametric test for testing normal distribution based on the distance between empirical zero-bias transformation and empirical distribution. The asymptotic properties of the test statistic are studied.…

Statistics Theory · Mathematics 2023-11-14 Sudheesh Kattumannil

Statistical inference on $D^{(d)}(u_n)$ condition and estimation of the Extremal Index

Clustering of extreme events can have profound and detrimental societal consequences. The extremal index, a number in the unit interval, is a key parameter in modelling the clustering of extremes. The study of the extremal index often assumes a…

Statistics Theory · Mathematics 2023-11-14 Juan Juan Cai

Step and Smooth Decompositions as Topological Clustering

We investigate a class of recovery problems for which observations are a noisy combination of continuous and step functions. These problems can be seen as non-injective instances of non-linear ICA with direct applications to image…

Statistics Theory · Mathematics 2023-11-13 Luciano Vinas, Arash A. Amini

Robust covariance estimation with missing values and cell-wise contamination

Large datasets are often affected by cell-wise outliers in the form of missing or erroneous data. However, discarding any samples containing outliers may result in a dataset that is too small to accurately estimate the covariance matrix.…

Statistics Theory · Mathematics 2023-11-13 Karim Lounici, Grégoire Pacreau

On a Projection Estimator of the Regression Function Derivative

In this paper, we study the estimation of the derivative of a regression function in a standard univariate regression model. The estimators are defined either by differentiating nonparametric least-squares estimators of the regression function…

Statistics Theory · Mathematics 2023-11-13 Fabienne Comte, Nicolas Marie

Detangling robustness in high dimensions: composite versus model-averaged estimation

Robust methods, though ubiquitous in practice, are yet to be fully understood in the context of regularized estimation and high dimensions. Even simple questions become challenging very quickly. For example, classical statistical theory…

Statistics Theory · Mathematics 2023-11-10 Jing Zhou, Gerda Claeskens, Jelena Bradic

Adaptive Linear Estimating Equations

Sequential data collection has emerged as a widely adopted technique for enhancing the efficiency of data gathering processes. Despite its advantages, such a data collection mechanism often introduces complexities to the statistical inference…

Statistics Theory · Mathematics 2023-11-09 Mufang Ying, Koulik Khamaru, Cun-Hui Zhang

Laplace and Saddlepoint Approximations in High Dimensions

We examine the behaviour of the Laplace and saddlepoint approximations in the high-dimensional setting, where the dimension of the model is allowed to increase with the number of observations. Approximations to the joint density, the…

Statistics Theory · Mathematics 2023-11-09 Yanbo Tang, Nancy Reid

Controlling FSR in Selective Classification

Uncertainty quantification and false selection error rate (FSR) control are crucial in many high-consequence scenarios, so we need models with good interpretability. This article introduces the optimality function for the binary…

Statistics Theory · Mathematics 2023-11-08 Guanlan Zhao, Zhonggen Su

Thresholding the higher criticism test statistics for optimality in a heterogeneous setting

Donoho and Kipnis (2022) showed that the higher criticism (HC) test statistic has a non-Gaussian phase transition but remarked that it is probably not optimal in the detection of sparse differences between two large frequency tables…

Statistics Theory · Mathematics 2023-11-08 Hock Peng Chan

Hebbian learning inspired estimation of the linear regression parameters from queries

Local learning rules in biological neural networks (BNNs) are commonly referred to as Hebbian learning. [26] links a biologically motivated Hebbian learning rule to a specific zeroth-order optimization method. In this work, we study a…

Statistics Theory · Mathematics 2023-11-08 Johannes Schmidt-Hieber, Wouter M Koolen

Wasserstein contraction and spectral gap of slice sampling revisited

We propose a new class of Markov chain Monte Carlo methods, called $k$-polar slice sampling ($k$-PSS), as a technical tool that interpolates between and extrapolates beyond uniform and polar slice sampling. By examining Wasserstein…

Statistics Theory · Mathematics 2023-11-08 Philip Schär

Dimension-independent spectral gap of polar slice sampling

Polar slice sampling, a Markov chain construction for approximate sampling, performs, under suitable assumptions on the target and initial distribution, provably independently of the state space dimension. We extend the aforementioned result…

Statistics Theory · Mathematics 2023-11-08 Daniel Rudolf, Philip Schär