统计理论 — Scifaro

Moment-Type Estimators for the Dirichlet and the Multivariate Gamma Distributions

This study presents new closed-form estimators for the Dirichlet and the Multivariate Gamma distribution families, whose maximum likelihood estimator cannot be explicitly derived. The methodology builds upon the score-adjusted estimators…

统计理论 · 数学 2023-11-28 Ioannis Oikonomidis , Samis Trevezas

The geometry of the maximum likelihood of Cauchy-like distributions

A simple way of obtaining robust estimates of the "center" (or the "location") and of the "scatter" of a dataset is to use the maximum likelihood estimate with a class of heavy-tailed distributions, regardless of the "true" distribution…

统计理论 · 数学 2023-11-28 Pavol Ševera

On the near-optimality of betting confidence sets for bounded means

Constructing nonasymptotic confidence intervals (CIs) for the mean of a univariate distribution from independent and identically distributed (i.i.d.) observations is a fundamental task in statistics. For bounded observations, a classical…

统计理论 · 数学 2023-11-28 Shubhanshu Shekhar , Aaditya Ramdas

Reducing sequential change detection to sequential estimation

We consider the problem of sequential change detection, where the goal is to design a scheme for detecting any changes in a parameter or functional $\theta$ of the data stream distribution that has small detection delay, but guarantees…

统计理论 · 数学 2023-11-28 Shubhanshu Shekhar , Aaditya Ramdas

Factor Augmented Sparse Throughput Deep ReLU Neural Networks for High Dimensional Regression

This paper introduces a Factor Augmented Sparse Throughput (FAST) model that utilizes both latent factors and sparse idiosyncratic components for nonparametric regression. The FAST model bridges factor models on one end and sparse…

统计理论 · 数学 2023-11-28 Jianqing Fan , Yihong Gu

Asymptotic Bounds for Smoothness Parameter Estimates in Gaussian Process Interpolation

It is common to model a deterministic response function, such as the output of a computer experiment, as a Gaussian process with a Mat\'ern covariance kernel. The smoothness parameter of a Mat\'ern kernel determines many important…

统计理论 · 数学 2023-11-28 Toni Karvonen

Variable Selection with the Knockoffs: Composite Null Hypotheses

The fixed-X knockoff filter is a flexible framework for variable selection with false discovery rate (FDR) control in linear models with arbitrary design matrices (of full column rank) and it allows for finite-sample selective inference via…

统计理论 · 数学 2023-11-28 Mehrdad Pournaderi , Yu Xiang

Nonparametric Estimation for SDE with Sparsely Sampled Paths: an FDA Perspective

We consider the problem of nonparametric estimation of the drift and diffusion coefficients of a Stochastic Differential Equation (SDE), based on $n$ independent replicates $\left\{X_i(t)\::\: t\in [0,1]\right\}_{1 \leq i \leq n}$, observed…

统计理论 · 数学 2023-11-28 Neda Mohammadi , Leonardo Santoro , Victor M. Panaretos

Bootstrapping Persistent Betti Numbers and Other Stabilizing Statistics

The present contribution investigates multivariate bootstrap procedures for general stabilizing statistics, with specific application to topological data analysis. Existing limit theorems for topological statistics prove difficult to use in…

统计理论 · 数学 2023-11-28 Benjamin Roycraft , Johannes Krebs , Wolfgang Polonik

More Power by using Fewer Permutations

It is conventionally believed that a permutation test should ideally use all permutations. If this is computationally unaffordable, it is believed one should use the largest affordable Monte Carlo sample or (algebraic) subgroup of…

统计理论 · 数学 2023-11-27 Nick W. Koning

Covariance alignment: from maximum likelihood estimation to Gromov-Wasserstein

Feature alignment methods are used in many scientific disciplines for data pooling, annotation, and comparison. As an instance of a permutation learning problem, feature alignment presents significant statistical and computational…

统计理论 · 数学 2023-11-23 Yanjun Han , Philippe Rigollet , George Stepaniants

Improving tensor regression by optimal model averaging

Tensors have broad applications in neuroimaging, data mining, digital marketing, etc. CANDECOMP/PARAFAC (CP) tensor decomposition can effectively reduce the number of parameters to gain dimensionality-reduction and thus plays a key role in…

统计理论 · 数学 2023-11-23 Qiushi Bu , Hua Liang , Xinyu Zhang , Jiahui Zou

Optimality in Mean Estimation: Beyond Worst-Case, Beyond Sub-Gaussian, and Beyond $1+\alpha$ Moments

There is growing interest in improving our algorithmic understanding of fundamental statistical problems such as mean estimation, driven by the goal of understanding the limits of what we can extract from valuable data. The state of the art…

统计理论 · 数学 2023-11-22 Trung Dang , Jasper C. H. Lee , Maoyuan Song , Paul Valiant

Complete Asymptotic Expansions and the High-Dimensional Bingham Distributions

For $d \ge 2$, let $X$ be a random vector having a Bingham distribution on $\mathcal{S}^{d-1}$, the unit sphere centered at the origin in $\R^d$, and let $\Sigma$ denote the symmetric matrix parameter of the distribution. Let $\Psi(\Sigma)$…

统计理论 · 数学 2023-11-22 Armine Bagyan , Donald Richards

ACF estimation via difference schemes for a semiparametric model with m-dependent errors

In this manuscript, we discuss a class of difference-based estimators of the autocovariance structure in a semiparametric regression model where the signal is discontinuous and the errors are serially correlated. The signal in this model…

统计理论 · 数学 2023-11-22 Michael Levine , Inder Tecuapetla-Gomez

Mixing properties for multivariate Hawkes processes

Properties of strong mixing have been established for the stationary linear Hawkes process in the univariate case, and can serve as a basis for statistical applications. In this paper, we provide the technical arguments needed to extend the…

统计理论 · 数学 2023-11-21 Ousmane Boly , Felix Cheysson , Thi Hien Nguyen

Functional relative error regression under left truncation and right censoring

The nonparametric estimators built by minimizing the mean squared relative error are gaining in popularity for their robustness in the presence of outliers in comparison to the Nadaraya Watson estimators. In this paper we build a relative…

统计理论 · 数学 2023-11-21 Adel Boucetta , Zohra Guessoum , Elias Ould-Said

Bell-INGARCH Model

Integer-valued time series exist widely in economics, finance, biology, computer science, medicine, insurance, and many other fields. In recent years, many types of models have been proposed to model integer-valued time series data, in…

统计理论 · 数学 2023-11-21 Ying Wang , Shuang Chen , Lianyong Qian

Asymptotic distributions of the average clustering coefficient and its variant

In network data analysis, summary statistics of a network can provide us with meaningful insight into the structure of the network. The average clustering coefficient is one of the most popular and widely used network statistics. In this…

统计理论 · 数学 2023-11-21 Mingao Yuan , Xiaofeng Zhao

New Asymptotic Limit Theory and Inference for Monotone Regression

Nonparametric regression problems with qualitative constraints such as monotonicity or convexity are ubiquitous in applications. For example, in predicting the yield of a factory in terms of the number of labor hours, the monotonicity of…

统计理论 · 数学 2023-11-21 Soham Mallick , Siddhaarth Sarkar , Arun Kumar Kuchibhotla