统计理论 — Scifaro

Gaussian and Bootstrap Approximation for Matching-based Average Treatment Effect Estimators

We establish Gaussian approximation bounds for covariate and rank-matching-based Average Treatment Effect (ATE) estimators. By analyzing these estimators through the lens of stabilization theory, we employ the Malliavin-Stein method to…

统计理论 · 数学 2024-12-25 Zhaoyang Shi , Chinmoy Bhattacharjee , Krishnakumar Balasubramanian , Wolfgang Polonik

Statistical Learning Theory for Neural Operators

We present statistical convergence results for the learning of (possibly) non-linear mappings in infinite-dimensional spaces. Specifically, given a map $G_0:\mathcal X\to\mathcal Y$ between two separable Hilbert spaces, we analyze the…

统计理论 · 数学 2024-12-24 Niklas Reinhardt , Sven Wang , Jakob Zech

Adaptive Elastic-Net estimation for sparse diffusion processes

Penalized estimation methods for diffusion processes and dependent data have recently gained significant attention due to their effectiveness in handling high-dimensional stochastic systems. In this work, we introduce an adaptive…

统计理论 · 数学 2024-12-24 Alessandro De Gregorio , Dario Frisardi , Francesco Iafrate , Stefano Iacus

On the impossibility of detecting a late change-point in the preferential attachment random graph model

We consider the problem of late change-point detection under the preferential attachment random graph model with time dependent attachment function. This can be formulated as a hypothesis testing problem where the null hypothesis…

统计理论 · 数学 2024-12-24 Ibrahim Kaddouri , Zacharie Naulet , Élisabeth Gassiat

Increasing dimension asymptotics for two-way crossed mixed effect models

This paper presents asymptotic results for the maximum likelihood and restricted maximum likelihood (REML) estimators within a two-way crossed mixed effect model as the sizes of the rows, columns, and cells tend to infinity. Under very mild…

统计理论 · 数学 2024-12-24 Ziyang Lyu , S. A. Sisson , A. H. Welsh

Estimating a distribution function for discrete data subject to random truncation with an application to structured finance

Proper econometric analysis should be informed by data structure. Many forms of financial data are recorded in discrete-time and relate to products of a finite term. If the data comes from a financial trust, it will often be further subject…

统计理论 · 数学 2024-12-24 Jackson P. Lautier , Vladimir Pozdnyakov , Jun Yan

Motif Estimation via Subgraph Sampling: The Fourth Moment Phenomenon

Network sampling is an indispensable tool for understanding features of large complex networks where it is practically impossible to search over the entire graph. In this paper, we develop a framework for statistical inference for counting…

统计理论 · 数学 2024-12-24 Bhaswar B. Bhattacharya , Sayan Das , Sumit Mukherjee

On statistical model extensions based on randomly stopped extremes

The maxima and the minima of a randomly stopped sample of a random variable, $X$, together with two newly defined random variables that make $X$ into the maxima or minima of a randomly stopped sample of them, can be used to define…

统计理论 · 数学 2024-12-23 Jordi Valero , Josep Ginebra

Early stopping for conjugate gradients in statistical inverse problems

We consider estimators obtained by iterates of the conjugate gradient (CG) algorithm applied to the normal equation of prototypical statistical inverse problems. Stopping the CG algorithm early induces regularisation, and optimal…

统计理论 · 数学 2024-12-23 Laura Hucker , Markus Reiß

Asymptotic Equivalence for Nonparametric Generalized Linear Models

We establish that a non-Gaussian nonparametric regression model is asymptotically equivalent to a regression model with Gaussian noise. The approximation is in the sense of Le Cam's deficiency distance $\Delta $; the models are then…

统计理论 · 数学 2024-12-20 Ion Grama , Michael Nussbaum

Asymptotic Equivalence for Nonparametric Regression

We consider a nonparametric model $\mathcal{E}^{n},$ generated by independent observations $X_{i},$ $i=1,...,n,$ with densities $p(x,\theta_{i}),$ $i=1,...,n,$ the parameters of which $\theta _{i}=f(i/n)\in \Theta $ are driven by the values…

统计理论 · 数学 2024-12-20 Ion Grama , Michael Nussbaum

Strong Gaussian approximations with random multipliers

One reason why standard formulations of the central limit theorems are not applicable in high-dimensional and non-stationary regimes is the lack of a suitable limit object. Instead, suitable distributional approximations can be used, where…

统计理论 · 数学 2024-12-20 Fabian Mies

Log-concave Density Estimation with Independent Components

We propose a method for estimating a log-concave density on $\mathbb R^d$ from samples, under the assumption that there exists an orthogonal transformation that makes the components of the random vector independent. While log-concave…

统计理论 · 数学 2024-12-20 Sharvaj Kubal , Christian Campbell , Elina Robeva

Two-Step Mixed-Type Multivariate Bayesian Sparse Variable Selection with Shrinkage Priors

We introduce a Bayesian framework for mixed-type multivariate regression using continuous shrinkage priors. Our framework enables joint analysis of mixed continuous and discrete outcomes and facilitates variable selection from the $p$…

统计理论 · 数学 2024-12-20 Shao-Hsuan Wang , Ray Bai , Hsin-Hsiung Huang

Variable selection for partially linear single-index varying-coefficient model

This paper focuses on variable selection for a partially linear single-index varying-coefficient model. A regularized variable selection procedure by combining basis function approximations with SCAD penalty is proposed. It can…

统计理论 · 数学 2024-12-19 Lijuan Han , Liugen Xue , Junshan Xie

Asymptotic Normality of Log Likelihood Ratio and Fundamental Limit of the Weak Detection for Spiked Wigner Matrices

We consider the problem of detecting the presence of a signal in a rank-one spiked Wigner model. For general non-Gaussian noise, assuming that the signal is drawn from the Rademacher prior, we prove that the log likelihood ratio (LR) of the…

统计理论 · 数学 2024-12-19 Hye Won Chung , Jiho Lee , Ji Oon Lee

ROSE Random Forests for Robust Semiparametric Efficient Estimation

It is widely recognised that semiparametric efficient estimation can be hard to achieve in practice: estimators that are in theory efficient may require unattainable levels of accuracy for the estimation of complex nuisance functions. As a…

统计理论 · 数学 2024-12-18 Elliot H. Young , Rajen D. Shah

Even naive trees are consistent

The last decade has shed some light on theoretical properties such as their consistency for regression tasks. In the current paper, we propose a new class of very simple learners based on so-called naive trees. These naive trees partition…

统计理论 · 数学 2024-12-18 Nico Föge , Markus Pauly , Lena Schmid , Marc Ditzhaus

On the asymptotic properties of product-PCA under the high-dimensional setting

Principal component analysis (PCA) is a widely used dimension reduction method, but its performance is known to be non-robust to outliers. Recently, product-PCA (PPCA) has been shown to possess the efficiency-loss free ordering-robustness…

统计理论 · 数学 2024-12-17 Hung Hung , Chi-Chun Yeh , Su-Yun Huang

A clarification on the links between potential outcomes and do-interventions

Most of the scientific literature on causal modeling considers the structural framework of Pearl and the potential-outcome framework of Rubin to be formally equivalent, and therefore interchangeably uses do-interventions and the…

统计理论 · 数学 2024-12-17 Lucas de Lara