Statistical Theory - Scifaro

Aspects of a Generalized Theory of Sparsity based Inference in Linear Inverse Problems

Linear inverse problems are ubiquitous in various science and engineering disciplines. Of particular importance in the past few decades is the incorporation of sparsity-based priors, in particular $\ell_1$ priors, into linear inverse…

Statistical Theory · Mathematics 2025-03-04 Ryan O'Dowd, Raghu G. Raj, Hrushikesh N. Mhaskar

On a class of high dimensional linear regression methods with debiasing and thresholding

In this paper, we introduce a unified framework, inspired by classical regularization theory, for designing and analyzing a broad class of linear regression approaches. Our framework encompasses traditional methods like least squares…

Statistical Theory · Mathematics 2025-03-04 Ying-Ao Wang, Yunyi Zhang, Ye Zhang

Bayesian calculus and predictive characterizations of extended feature allocation models

We introduce and study a unified Bayesian framework for extended feature allocations which flexibly captures interactions -- such as repulsion or attraction -- among features and their associated weights. We provide a complete Bayesian…

Statistical Theory · Mathematics 2025-03-04 Mario Beraha, Federico Camerlenghi, Lorenzo Ghilotti

Linear cost and exponentially convergent approximation of Gaussian Matérn processes on intervals

The computational cost for inference and prediction of statistical models based on Gaussian processes with Matérn covariance functions scales cubically with the number of observations, limiting their applicability to large data sets. The…

Statistical Theory · Mathematics 2025-03-04 David Bolin, Vaibhav Mehandiratta, Alexandre B. Simas

Testing Elliptical Models in High Dimensions

Due to the broad applications of elliptical models, there is a long line of research on goodness-of-fit tests for empirically validating them. However, the existing literature on this topic is generally confined to low-dimensional settings,…

Statistical Theory · Mathematics 2025-03-04 Siyao Wang, Miles E. Lopes

Asymptotic Behavior of Adversarial Training Estimator under $\ell_\infty$-Perturbation

Adversarial training has been proposed to protect machine learning models against adversarial attacks. This paper focuses on adversarial training under $\ell_\infty$-perturbation, which has recently attracted much research attention. The…

Statistical Theory · Mathematics 2025-03-04 Yiling Xie, Xiaoming Huo

Semi-parametric inference based on adaptively collected data

Many standard estimators, when applied to adaptively collected data, fail to be asymptotically normal, thereby complicating the construction of confidence intervals. We address this challenge in a semi-parametric context: estimating the…

Statistical Theory · Mathematics 2025-03-04 Licong Lin, Koulik Khamaru, Martin J. Wainwright

A direct extension of Azadkia & Chatterjee's rank correlation to multi-response vectors

Recently, Chatterjee (2023) recognized the lack of a direct generalization of his rank correlation $\xi$ in Azadkia and Chatterjee (2021) to a multi-dimensional response vector. As a natural solution to this problem, we here propose an…

Statistical Theory · Mathematics 2025-03-04 Jonathan Ansari, Sebastian Fuchs

Characterizing the Training-Conditional Coverage of Full Conformal Inference in High Dimensions

We study the coverage properties of full conformal regression in the proportional asymptotic regime where the ratio of the dimension and the sample size converges to a constant. In this setting, existing theory tells us only that full…

Statistical Theory · Mathematics 2025-03-03 Isaac Gibbs, Emmanuel J. Candès

Robust statistical inference for accelerated life-tests with one-shot devices under log-logistic distributions

A one-shot device is a unit that operates only once, after which it is either destroyed or needs to be rebuilt. For this type of device, the operational status can only be assessed at a specific inspection time, determining whether failure…

Statistical Theory · Mathematics 2025-03-03 María González-Calderón, María Jaenada, Leandro Pardo

Series ridge regression for spatial data on $\mathbb{R}^d$

This paper develops a general asymptotic theory of series estimators for spatial data collected at irregularly spaced locations within a sampling region $R_n \subset \mathbb{R}^d$. We employ a stochastic sampling design that can flexibly…

Statistical Theory · Mathematics 2025-03-03 Daisuke Kurisu, Yasumasa Matsuda

Statistical Inference for Random Unknowns via Modifications of Extended Likelihood

Fisher's likelihood is widely used for statistical inference for fixed unknowns. This paper aims to extend two important likelihood-based methods, namely the maximum likelihood procedure for point estimation and the confidence procedure for…

Statistical Theory · Mathematics 2025-03-03 Hangbin Lee, Youngjo Lee

Linear type conditional specifications for multivariate count variables

This paper investigates conditional specifications for multivariate count variables. Recently, the spatial count data literature has proposed several conditional models such that the conditional expectations are linear in the conditioning…

Statistical Theory · Mathematics 2025-02-28 Yang Lu, Wei Sun

Non-asymptotic Properties of Generalized Mondrian Forests in Statistical Learning

Random Forests have been extensively used in regression and classification, inspiring the development of various forest-based methods. Among these, Mondrian Forests, derived from the Mondrian process, mark a significant advancement.…

Statistical Theory · Mathematics 2025-02-28 Haoran Zhan, Jingli Wang, Yingcun Xia

A Full Adagrad algorithm with O(Nd) operations

A novel approach is given to overcome the computational challenges of the full-matrix Adaptive Gradient algorithm (Full AdaGrad) in stochastic optimization. By developing a recursive method that estimates the inverse of the square root of…

Statistical Theory · Mathematics 2025-02-28 Antoine Godichon-Baggioni, Wei Lu, Bruno Portier

Causal survival embeddings: non-parametric counterfactual inference under censoring

Model-free time-to-event regression under confounding presents challenges due to biases introduced by causal and censoring sampling mechanisms. This phenomenology poses problems for classical non-parametric estimators like Beran's or the…

Statistical Theory · Mathematics 2025-02-28 Carlos García-Meixide, Marcos Matabuena

Rates of convergence for nonparametric estimation of singular distributions using generative adversarial networks

It is common in nonparametric estimation problems to impose a certain low-dimensional structure on the unknown parameter to avoid the curse of dimensionality. This paper considers a nonparametric distribution estimation problem with a…

Statistical Theory · Mathematics 2025-02-28 Jeyong Lee, Hyeok Kyu Kwon, Minwoo Chae

Kernel Estimation for Nonlinear Dynamics

Many scientific problems involve data exhibiting both temporal and cross-sectional dependencies. While linear dependencies have been extensively studied, the theoretical analysis of regression estimators under nonlinear dependencies remains…

Statistical Theory · Mathematics 2025-02-27 Marie-Christine Düker, Adam Waterbury

Consistency of heritability estimation from summary statistics in high-dimensional linear models

In Genome-Wide Association Studies (GWAS), heritability is defined as the fraction of variance of an outcome explained by a large number of genetic predictors in a high-dimensional polygenic linear model. This work studies the asymptotic…

Statistical Theory · Mathematics 2025-02-27 David Azriel, Samuel Davenport, Armin Schwartzman

The No-Underrun Sampler: A Locally-Adaptive, Gradient-Free MCMC Method

In this work, we introduce the No-Underrun Sampler (NURS), a locally-adaptive, gradient-free Markov chain Monte Carlo method that blends ideas from Hit-and-Run and the No-U-Turn Sampler. NURS dynamically adapts to the local scale of the…

Statistical Theory · Mathematics 2025-02-27 Nawaf Bou-Rabee, Bob Carpenter, Sifan Liu, Stefan Oberdörster