统计理论 — Scifaro

Multivariate Quantiles: Geometric and Measure-Transportation-Based Contours

Quantiles are a fundamental concept in probability and theoretical statistics and a daily tool in their applications. While the univariate concept of quantiles is quite clear and well understood, its multivariate extension is more…

统计理论 · 数学 2024-01-08 Marc Hallin , Dimitri Konen

Outlier-robust additive matrix decomposition

We study least-squares trace regression when the parameter is the sum of a $r$-low-rank matrix and a $s$-sparse matrix and a fraction $\epsilon$ of the labels is corrupted. For subgaussian distributions and feature-dependent noise, we…

统计理论 · 数学 2024-01-08 Philip Thompson

Efficient nonparametric estimation of Toeplitz covariance matrices

A new nonparametric estimator for Toeplitz covariance matrices is proposed. This estimator is based on a data transformation that translates the problem of Toeplitz covariance matrix estimation to the problem of mean estimation in an…

统计理论 · 数学 2024-01-08 Karolina Klockmann , Tatyana Krivobokova

Singularity-agnostic incomplete U-statistics for testing polynomial constraints in Gaussian covariance matrices

Testing the goodness-of-fit of a model with its defining functional constraints in the parameters could date back to Spearman (1927), who analyzed the famous "tetrad" polynomial in the covariance matrix of the observed variables in a…

统计理论 · 数学 2024-01-05 Dennis Leung , Nils Sturma

Projection depth and $L^r$-type depths for fuzzy random variables

Statistical depth functions are a standard tool in nonparametric statistics to extend order-based univariate methods to the multivariate setting. Since there is no universally accepted total order for fuzzy data (even in the univariate…

统计理论 · 数学 2024-01-05 Luis González-De La Fuente , Alicia Nieto-Reyes , Pedro Terán

Better and Simpler Lower Bounds for Differentially Private Statistical Estimation

We provide optimal lower bounds for two well-known parameter estimation (also known as statistical estimation) tasks in high dimensions with approximate differential privacy. First, we prove that for any $\alpha \le O(1)$, estimating the…

统计理论 · 数学 2024-01-05 Shyam Narayanan

Robust density estimation with the $\mathbb{L}_{1}$-loss. Applications to the estimation of a density on the line satisfying a shape constraint

We solve the problem of estimating the distribution of presumed i.i.d. observations for the total variation loss. Our approach is based on density models and is versatile enough to cope with many different ones, including some density…

统计理论 · 数学 2024-01-05 Y. Baraud , H. Halconruy , G. Maillard

From robust tests to Bayes-like posterior distributions

In the Bayes paradigm and for a given loss function, we propose the construction of a new type of posterior distributions, that extends the classical Bayes one, for estimating the law of an $n$-sample. The loss functions we have in mind are…

统计理论 · 数学 2024-01-05 Yannick Baraud

Optimal transport map estimation in general function spaces

We study the problem of estimating a function $T$ given independent samples from a distribution $P$ and from the pushforward distribution $T_\sharp P$. This setting is motivated by applications in the sciences, where $T$ represents the…

统计理论 · 数学 2024-01-04 Vincent Divol , Jonathan Niles-Weed , Aram-Alexandre Pooladian

Joint mixability and notions of negative dependence

A joint mix is a random vector with a constant component-wise sum. The dependence structure of a joint mix minimizes some common objectives such as the variance of the component-wise sum, and it is regarded as a concept of extremal negative…

统计理论 · 数学 2024-01-04 Takaaki Koike , Liyuan Lin , Ruodu Wang

Liberating dimension and spectral norm: A universal approach to spectral properties of sample covariance matrices

In this paper, our objective is to present a constraining principle governing the spectral properties of the sample covariance matrix. This principle exhibits harmonious behavior across diverse limiting frameworks, eliminating the need for…

统计理论 · 数学 2024-01-03 Yanqing Yin

Improved estimators in Bell regression model with application

In this paper, we propose the application of shrinkage strategies to estimate coefficients in the Bell regression models when prior information about the coefficients is available. The Bell regression models are well-suited for modeling…

统计理论 · 数学 2024-01-03 Solmaz Seifollahi , Hossein Bevrani , Zakariya Yahya Algamal

A note on the prediction error of principal component regression in high dimensions

We analyze the prediction error of principal component regression (PCR) and prove high probability bounds for the corresponding squared risk conditional on the design. Our first main result shows that PCR performs comparably to the oracle…

统计理论 · 数学 2024-01-03 Laura Hucker , Martin Wahl

Universality in block dependent linear models with applications to nonparametric regression

Over the past decade, characterizing the exact asymptotic risk of regularized estimators in high-dimensional regression has emerged as a popular line of work. This literature considers the proportional asymptotics framework, where the…

统计理论 · 数学 2024-01-02 Samriddha Lahiry , Pragya Sur

Propagation of Input Tail Uncertainty in Rare-Event Estimation: A Light versus Heavy Tail Dichotomy

We consider the estimation of small probabilities or other risk quantities associated with rare but catastrophic events. In the model-based literature, much of the focus has been devoted to efficient Monte Carlo computation or analytical…

统计理论 · 数学 2024-01-02 Zhiyuan Huang , Henry Lam , Zhenyuan Liu

Estimation of the incubation time distribution in the singly and doubly interval censored model

We analyze nonparametric estimators for the distribution function of the incubation time in the singly and doubly interval censoring model. The classical approach is to use parametric families like Weibull, log-normal or gamma distributions…

统计理论 · 数学 2024-01-02 Piet Groeneboom

Arithmetic Average Density Fusion -- Part I: Some Statistic and Information-theoretic Results

Finite mixture such as the Gaussian mixture is a flexible and powerful probabilistic modeling tool for representing the multimodal distribution widely involved in many estimation and learning problems. The core of it is representing the…

统计理论 · 数学 2024-01-02 Tiancheng Li , Yan Song , Enbin Song , Hongqi Fan

Multivariate, Heteroscedastic Empirical Bayes via Nonparametric Maximum Likelihood

Multivariate, heteroscedastic errors complicate statistical inference in many large-scale denoising problems. Empirical Bayes is attractive in such settings, but standard parametric approaches rest on assumptions about the form of the prior…

统计理论 · 数学 2024-01-02 Jake A. Soloff , Adityanand Guntuboyina , Bodhisattva Sen

Signal-to-noise ratio aware minimaxity and higher-order asymptotics

Since its development, the minimax framework has been one of the corner stones of theoretical statistics, and has contributed to the popularity of many well-known estimators, such as the regularized M-estimators for high-dimensional…

统计理论 · 数学 2024-01-01 Yilin Guo , Haolei Weng , Arian Maleki

Local convergence rates of the nonparametric least squares estimator with applications to transfer learning

Convergence properties of empirical risk minimizers can be conveniently expressed in terms of the associated population risk. To derive bounds for the performance of the estimator under covariate shift, however, pointwise convergence rates…

统计理论 · 数学 2024-01-01 Johannes Schmidt-Hieber , Petr Zamolodtchikov