Statistics Theory - Scifaro

Minimax Signal Detection in Sparse Additive Models

Sparse additive models are an attractive choice in circumstances calling for modelling flexibility in the face of high dimensionality. We study the signal detection problem and establish the minimax separation rate for the detection of a…

Statistics Theory · Mathematics 2024-10-03 Subhodh Kotekal, Chao Gao

Some notes on the $k$-means clustering for missing data

The classical $k$-means clustering requires a complete data matrix without missing entries. As a natural extension of the $k$-means clustering for missing data, the $k$-POD clustering has been proposed, which ignores the missing entries in…

Statistics Theory · Mathematics 2024-10-02 Yoshikazu Terada, Xin Guan

Optimal Designs for Regression on Lie Groups

We consider a linear regression model with complex-valued response and predictors from a compact and connected Lie group. The regression model is formulated in terms of eigenfunctions of the Laplace-Beltrami operator on the Lie group. We…

Statistics Theory · Mathematics 2024-10-02 Somnath Chakraborty, Holger Dette, Martin Kroll

Matching prior pairs connecting Maximum A Posteriori estimation and posterior expectation

Bayesian statistics has two common measures of central tendency of a posterior distribution: posterior means and Maximum A Posteriori (MAP) estimates. In this paper, we discuss a connection between MAP estimates and posterior means. We…

Statistics Theory · Mathematics 2024-10-02 Michiko Okudo, Keisuke Yano

An instrumental variable approach under dependent censoring

This paper considers the problem of inferring the causal effect of a variable $Z$ on a dependently censored survival time $T$. We allow for unobserved confounding variables, such that the error term of the regression model for $T$ is…

Statistics Theory · Mathematics 2024-10-02 Gilles Crommen, Jad Beyhum, Ingrid Van Keilegom

Conditional Diffusion Models are Minimax-Optimal and Manifold-Adaptive for Conditional Distribution Estimation

We consider a class of conditional forward-backward diffusion models for conditional generative modeling, that is, generating new data given a covariate (or control variable). To formally study the theoretical properties of these…

Statistics Theory · Mathematics 2024-10-01 Rong Tang, Lizhen Lin, Yun Yang

Diagnostic checking of periodic vector autoregressive time series models with dependent errors

In this article, we study the asymptotic behaviour of the residual autocorrelations for periodic vector autoregressive time series models (PVAR henceforth) with uncorrelated but dependent innovations (i.e., weak PVAR). We then deduce the…

Statistics Theory · Mathematics 2024-10-01 Yacouba Boubacar Mainassara, Eugen Ursu

Detecting Change-points in Mean of Multivariate Time Series

This work presents a probabilistic method for analyzing linear process data with weakly dependent innovations, focusing on detecting change-points in the mean and estimating its spectral density. We develop a test for…

Statistics Theory · Mathematics 2024-10-01 Ramkrishna Jyoti Samanta

A New Perspective On Denoising Based On Optimal Transport

In the standard formulation of the denoising problem, one is given a probabilistic model relating a latent variable $\Theta \in \Omega \subset \mathbb{R}^m \; (m\ge 1)$ and an observation $Z \in \mathbb{R}^d$ according to: $Z \mid \Theta…

Statistics Theory · Mathematics 2024-10-01 Nicolas Garcia Trillos, Bodhisattva Sen

Root-n consistent semiparametric learning with high-dimensional nuisance functions under minimal sparsity

Treatment effect estimation under unconfoundedness is a fundamental task in causal inference. In response to the challenge of analyzing high-dimensional datasets collected in substantive fields such as epidemiology, genetics, economics, and…

Statistics Theory · Mathematics 2024-10-01 Lin Liu, Xinbo Wang, Yuhao Wang

Empirical partially Bayes multiple testing and compound $\chi^2$ decisions

A common task in high-throughput biology is to screen for associations across thousands of units of interest, e.g., genes or proteins. Often, the data for each unit are modeled as Gaussian measurements with unknown mean and variance and are…

Statistics Theory · Mathematics 2024-10-01 Nikolaos Ignatiadis, Bodhisattva Sen

A Sparse Beta Regression Model for Network Analysis

For statistical analysis of network data, the $\beta$-model has emerged as a useful tool, thanks to its flexibility in incorporating nodewise heterogeneity and theoretical tractability. To generalize the $\beta$-model, this paper proposes…

Statistics Theory · Mathematics 2024-10-01 Stefan Stein, Rui Feng, Chenlei Leng

Optimal Differentially Private PCA and Estimation for Spiked Covariance Matrices

Estimating a covariance matrix and its associated principal components is a fundamental problem in contemporary statistics. While optimal estimation procedures have been developed with well-understood properties, the increasing demand for…

Statistics Theory · Mathematics 2024-09-30 T. Tony Cai, Dong Xia, Mengyue Zha

One and two sample Dvoretzky-Kiefer-Wolfowitz-Massart type inequalities for differing underlying distributions

Kolmogorov-Smirnov (KS) tests rely on the convergence to zero of the KS-distance $d(F_n,G)$ in the one sample case, and of $d(F_n,G_m)$ in the two sample case. In each case the assumption (the null hypothesis) is that $F=G$, and so…

Statistics Theory · Mathematics 2024-09-27 Nicolas G. Underwood, Fabien Paillusson

Matrix variate p-value in MANOVA

The distribution functions of the matricvariate beta type I and II distributions are studied under real normed division algebras. The unified approach for real, complex, quaternions and octonions, also considers general properties and…

Statistics Theory · Mathematics 2024-09-27 José A. Díaz-García, Francisco J. Caro-Lopera

The Gaussian entropy map in valued fields

We exhibit the analog of the entropy map for multivariate Gaussian distributions on local fields. As in the real case, the image of this map lies in the supermodular cone and it determines the distribution of the valuation vector. In…

Statistics Theory · Mathematics 2024-09-27 Yassine El Maazouz

Gaussian Processes for Observational Dose-Response Inference

We adapt Gaussian processes for estimating the average dose-response function in observational settings, introducing a powerful complement to treatment effect estimation for understanding heterogeneous effects. We incorporate samples from a…

Statistics Theory · Mathematics 2024-09-26 Jake R. Dailey

Asymptotically efficient estimators for tail probabilities of extremals of $\beta$-Jacobi ensembles

In this paper, we consider the tail probabilities of extremals of the $\beta$-Jacobi ensemble, which plays an important role in multivariate analysis. The key steps in constructing estimators rely on the rate functions of large deviations.…

Statistics Theory · Mathematics 2024-09-26 Yutao Ma, Siyu Wang

The loss landscape of deep linear neural networks: a second-order analysis

We study the optimization landscape of deep linear neural networks with the square loss. It is known that, under weak assumptions, there are no spurious local minima and no local maxima. However, the existence and diversity of non-strict…

Statistics Theory · Mathematics 2024-09-26 El Mehdi Achour, François Malgouyres, Sébastien Gerchinovitz

Quasi Maximum Likelihood Estimation and Inference of Large Approximate Dynamic Factor Models via the EM algorithm

We study estimation of large Dynamic Factor models implemented through the Expectation Maximization (EM) algorithm, jointly with the Kalman smoother. We prove that as both the cross-sectional dimension, $n$, and the sample size, $T$,…

Statistics Theory · Mathematics 2024-09-26 Matteo Barigozzi, Matteo Luciani