Statistics Theory - Scifaro

Regularized Rényi divergence minimization through Bregman proximal gradient algorithms

We study the variational inference problem of minimizing a regularized Rényi divergence over an exponential family. We propose to solve this problem with a Bregman proximal gradient algorithm. We propose a sampling-based algorithm to…

Statistics Theory · Mathematics 2024-10-17 Thomas Guilmeau, Emilie Chouzenoux, Víctor Elvira

On the lack of weak continuity of Chatterjee's correlation coefficient

Chatterjee's correlation coefficient has recently been proposed as a new association measure for bivariate random vectors that satisfies a number of desirable properties. Among these properties is the feature that the coefficient equals one…

Statistics Theory · Mathematics 2024-10-16 Axel Bücher, Holger Dette

Asymptotic Theory for Estimation of the Hüsler-Reiss Distribution via Block Maxima Method

The Hüsler-Reiss distribution describes the limit of the pointwise maxima of a bivariate normal distribution. This distribution is defined by a single parameter, $\lambda$. We provide asymptotic theory for maximum likelihood estimation of…

Statistics Theory · Mathematics 2024-10-16 Hank Flury, Jan Hannig, Richard Smith

Statistical inference of partially linear time-varying coefficients spatial autoregressive panel data model

This paper investigates a partially linear spatial autoregressive panel data model that incorporates fixed effects, constant and time-varying regression coefficients, and a time-varying spatial lag coefficient. A two-stage least squares…

Statistics Theory · Mathematics 2024-10-15 Lingling Tian, Chuanhua Wei, Mixia Wu

A non-asymptotic upper bound in prediction for the PLS estimator

We investigate the theoretical performance of the Partial Least Squares (PLS) algorithm in a high-dimensional context. We provide upper bounds on the risk in prediction for the statistical linear model when considering the PLS estimator.…

Statistics Theory · Mathematics 2024-10-15 Luca Castelli, Irène Gannaz, Clément Marteau

Testing for unspecified periodicities in binary time series

Given independent random variables $Y_1, \ldots, Y_n$ with $Y_i \in \{0,1\}$ we test the hypothesis whether the underlying success probabilities $p_i$ are constant or whether they are periodic with an unspecified period length of $r \ge 2$.…

Statistics Theory · Mathematics 2024-10-15 Finn Schmidtke, Mathias Vetter

Knockoffs for exchangeable categorical covariates

Let $X=(X_1,\ldots,X_p)$ be a $p$-variate random vector and $F$ a fixed finite set. In a number of applications, mainly in genetics, it turns out that $X_i\in F$ for each $i=1,\ldots,p$. Despite the latter fact, to obtain a knockoff…

Statistics Theory · Mathematics 2024-10-15 Emanuela Dreassi, Luca Pratelli, Pietro Rigo

Dimension-free uniform concentration bound for logistic regression

We provide a novel dimension-free uniform concentration bound for the empirical risk function of constrained logistic regression. Our bound yields a milder sufficient condition for a uniform law of large numbers than conditions derived by…

Statistics Theory · Mathematics 2024-10-15 Shogo Nakakita

The Minimax Rate of HSIC Estimation for Translation-Invariant Kernels

Kernel techniques are among the most influential approaches in data science and statistics. Under mild conditions, the reproducing kernel Hilbert space associated to a kernel is capable of encoding the independence of $M\ge 2$ random…

Statistics Theory · Mathematics 2024-10-15 Florian Kalinke, Zoltan Szabo

Existence of solutions to the nonlinear equations characterizing the precise error of M-estimators

Major progress has been made in the previous decade to characterize the asymptotic behavior of regularized M-estimators in high-dimensional regression problems in the proportional asymptotic regime where the sample size $n$ and the number…

Statistics Theory · Mathematics 2024-10-15 Pierre C. Bellec, Takuya Koriyama

A Dimension-Independent Bound on the Wasserstein Contraction Rate of a Geodesic Random Walk on the Sphere

We theoretically analyze the properties of a geodesic random walk on the Euclidean $d$-sphere. Specifically, we prove that the random walk's transition kernel is Wasserstein contractive with a contraction rate which can be bounded from…

Statistics Theory · Mathematics 2024-10-15 Philip Schär, Thilo D. Stier

Testing Independence of Infinite Dimensional Random Elements: A Sup-norm Approach

In this article, we study the test for independence of two random elements $X$ and $Y$ lying in an infinite-dimensional space $\mathcal{H}$ (specifically, a real separable Hilbert space equipped with the inner product $\langle .,…

Statistics Theory · Mathematics 2024-10-15 Suprio Bhar, Subhra Sankar Dhar

MARS via LASSO

Multivariate adaptive regression splines (MARS) is a popular method for nonparametric regression introduced by Friedman in 1991. MARS fits simple nonlinear and non-additive functions to regression data. We propose and study a natural lasso…

Statistics Theory · Mathematics 2024-10-15 Dohyeong Ki, Billy Fang, Adityanand Guntuboyina

A primer on linear classification with missing data

Supervised learning with missing data aims at building the best prediction of a target output based on partially-observed inputs. Major approaches to address this problem can be decomposed into $(i)$ impute-then-predict strategies, which…

Statistics Theory · Mathematics 2024-10-14 Angel D Reyero Lobo, Alexis Ayme, Claire Boyer, Erwan Scornet

Consistency for constrained maximum likelihood estimation and clustering based on mixtures of elliptically-symmetric distributions under general data generating processes

The consistency of the maximum likelihood estimator for mixtures of elliptically-symmetric distributions for estimating its population version is shown, where the underlying distribution $P$ is nonparametric and does not necessarily belong…

Statistics Theory · Mathematics 2024-10-14 Pietro Coretto, Christian Hennig

Generalized Median of Means Principle for Bayesian Inference

The topic of robustness is experiencing a resurgence of interest in the statistical and machine learning communities. In particular, robust algorithms making use of the so-called median of means estimator were shown to satisfy strong…

Statistics Theory · Mathematics 2024-10-14 Stanislav Minsker, Shunan Yao

Multiple conditional randomization tests for lagged and spillover treatment effects

We consider the problem of constructing multiple independent conditional randomization tests using a single dataset. Because the tests are independent, the randomization p-values can be interpreted individually and combined using standard…

Statistics Theory · Mathematics 2024-10-14 Yao Zhang, Qingyuan Zhao

With random regressors, least squares inference is robust to correlated errors with unknown correlation structure

Linear regression is arguably the most widely used statistical method. With fixed regressors and correlated errors, the conventional wisdom is to modify the variance-covariance estimator to accommodate the known correlation structure of the…

Statistics Theory · Mathematics 2024-10-11 Zifeng Zhang, Peng Ding, Wen Zhou, Haonan Wang

Equivalence of Approximate Message Passing and Low-Degree Polynomials in Rank-One Matrix Estimation

We consider the problem of estimating an unknown parameter vector ${\boldsymbol \theta}\in{\mathbb R}^n$, given noisy observations ${\boldsymbol Y} = {\boldsymbol \theta}{\boldsymbol \theta}^{\top}/\sqrt{n}+{\boldsymbol Z}$ of the rank-one…

Statistics Theory · Mathematics 2024-10-11 Andrea Montanari, Alexander S. Wein

On the maximum likelihood degree for Gaussian graphical models

In this paper we revisit the likelihood geometry of Gaussian graphical models. We give a detailed proof that the ML-degree behaves monotonically on induced subgraphs. Furthermore, we complete a missing argument that the ML-degree of the…

Statistics Theory · Mathematics 2024-10-10 Carlos Améndola, Rodica Andreea Dinu, Mateusz Michałek, Martin Vodička