Statistics Theory - Scifaro

Statistically guided deep learning

We present a theoretically well-founded deep learning algorithm for nonparametric regression. It uses over-parametrized deep neural networks with logistic activation function, which are fitted to the given data via gradient descent. We…

Statistics Theory · Mathematics 2025-04-14 Michael Kohler, Adam Krzyzak

Minimax-optimal and Locally-adaptive Online Nonparametric Regression

We study adversarial online nonparametric regression with general convex losses and propose a parameter-free learning algorithm that achieves minimax optimal rates. Our approach leverages chaining trees to compete against Hölder…

Statistics Theory · Mathematics 2025-04-14 Paul Liautaud, Pierre Gaillard, Olivier Wintenberger

Note on the identification of total effect in Cluster-DAGs with cycles

In this note, we discuss the identifiability of a total effect in cluster-DAGs, allowing for cycles within the cluster-DAG (while still assuming the associated underlying DAG to be acyclic). This is presented in two key results: first,…

Statistics Theory · Mathematics 2025-04-11 Clément Yvernes

A GARMA Framework for Unit-Bounded Time Series Based on the Unit-Lindley Distribution with Application to Renewable Energy Data

The Unit-Lindley is a one-parameter family of distributions in $(0,1)$ obtained from an appropriate transformation of the Lindley distribution. In this work, we introduce a class of dynamical time series models for continuous random…

Statistics Theory · Mathematics 2025-04-11 Guilherme Pumi, Danilo Hiroshi Matsuoka, Taiane Schaedler Prass

Advances in Bayesian model selection consistency for high-dimensional generalized linear models

Uncovering genuine relationships between a response variable of interest and a large collection of covariates is a fundamental and practically important problem. In the context of Gaussian linear models, both the Bayesian and non-Bayesian…

Statistics Theory · Mathematics 2025-04-11 Jeyong Lee, Minwoo Chae, Ryan Martin

Assessment of the quality of a prediction

Shannon defined the mutual information between two variables. We illustrate why the true mutual information between a variable and the predictions made by a prediction algorithm is not a suitable measure of prediction quality, but the…

Statistics Theory · Mathematics 2025-04-11 Roger Sewell

An Unbiased Variance Estimator with Denominator $N$

Standard practice obtains an unbiased variance estimator by dividing by $N-1$ rather than $N$. Yet if only half the data are used to compute the mean, dividing by $N$ can still yield an unbiased estimator. We show that an alternative mean…

Statistics Theory · Mathematics 2025-04-10 Dai Akita
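
The half-sample claim in the abstract of "An Unbiased Variance Estimator with Denominator $N$" can be checked exactly on a toy case. The sketch below is not the paper's construction (the abstract is truncated here); it assumes that "half the data" means the first $N/2$ observations of an i.i.d. sample with $N$ even, and that squared deviations from that half-sample mean are summed over all $N$ observations and divided by $N$. The function names and the small discrete distribution are hypothetical, chosen only so that the expectation can be enumerated with exact fractions.

```python
from fractions import Fraction
from itertools import product


def half_mean_variance_estimate(xs):
    """Sum squared deviations of all N points from the mean of the
    first N/2 points only, then divide by N (assumes N is even)."""
    n = len(xs)
    half = n // 2
    m = Fraction(sum(xs[:half]), half)  # mean of the first half only
    return sum((Fraction(x) - m) ** 2 for x in xs) / n


def exact_expectation(support, n):
    """Exact expectation of the estimator for n i.i.d. draws that are
    uniform on `support`, obtained by enumerating every outcome."""
    outcomes = list(product(support, repeat=n))
    total = sum(half_mean_variance_estimate(xs) for xs in outcomes)
    return total / len(outcomes)


def population_variance(support):
    """Variance of a single draw that is uniform on `support`."""
    mu = Fraction(sum(support), len(support))
    return sum((Fraction(x) - mu) ** 2 for x in support) / len(support)


# Hypothetical toy distribution: uniform on {0, 1, 3}, with N = 4.
support = (0, 1, 3)
n = 4
print(exact_expectation(support, n), population_variance(support))
```

Under these assumptions the two printed values agree: each first-half point contributes $\sigma^2(1 - 2/N)$ in expectation, each second-half point contributes $\sigma^2(1 + 2/N)$, and the $N$ contributions sum to $N\sigma^2$.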

Bounds in Wasserstein Distance for Locally Stationary Functional Time Series

Functional time series (FTS) extend traditional methodologies to accommodate data observed as functions/curves. A significant challenge in FTS consists of accurately capturing the time-dependence structure, especially with the presence of…

Statistics Theory · Mathematics 2025-04-10 Jan Nino G. Tinio, Mokhtar Z. Alaya, Salim Bouzebda

Differentially Private Joint Independence Test

Identification of joint dependence among more than two random vectors plays an important role in many statistical applications, where the data may contain sensitive or confidential information. In this paper, we consider the…

Statistics Theory · Mathematics 2025-04-10 Xingwei Liu, Yuexin Chen, Wangli Xu

Zero patterns in multi-way binary contingency tables with uniform margins

We study the problem of transforming a multi-way contingency table into an equivalent table with uniform margins and the same dependence structure. This is an old question which relates to recent advances in copula modeling for discrete random…

Statistics Theory · Mathematics 2025-04-10 Roberto Fontana, Elisa Perrone, Fabio Rapallo

Sparse PCA: Phase Transitions in the Critical Regime

This work studies estimation of sparse principal components in high dimensions. Specifically, we consider a class of estimators based on kernel PCA, generalizing the covariance thresholding algorithm proposed by Krauthgamer et al. (2015).…

Statistics Theory · Mathematics 2025-04-10 Michael J. Feldman, Theodor Misiakiewicz, Elad Romanov

To ignore dependencies is perhaps not a sin

We present a result according to which certain functions of covariance matrices are maximized at scalar multiples of the identity matrix. In a statistical context in which such functions measure loss, this says that the least favourable…

Statistics Theory · Mathematics 2025-04-10 Douglas P. Wiens

Semi-parametric Bernstein-von Mises in Linear Inverse Problems

We consider a Bayesian approach for the recovery of scalar parameters arising in inverse problems. We consider a general signal-in-white-noise model where we have access to two independent noisy observations of a function, and of a linear…

Statistics Theory · Mathematics 2025-04-10 Adel Magra, Aad van der Vaart, Harry van Zanten

Nonparametric local polynomial regression for functional covariates

We consider nonparametric regression with functional covariates, that is, they are elements of an infinite-dimensional Hilbert space. A locally polynomial estimator is constructed, where an orthonormal basis and various tuning parameters…

Statistics Theory · Mathematics 2025-04-09 Moritz Jirak, Alois Kneip, Alexander Meister, Mario Pahl

Revisiting poverty measures using quantile functions

In this article we redefine various poverty measures from the literature in terms of quantile functions, instead of the distribution functions used in the prevailing approach. This enables provision for alternative methodology for poverty measurement and…

Statistics Theory · Mathematics 2025-04-09 N. Unnikrishnan Nair, S. M. Sunoj

Truncated sequential guaranteed estimation for the Cox-Ingersoll-Ross models

The drift sequential parameter estimation problems for the Cox-Ingersoll-Ross (CIR) processes under the limited duration of observation are studied. Truncated sequential estimation methods for both scalar and two-dimensional parameter…

Statistics Theory · Mathematics 2025-04-08 Mohamed Ben Alaya, Thi-Bao Trâm Ngô, Serguei Pergamenchtchikov

Extension of Yager's negation of probability distribution based on uncertainty measures

Existing research on negations primarily focuses on entropy and extropy. Recently, new functions such as varentropy and varextropy have been developed, which can be considered as extensions of entropy and extropy. However, the impact of…

Statistics Theory · Mathematics 2025-04-08 Santosh Kumar Chaudhary, Pradeep Kumar Sahu, Nitin Gupta

Gaussian Mean Testing under Truncation

We consider the task of Gaussian mean testing, that is, of testing whether a high-dimensional vector perturbed by white noise has large magnitude, or is the zero vector. This question, originating from the signal processing community, has…

Statistics Theory · Mathematics 2025-04-08 Clément L. Canonne, Themis Gouleakis, Yuhao Wang, Joy Qiping Yang

Common Drivers in Sparsely Interacting Hawkes Processes

We study a multivariate Hawkes process as a model for time-continuous relational event networks. The model does not assume the network to be known, it includes covariates, and it allows for both common drivers, parameters common to all the…

Statistics Theory · Mathematics 2025-04-08 Alexander Kreiss, Enno Mammen, Wolfgang Polonik

Asymptotics for estimating a diverging number of parameters -- with and without sparsity

We consider high-dimensional estimation problems where the number of parameters diverges with the sample size. General conditions are established for consistency, uniqueness, and asymptotic normality in both unpenalized and penalized…

Statistics Theory · Mathematics 2025-04-08 Jana Gauss, Thomas Nagler