Statistics Theory - Scifaro

A theoretical framework for M-posteriors: frequentist guarantees and robustness properties

We provide a theoretical framework for a wide class of generalized posteriors that can be viewed as the natural Bayesian posterior counterpart of the class of M-estimators in the frequentist world. We call the members of this class…

Statistics Theory · Mathematics 2025-10-03 Juraj Marusic, Marco Avella Medina, Cynthia Rush

Quantifying and testing dependence to categorical variables

We suggest a dependence coefficient between a categorical variable and some general variable taking values in a metric space. We derive important theoretical properties and study the large sample behaviour of our suggested estimator.…

Statistics Theory · Mathematics 2025-10-03 Siegfried Hörmann, Daniel Strenger-Galvis

Optimal and Provable Calibration in High-Dimensional Binary Classification: Angular Calibration and Platt Scaling

We study the fundamental problem of calibrating a linear binary classifier of the form $\sigma(\hat{w}^\top x)$, where the feature vector $x$ is Gaussian, $\sigma$ is a link function, and $\hat{w}$ is an estimator of the true linear weight…

Statistics Theory · Mathematics 2025-10-03 Yufan Li, Pragya Sur

Mathematical Theory of Collinearity Effects on Machine Learning Variable Importance Measures

In many machine learning problems, understanding variable importance is a central concern. Two common approaches are Permute-and-Predict (PaP), which randomly permutes a feature in a validation set, and Leave-One-Covariate-Out (LOCO), which…

Statistics Theory · Mathematics 2025-10-02 Kelvyn K. Bladen, D. Richard Cutler, Alan Wisler

Stable Phase Retrieval: Optimal Rates in Poisson and Heavy-tailed Models

We investigate stable recovery guarantees for phase retrieval under two realistic and challenging noise models: the Poisson model and the heavy-tailed model. Our analysis covers both nonconvex least squares (NCVX-LS) and convex least…

Statistics Theory · Mathematics 2025-10-02 Gao Huang, Song Li, Deanna Needell

Zero variance self-normalized importance sampling via estimating equations

In ordinary importance sampling with a nonnegative integrand there exists an importance sampling strategy with zero variance. Practical sampling strategies are often based on approximating that optimal solution, potentially approaching zero…

Statistics Theory · Mathematics 2025-10-02 Art B. Owen
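
As a brief illustration of the zero-variance claim in the abstract above (a standard textbook derivation, not taken from the paper; the symbols $f$, $p$, $q^*$, and $\mu$ are notation assumed here): let $f \ge 0$ be the integrand, $p$ the nominal density, and $\mu = \int f(x)\,p(x)\,dx > 0$ the target. Sampling $X$ from the density proportional to $f p$ makes the importance-weighted integrand constant:

$$
q^*(x) = \frac{f(x)\,p(x)}{\mu}
\quad\Longrightarrow\quad
\frac{f(X)\,p(X)}{q^*(X)} = \mu \ \text{ for every } X \sim q^*,
\qquad
\operatorname{Var}_{q^*}\!\left(\frac{f(X)\,p(X)}{q^*(X)}\right) = 0.
$$

Because $q^*$ depends on the unknown $\mu$, it cannot be sampled from directly, which is why practical strategies approximate it.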

Performance of the empirical median for location estimation in heteroscedastic settings

We investigate the performance of the empirical median for location estimation in heteroscedastic settings. Specifically, we consider independent symmetric real-valued random variables that share a common but unknown location parameter…

Statistics Theory · Mathematics 2025-10-02 Sirine Louati

Extremal correlation coefficient for functional data

We propose a coefficient that measures dependence in paired samples of functions. It has properties similar to the Pearson correlation, but differs in significant ways: (i) it is designed to measure dependence between curves, (ii) it…

Statistics Theory · Mathematics 2025-10-02 Mihyun Kim, Piotr Kokoszka

Learning linear dynamical systems under convex constraints

We consider the problem of finite-time identification of linear dynamical systems from $T$ samples of a single trajectory. Recent results have predominantly focused on the setup where either no structural assumption is made on the system…

Statistics Theory · Mathematics 2025-10-02 Hemant Tyagi, Denis Efimov

A Tractable Family of Smooth Copulas with Rotational Dependence: Properties, Inference, and Application

We introduce a new family of copula densities constructed from univariate distributions on $[0,1]$. Although our construction is structurally simple, the resulting family is versatile: it includes both smooth and irregular examples, and…

Statistics Theory · Mathematics 2025-10-01 Michaël Lalancette, Robert Zimmerman

Nonparametric inference under shape constraints: past, present and future

We survey the field of nonparametric inference under shape constraints, providing a historical overview and a perspective on its current state. An outlook and some open problems offer thoughts on future directions.

Statistics Theory · Mathematics 2025-10-01 Richard J. Samworth

Monte Carlo on a single sample

In this paper, we consider a Monte Carlo simulation method (MinMC) that approximates prices and risk measures for a range $\Gamma$ of model parameters at once. The simulation method that we study has recently gained popularity [HS20, FPP22,…

Statistics Theory · Mathematics 2025-10-01 Nils Detering, Nicole Hufnagel, Paul Krühner

Nonparametric estimation of the stationary density for Hawkes-diffusion systems with known and unknown intensity

We investigate the nonparametric estimation problem of the density $\pi$, representing the stationary distribution of a two-dimensional system $\left(Z_t\right)_{t \in[0, T]}=\left(X_t, \lambda_t\right)_{t \in[0, T]}$. In this system, $X$…

Statistics Theory · Mathematics 2025-10-01 Chiara Amorino, Charlotte Dion-Blanc, Arnaud Gloter, Sarah Lemler

Consistency Theory of General Nonparametric Classification Methods in Cognitive Diagnosis

Cognitive diagnosis models are widely used in fields such as education, psychology, and the social sciences. While parametric likelihood estimation is a prevailing method for fitting cognitive diagnosis models, nonparametric…

Statistics Theory · Mathematics 2025-10-01 Chengyu Cui, Yanlong Liu, Gongjun Xu

Learning single index model with gradient descent: spectral initialization and precise asymptotics

Non-convex optimization plays a central role in many statistics and machine learning problems. Despite the landscape irregularities for general non-convex functions, some recent work showed that for many learning problems with random data…

Statistics Theory · Mathematics 2025-09-30 Yuchen Chen, Yandi Shen

Misspecified Maximum Likelihood Estimation for Non-Uniform Group Orbit Recovery

We study maximum likelihood estimation (MLE) in the generalized group orbit recovery model, where each observation is generated by applying a random group action and a known, fixed linear operator to an unknown signal, followed by additive…

Statistics Theory · Mathematics 2025-09-30 Sheng Xu, Anderson Ye Zhang, Amit Singer

Generalization Analysis for Classification on Korobov Space

In this paper, the classification algorithm arising from Tikhonov regularization is discussed. The main intention is to derive learning rates for the excess misclassification error according to the convex $\eta$-norm loss function…

Statistics Theory · Mathematics 2025-09-30 Yuqing Liu

A note on the relation between one-step, outcome regression and IPW-type estimators of parameters with the mixed bias property

Bruns-Smith et al. (2025) established an algebraic identity between the one-step estimator and a specific outcome regression-type estimator for a class of parameters that forms a strict subset of the class introduced in Chernozhukov et al.…

Statistics Theory · Mathematics 2025-09-30 Andrea Rotnitzky, Ezequiel Smucler, James M. Robins

Distinguishability of causal structures under latent confounding and selection

Statistical relationships in observed data can arise for several different reasons: the observed variables may be causally related, they may share a latent common cause, or there may be selection bias. Each of these scenarios can be…

Statistics Theory · Mathematics 2025-09-30 Ryan Carey, Marina Maciel Ansanelli, Elie Wolfe, Robin J. Evans

Bayesian Predictive Inference Beyond Martingales

There is growing interest in the so-called Bayesian Predictive Inference approach, which allows one to perform Bayesian inference without specifying the likelihood and prior of the model, and without the need for any MCMC. Instead, only a sequence of…

Statistics Theory · Mathematics 2025-09-30 Marco Battiston, Lorenzo Cappello