统计理论 — Scifaro

SCMD: A Kernel-Based Distance for Structural Causal Models to Quantify Transferability Across Environments

Out-of-distribution generalization is key to building models that remain reliable across diverse environments. Recent causality-based methods address this challenge by learning invariant causal relationships in the underlying…

统计理论 · 数学 2025-10-24 Théotime Le Goff , Émilie Devijver

Eigenstructure inference for high-dimensional covariance with generalized shrinkage inverse-Wishart prior

In multivariate statistics, estimating the covariance matrix is essential for understanding the interdependence among variables. In high-dimensional settings, where the number of covariates increases with the sample size, it is well known…

统计理论 · 数学 2025-10-24 Seongmin Kim , Kwangmin Lee , Sewon Park , Jaeyong Lee

Transition of $\alpha$-mixing in Random Iterations with Applications in Queuing Theory

Nonlinear time series models with exogenous regressors are essential in econometrics, queuing theory, and machine learning, though their statistical analysis remains incomplete. Key results, such as the law of large numbers and the…

统计理论 · 数学 2025-10-24 Attila Lovas

ROTI-GCV: Generalized Cross-Validation for right-ROTationally Invariant Data

Two key tasks in high-dimensional regularized regression are tuning the regularization strength for accurate predictions and estimating the out-of-sample risk. It is known that the standard approach -- $k$-fold cross-validation -- is…

统计理论 · 数学 2025-10-24 Kevin Luo , Yufan Li , Pragya Sur

Statistical Inference for Linear Functionals of Online SGD in High-dimensional Linear Regression

Stochastic gradient descent (SGD) has emerged as the quintessential method in a data scientist's toolbox. Using SGD for high-stakes applications requires, however, careful quantification of the associated uncertainty. Towards that end, in…

统计理论 · 数学 2025-10-24 Bhavya Agrawalla , Krishnakumar Balasubramanian , Promit Ghosal

Probabilistic PCA on tensors

In probabilistic principal component analysis (PPCA), an observed vector is modeled as a linear transformation of a low-dimensional Gaussian factor plus isotropic noise. We generalize PPCA to tensors by constraining the loading operator to…

统计理论 · 数学 2025-10-23 Yaoming Zhen , Piotr Zwiernik

Error Analysis of Triangular Optimal Transport Maps for Filtering

We present a systematic analysis of estimation errors for a class of optimal transport based algorithms for filtering and data assimilation. Along the way, we extend previous error analyses of Brenier maps to the case of conditional Brenier…

统计理论 · 数学 2025-10-23 Mohammad Al-Jarrah , Bamdad Hosseini , Niyizhen Jin , Michele Martino , Amirhossein Taghvaei

Generalizing while preserving monotonicity in comparison-based preference learning models

If you tell a learning model that you prefer an alternative $a$ over another alternative $b$, then you probably expect the model to be monotone, that is, the valuation of $a$ increases, and that of $b$ decreases. Yet, perhaps surprisingly,…

统计理论 · 数学 2025-10-23 Julien Fageot , Peva Blanchard , Gilles Bareilles , Lê-Nguyên Hoang

Hyperparameter Selection via Early Stopping for Bayesian Semilinear PDEs

We study non-linear Bayesian inverse problems arising from semilinear partial differential equations (PDEs) that can be transformed into linear Bayesian inverse problems. We are then able to extend the early stopping for Ensemble…

统计理论 · 数学 2025-10-22 Maia Tienstra , Gottfried Hastermann

New closed-form estimators for discrete distributions

We revisit the problem of parameter estimation for discrete probability distributions with values in $\mathbb{Z}^d$. To this end, we adapt a technique called Stein's Method of Moments to discrete distributions which often gives closed-form…

统计理论 · 数学 2025-10-22 Adrian Fischer

Wasserstein projection estimators for circular distributions

For statistical models on circles, we investigate performance of estimators defined as the projections of the empirical distribution with respect to the Wasserstein distance. We develop algorithms for computing the Wasserstein projection…

统计理论 · 数学 2025-10-22 Naoki Otani , Takeru Matsuda

Consistency of Nonparametric Density Estimators in CAT(0) Orthant Space

The inference of evolutionary histories is a central problem in evolutionary biology. The analysis of a sample of phylogenetic trees can be conducted in Billera-Holmes-Vogtmann tree space, which is a CAT(0) metric space of phylogenetic…

统计理论 · 数学 2025-10-22 Yuki Takazawa , Tomonari Sei

On A Necessary Condition For Posterior Inconsistency: New Insights From A Classic Counterexample

The consistency of posterior distributions in density estimation is at the core of Bayesian statistical theory. Classical work established sufficient conditions, typically combining KL support with complexity bounds on sieves of high prior…

统计理论 · 数学 2025-10-22 Nicola Bariletto , Stephen G. Walker

Early Stopping for Ensemble Kalman-Bucy Inversion

Bayesian linear inverse problems aim to recover an unknown signal from noisy observations, incorporating prior knowledge. This paper analyses a data-dependent method to choose the scale parameter of a Gaussian prior. The method we study…

统计理论 · 数学 2025-10-22 Maia Tienstra , Sebastian Reich

On Learning the Optimal Regularization Parameter in Inverse Problems

Selecting the best regularization parameter in inverse problems is a classical and yet challenging problem. Recently, data-driven approaches have become popular to tackle this challenge. These approaches are appealing since they do require…

统计理论 · 数学 2025-10-22 Jonathan Chirinos Rodriguez , Ernesto De Vito , Cesare Molinari , Lorenzo Rosasco , Silvia Villa

Wild regenerative block bootstrap for Harris recurrent Markov chains

We consider Gaussian and bootstrap approximations for the supremum of additive functionals of aperiodic Harris recurrent Markov chains. The supremum is taken over a function class that may depend on the sample size, which allows for…

统计理论 · 数学 2025-10-21 Kyuseong Choi , Gabriella Ciolek

A Unified Approach to Statistical Estimation Under Nonlinear Observations: Tensor Estimation and Matrix Factorization

We consider the estimation of some parameter $\mathbf{x}$ living in a cone from the nonlinear observations of the form $\{y_i=f_i(\langle\mathbf{a}_i,\mathbf{x}\rangle)\}_{i=1}^m$. We develop a unified approach that first constructs a…

统计理论 · 数学 2025-10-21 Junren Chen , Lijun Ding , Dong Xia , Ming Yuan

Filtering Problem for Functionals of Stationary Processes with Missing Observations

The problem of the mean-square optimal linear estimation of the functional $A\xi=\ \int\limits_{R^s}a(t)\xi(-t)dt,$ which depends on the unknown values of stochastic stationary process $\xi(t)$ from observations of the process…

统计理论 · 数学 2025-10-21 Mykhailo Moklyachuk , Maria Sidei

Robust extrapolation problem for stochastic sequences with stationary increments

The problem of optimal estimation of functionals $A\xi =\sum\nolimits_{k=0}^{\infty }{}a(k)\xi (k)$ and ${{A}_{N}}\xi =\sum\nolimits_{k=0}^{N}{}a(k)\xi (k)$ which depend on the unknown values of stochastic sequence $\xi (k)$ with stationary…

统计理论 · 数学 2025-10-21 Maksym Luz , Mykhailo Moklyachuk

Estimating location parameters of several exponential distributions with ordered restriction under Linex loss function

Some improved estimators of the location parameters of several exponential distributions with ordered restriction are derived and compared numerically using Monte Carlo simulations. Note that the two-parameter exponential distribution is…

统计理论 · 数学 2025-10-21 Shrajal Bajpai , Lakshmi Kanta Patra , Suchandan Kayal