统计理论 — Scifaro

Bernstein's inequalities for general Markov chains

We establish Bernstein's inequalities for functions of general (general-state-space and possibly non-reversible) Markov chains. These inequalities achieve sharp variance proxies and encompass the classical Bernstein inequality for…

统计理论 · 数学 2025-04-18 Bai Jiang , Qiang Sun , Jianqing Fan

Shuffled Linear Regression via Spectral Matching

Shuffled linear regression (SLR) seeks to estimate latent features through a linear transformation, complicated by unknown permutations in the measurement dimensions. This problem extends traditional least-squares (LS) and Least Absolute…

统计理论 · 数学 2025-04-17 Hang Liu , Anna Scaglione

Sub-uniformity of harmonic mean p-values

We obtain several inequalities on the generalized means of dependent p-values. In particular, the weighted harmonic mean of p-values is strictly sub-uniform under several dependence assumptions of p-values, including independence, negative…

统计理论 · 数学 2025-04-17 Yuyu Chen , Ruodu Wang , Yuming Wang , Wenhao Zhu

Variance-Aware Estimation of Kernel Mean Embedding

An important feature of kernel mean embeddings (KME) is that the rate of convergence of the empirical KME to the true distribution KME can be bounded independently of the dimension of the space, properties of the distribution and smoothness…

统计理论 · 数学 2025-04-17 Geoffrey Wolfer , Pierre Alquier

Minimax asymptotics

In this paper, we consider asymptotics of the optimal value and the optimal solutions of parametric minimax estimation problems. Specifically, we consider estimators of the optimal value and the optimal solutions in a sample minimax problem…

统计理论 · 数学 2025-04-16 Mika Meitz , Alexander Shapiro

Is model selection possible for the $\ell_p$-loss? PCO estimation for regression models

This paper addresses the problem of model selection in the sequence model $Y=\theta+\varepsilon\xi$, when $\xi$ is sub-Gaussian, for non-euclidian loss-functions. In this model, the Penalized Comparison to Overfitting procedure is studied…

统计理论 · 数学 2025-04-16 Claire Lacour , Pascal Massart , Vincent Rivoirard

On relative universality, regression operator, and conditional independence

The notion of relative universality with respect to a {\sigma}-field was introduced to establish the unbiasedness and Fisher consistency of an estimator in nonlinear sufficient dimension reduction. However, there is a gap in the proof of…

统计理论 · 数学 2025-04-16 Bing Li , Ben Jones , Andreas Artemiou

Optimal inference for the mean of random functions

We study estimation and inference for the mean of real-valued random functions defined on a hypercube. The independent random functions are observed on a discrete, random subset of design points, possibly with heteroscedastic noise. We…

统计理论 · 数学 2025-04-16 Omar Kassi , Valentin Patilea

Bayesian analysis of regression discontinuity designs with heterogeneous treatment effects

Regression Discontinuity Design (RDD) is a popular framework for estimating a causal effect in settings where treatment is assigned if an observed covariate exceeds a fixed threshold. We consider estimation and inference in the common…

统计理论 · 数学 2025-04-16 Kevin Tao , Y. Samuel Wang , David Ruppert

Improved Convergence Rate of Nested Simulation with LSE on Sieve

Nested simulation encompasses the estimation of functionals linked to conditional expectations through simulation techniques. In this paper, we treat conditional expectation as a function of the multidimensional conditioning variable and…

统计理论 · 数学 2025-04-16 Ruoxue Liu , Liang Ding , Wenjia Wang , Lu Zou

On the Optimality of Functional Sliced Inverse Regression

In this paper, we prove that functional sliced inverse regression (FSIR) achieves the optimal (minimax) rate for estimating the central space in functional sufficient dimension reduction problems. First, we provide a concentration…

统计理论 · 数学 2025-04-16 Rui Chen , Songtao Tian , Dongming Huang , Qian Lin , Jun S. Liu

Spectral estimation for high-dimensional linear processes

We propose a novel estimation procedure for certain spectral distributions associated with a class of high dimensional linear time series. The processes under consideration are of the form $X_t = \sum_{\ell=0}^\infty \mathbf{A}_\ell…

统计理论 · 数学 2025-04-15 Jamshid Namdari , Alexander Aue , Debashis Paul

Kullback-Leibler excess risk bounds for exponential weighted aggregation in Generalized linear models

Aggregation methods have emerged as a powerful and flexible framework in statistical learning, providing unified solutions across diverse problems such as regression, classification, and density estimation. In the context of generalized…

统计理论 · 数学 2025-04-15 The Tien Mai

Parameters estimation of a Threshold Chan-Karolyi-Longstaff-Sanders process from continuous and discrete observations

We consider a continuous time process that is self-exciting and ergodic, called threshold Chan-Karolyi-Longstaff-Sanders (CKLS) process. This process is a generalization of various models in econometrics, such as Vasicek model,…

统计理论 · 数学 2025-04-15 Sara Mazzonetto , Benoît Nieto

Estimation for linear parabolic SPDEs in two space dimensions with unknown damping parameters

We study parametric estimation for second order linear parabolic stochastic partial differential equations (SPDEs) in two space dimensions driven by two types of $Q$-Wiener processes based on high frequency spatio-temporal data. First, we…

统计理论 · 数学 2025-04-15 Yozo Tonaki , Yusuke Kaino , Masayuki Uchida

Estimation of Change Points for Non-linear (auto-)regressive processes using Neural Network Functions

In this paper, we propose a new test for the detection of a change in a non-linear (auto-)regressive time series as well as a corresponding estimator for the unknown time point of the change. To this end, we consider an at-most-one-change…

统计理论 · 数学 2025-04-15 Claudia Kirch , Stefanie Schwaar

Strong Consistency of Sparse K-means Clustering

In this paper, we study the strong consistency of the sparse K-means clustering for high dimensional data. We prove the consistency in both risk and clustering for the Euclidean distance. We discuss the characterization of the limit of the…

统计理论 · 数学 2025-04-15 Jeungju Kim , Johan Lim

Functional worst risk minimization

The aim of this paper is to extend worst risk minimization, also called worst average loss minimization, to the functional realm. This means finding a functional regression representation that will be robust to future distribution shifts on…

统计理论 · 数学 2025-04-15 Philip Kennerberg , Ernst C. Wit

Understanding Best Subset Selection: A Tale of Two C(omplex)ities

We consider the problem of best subset selection (BSS) under high-dimensional sparse linear regression model. Recently, Guo et al. (2020) showed that the model selection performance of BSS depends on a certain identifiability margin, a…

统计理论 · 数学 2025-04-15 Saptarshi Roy , Ambuj Tewari , Ziwei Zhu

The Identification Problem for Linear Rational Expectations Models

This version corrects a number of mistakes that appeared in the previous draft. In particular, the (EU-LREM) condition is sufficient for existence and uniqueness but not necessary, as we had claimed. We are grateful to P. C. B. Phillips and…

统计理论 · 数学 2025-04-15 Majid M. Al-Sadoon , Piotr Zwiernik