统计理论
This book develops the theory of quasi-Banach $K_\sigma$-spaces $\mathbf{F}_\psi(\Omega)$, $\mathbf{F}_\psi^*(\Omega)$, and $D_{V,W}(\Omega)$ of random variables and stochastic processes, extending the classical framework of Orlicz spaces,…
In this work, we investigate how to develop sharp concentration inequalities for sub-Weibull random variables, including sub-Gaussian and sub-exponential distributions. Although the random variables may not be sub-Guassian, the tail…
Synchronous update schemes in p-bit annealing offer a natural route to massive parallelism, but they can also induce period-2 oscillations that degrade optimization performance. In practical solvers, such oscillations matter only if they…
Conditional independence is central to modern statistics, but beyond special parametric families it rarely admits an exact covariance characterization. We introduce the binary expansion group intersection network (BEGIN), a…
Stochastic processes are often represented through orthonormal series expansions, a framework originating in the classical works of Lo\`eve and Karhunen and widely used for simulation and numerical approximation. While truncation error in…
We present the first method for assessing the relevance of a model-based clustering result in a general framework. Standard validation criteria, like the adjusted Rand index, rely on external labels to assess partition accuracy;…
We study nonparametric estimation of univariate cumulative distribution functions (CDFs) pertaining to data missing at random. The proposed estimators smooth the inverse probability weighted (IPW) empirical CDF with the Bernstein operator,…
In the present paper, we discuss for the first time the theoretical Kendall correlation coefficient for non-identical bivariate data. In the non-identical case, we first introduce a theoretical Kendall correlation coefficient $\tau_n$ and…
Under network interference, the treatment given to one unit may also affect the outcomes of its neighboring units in an exposure graph. Existing large-sample theory has focused on settings where either the exposure graph is sparse, or the…
Stochastic network models play a central role across a wide range of scientific disciplines, and questions of statistical inference arise naturally in this context. In this paper we investigate goodness-of-fit and two-sample testing…
Consider the additive Gaussian model $Y = X + \sigma Z$, where $X \sim P$ is an unknown signal, $Z \sim N(0,1)$ is independent of $X$, and $\sigma > 0$ is known. Let $Q$ denote the law of $Y$. We construct a hierarchy of denoisers $T_0,…
Finite-sample upper bounds on the estimation error of a winsorized mean estimator of the population mean in the presence of heavy tails and adversarial contamination are established. In comparison to existing results, the winsorized mean…
Recent years have witnessed much progress on Gaussian and bootstrap approximations to the distribution of sums of independent random vectors with dimension $d$ large relative to the sample size $n$. However, for any number of moments $m>2$…
We study the problem of detecting local geometry in random graphs. We introduce a model $\mathcal{G}(n, p, d, k)$, where a hidden community of average size $k$ has edges drawn as a random geometric graph on $\mathbb{S}^{d-1}$, while all…
While the mathematical foundations of score-based generative models are increasingly well understood for unconstrained Euclidean spaces, many practical applications involve data restricted to bounded domains. This paper provides a…
Recently, Forr\'e (arXiv:2104.11547, 2021) introduced transitional conditional independence, a notion of conditional independence that provides a unified framework for both random and non-stochastic variables. The original paper establishes…
Mat\'ern random fields are one of the most widely used classes of models in spatial statistics. The fixed-domain identifiability of covariance parameters for stationary Mat\'ern Gaussian random fields exhibits a dimension-dependent phase…
In this paper, we study semiparametric inference for linear multivariate Hawkes processes, a class of point processes widely used to describe self and mutually exciting phenomena. We establish a convolution theorem giving the best limiting…
\cite{tsagris2025a} proposed the generalized circular projected Cauchy (GCPC) distribution, whose special case is the wrapped Cauchy distribution. In this paper we first derive the relationship with the wrapped Cauchy distribution, and then…
We develop a central limit theorem (CLT) for a non-parametric estimator of the transition matrices in controlled Markov chains (CMCs) with finite state-action spaces. Our results establish precise conditions on the logging policy under…