统计理论
Sliced Wasserstein distances are widely used in practice as a computationally efficient alternative to Wasserstein distances in high dimensions. In this paper, motivated by theoretical foundations of this alternative, we prove quantitative…
Many researchers have applied classical statistical decision theory to evaluate treatment choices and learn optimal policies. However, because this framework is based solely on realized outcomes under chosen decisions and ignores…
This paper addresses the challenge of integrating sequentially arriving data within the quantile regression framework, where the number of features is allowed to grow with the number of observations, the horizon is unknown, and memory is…
This paper revisits the classical problem of determining the bias of a weighted coin, where the bias is known to be either $p = 1/2 + \varepsilon$ or $p = 1/2 - \varepsilon$, while minimizing the expected number of coin tosses and the error…
Extending rank-based inference to a multivariate setting such as multiple-output regression or MANOVA with unspecified d-dimensional error density has remained an open problem for more than half a century. None of the many solutions…
Model selection criteria are one of the most important tools in statistics. Proofs showing a model selection criterion is asymptotically optimal are tailored to the type of model (linear regression, quantile regression, penalized…
We consider a new method for estimating the parameters of univariate Gaussian mixture models. The method relies on a nonparametric density estimator $\hat{f}_n$ (typically a kernel estimator). For every set of Gaussian mixture components,…
The geometrical structure of PLS shrinkages is here considered. Firstly, an explicit formula for the shrinkage vector is provided. In that expression, shrinkage factors are expressed a averages of a set of basic shrinkages that depend only…
This paper deals with the problem of optimal mean-square filtering of the linear functionals $A{\xi}=\int_{0}^{\infty}a(t)\xi(-t)dt$ and $A_T{\xi}=\int_{0}^Ta(t)\xi(-t)dt$ which depend on the unknown values of random process $\xi(t)$ with…
The problem of optimal estimation of linear functionals $A {\xi}=\int_{0}^{\infty} a(t)\xi(t)dt$ and $A_T{\xi}=\int_{0}^{T} a(t)\xi(t)dt$ depending on the unknown values of random process $\xi(t)$, $t\in R$, with stationary $n$th increments…
Reproducing Kernel Hilbert Space (RKHS) embedding of probability distributions has proved to be an effective approach, via MMD (maximum mean discrepancy), for nonparametric hypothesis testing problems involving distributions defined over…
A fundamental problem in statistics is estimating the shape matrix of an Elliptical distribution. This generalizes the familiar problem of Gaussian covariance estimation, for which the sample covariance achieves optimal estimation error.…
Asymptotic efficiency theory is one of the pillars in the foundations of modern mathematical statistics. Not only does it serve as a rigorous theoretical benchmark for evaluating statistical methods, but it also sheds light on how to…
The generalized Marshall-Olkin Lomax distribution is introduced, and its properties are easily obtained from those of the Lomax distribution. A regression model for censored data is proposed. The parameters are estimated through maximum…
Traditional likelihood based methods for parameter estimation get highly affected when the given data is contaminated by outliers even in a small proportion. In this paper, we consider a robust parameter estimation method, namely the…
From the perspective of data reduction, the notions of minimal sufficient and complete statistics together play an important role in determining optimal statistics (estimators). The classical notion of sufficiency and completeness are not…
We consider the modulation of data given by random vectors $X_n \in \mathbb{R}^{d_n}$, $n \in \mathbb{N}$. For each $X_n$, one chooses an independent modulating random vector $\Xi_n \in \mathbb{R}^{d_n}$ and forms the projection $Y_n =…
This paper deals with a Skorokhod's integral based projection type estimator $\widehat b_m$ of the drift function $b_0$ computed from $N\in\mathbb N^*$ independent copies $X^1,\dots,X^N$ of the solution $X$ of $dX_t = b_0(X_t)dt +\sigma…
This paper deals with nonparametric estimators of the drift function $b$ computed from independent continuous observations, on a compact time interval, of the solution of a stochastic differential equation driven by the fractional Brownian…
Weak convergence of joint distributions generally does not imply convergence of conditional distributions. In particular, conditional distributions need not converge when joint Gaussian distributions converge to a singular Gaussian limit.…