Related papers: Kernel Mean Estimation and Stein's Effect

Kernel Mean Shrinkage Estimators

A mean function in a reproducing kernel Hilbert space (RKHS), or a kernel mean, is central to kernel methods in that it is used by many classical algorithms such as kernel principal component analysis, and it also forms the core inference…

Machine Learning · Statistics 2016-02-26 Krikamol Muandet, Bharath Sriperumbudur, Kenji Fukumizu, Arthur Gretton, Bernhard Schölkopf

Kernel Mean Estimation by Marginalized Corrupted Distributions

Estimating the kernel mean in a reproducing kernel Hilbert space is a critical component in many kernel learning algorithms. Given a finite sample, the standard estimate of the target kernel mean is the empirical average. Previous works…

Machine Learning · Computer Science 2021-07-13 Xiaobo Xia, Shuo Shan, Mingming Gong, Nannan Wang, Fei Gao, Haikun Wei, Tongliang Liu

Kernel Mean Estimation via Spectral Filtering

The problem of estimating the kernel mean in a reproducing kernel Hilbert space (RKHS) is central to kernel methods in that it is used by classical approaches (e.g., when centering a kernel PCA matrix), and it also forms the core inference…

Machine Learning · Statistics 2014-11-05 Krikamol Muandet, Bharath Sriperumbudur, Bernhard Schölkopf

Data-driven Random Fourier Features using Stein Effect

Large-scale kernel approximation is an important problem in machine learning research. Approaches using random Fourier features have become increasingly popular [Rahimi and Recht, 2007], where kernel approximation is treated as empirical…

Machine Learning · Computer Science 2017-05-25 Wei-Cheng Chang, Chun-Liang Li, Yiming Yang, Barnabas Poczos

Variance-Aware Estimation of Kernel Mean Embedding

An important feature of kernel mean embeddings (KME) is that the rate of convergence of the empirical KME to the true distribution KME can be bounded independently of the dimension of the space, properties of the distribution and smoothness…

Statistics Theory · Mathematics 2025-04-17 Geoffrey Wolfer, Pierre Alquier

Kernel Mean Embedding of Distributions: A Review and Beyond

A Hilbert space embedding of a distribution, in short a kernel mean embedding, has recently emerged as a powerful tool for machine learning and inference. The basic idea behind this framework is to map distributions into a reproducing…

Machine Learning · Statistics 2020-12-15 Krikamol Muandet, Kenji Fukumizu, Bharath Sriperumbudur, Bernhard Schölkopf

Reproducing kernel Hilbert spaces in the mean field limit

Kernel methods, being supported by a well-developed theory and coming with efficient algorithms, are among the most popular and successful machine learning techniques. From a mathematical point of view, these methods rest on the concept of…

Machine Learning · Statistics 2023-03-20 Christian Fiedler, Michael Herty, Michael Rom, Chiara Segala, Sebastian Trimpe

Recursive Estimation of Conditional Kernel Mean Embeddings

Kernel mean embeddings, a widely used technique in machine learning, map probability distributions to elements of a reproducing kernel Hilbert space (RKHS). For supervised learning problems, where input-output pairs are observed, the…

Machine Learning · Statistics 2024-10-24 Ambrus Tamás, Balázs Csanád Csáji

Shrinkage Estimation of Higher Order Bochner Integrals

We consider shrinkage estimation of higher order Hilbert space valued Bochner integrals in a non-parametric setting. We propose estimators that shrink the $U$-statistic estimator of the Bochner integral towards a pre-specified target…

Statistics Theory · Mathematics 2022-07-22 Saiteja Utpala, Bharath K. Sriperumbudur

A Note on Optimizing Distributions using Kernel Mean Embeddings

Kernel mean embeddings are a popular tool that consists in representing probability measures by their infinite-dimensional mean embeddings in a reproducing kernel Hilbert space. When the kernel is characteristic, mean embeddings can be used…

Machine Learning · Computer Science 2021-06-29 Boris Muzellec, Francis Bach, Alessandro Rudi

Sparse Approximation of a Kernel Mean

Kernel means are frequently used to represent probability distributions in machine learning problems. In particular, the well known kernel density estimator and the kernel mean embedding both have the form of a kernel mean. Unfortunately,…

Machine Learning · Statistics 2015-03-03 E. Cruz Cortés, C. Scott

The Stein Effect for Frechet Means

The Frechet mean is a useful description of location for a probability distribution on a metric space that is not necessarily a vector space. This article considers simultaneous estimation of multiple Frechet means from a decision-theoretic…

Statistics Theory · Mathematics 2020-09-22 Andrew McCormack, Peter Hoff

BENK: The Beran Estimator with Neural Kernels for Estimating the Heterogeneous Treatment Effect

A method for estimating the conditional average treatment effect under condition of censored time-to-event data called BENK (the Beran Estimator with Neural Kernels) is proposed. The main idea behind the method is to apply the Beran…

Machine Learning · Computer Science 2022-11-22 Stanislav R. Kirpichenko, Lev V. Utkin, Andrei V. Konstantinov

Regularization of the Kernel Matrix via Covariance Matrix Shrinkage Estimation

The kernel trick concept, formulated as an inner product in a feature space, facilitates powerful extensions to many well-known algorithms. While the kernel matrix involves inner products in the feature space, the sample covariance matrix…

Computation · Statistics 2017-07-20 Tomer Lancewicki

Consistent Kernel Mean Estimation for Functions of Random Variables

We provide a theoretical foundation for non-parametric estimation of functions of random variables using kernel mean embeddings. We show that for any continuous function $f$, consistent estimators of the mean embedding of a random variable…

Machine Learning · Statistics 2018-06-04 Carl-Johann Simon-Gabriel, Adam Ścibior, Ilya Tolstikhin, Bernhard Schölkopf

Improving Normalization with the James-Stein Estimator

Stein's paradox holds considerable sway in high-dimensional statistics, highlighting that the sample mean, traditionally considered the de facto estimator, might not be the most efficacious in higher dimensions. To address this, the…

Computer Vision and Pattern Recognition · Computer Science 2023-12-04 Seyedalireza Khoshsirat, Chandra Kambhamettu

Nonparametric Score Estimators

Estimating the score, i.e., the gradient of log density function, from a set of samples generated by an unknown distribution is a fundamental task in inference and learning of probabilistic models that involve flexible yet intractable…

Machine Learning · Statistics 2020-07-01 Yuhao Zhou, Jiaxin Shi, Jun Zhu

Nyström Kernel Mean Embeddings

Kernel mean embeddings are a powerful tool to represent probability distributions over arbitrary spaces as single points in a Hilbert space. Yet, the cost of computing and storing such embeddings prohibits their direct use in large-scale…

Machine Learning · Statistics 2022-06-16 Antoine Chatalic, Nicolas Schreuder, Alessandro Rudi, Lorenzo Rosasco

MONK -- Outlier-Robust Mean Embedding Estimation by Median-of-Means

Mean embeddings provide an extremely flexible and powerful tool in machine learning and statistics to represent probability distributions and define a semi-metric (MMD, maximum mean discrepancy; also called N-distance or energy distance),…

Machine Learning · Statistics 2019-05-17 Matthieu Lerasle, Zoltan Szabo, Timothee Mathieu, Guillaume Lecue

High-Dimensional Multi-Task Averaging and Application to Kernel Mean Embedding

We propose an improved estimator for the multi-task averaging problem, whose goal is the joint estimation of the means of multiple distributions using separate, independent data sets. The naive approach is to take the empirical mean of each…

Machine Learning · Statistics 2020-11-16 Hannah Marienwald, Jean-Baptiste Fermanian, Gilles Blanchard