Related papers: Kernel Mean Embedding Based Hypothesis Tests for C…

Bayesian Learning of Kernel Embeddings

Kernel methods are one of the mainstays of machine learning, but the problem of kernel learning remains challenging, with only a few heuristics and very little theory. This is of particular importance in methods based on estimation of…

Machine Learning · Statistics 2016-06-03 Seth Flaxman , Dino Sejdinovic , John P. Cunningham , Sarah Filippi

Comparing Foundation Models using Data Kernels

Recent advances in self-supervised learning and neural network scaling have enabled the creation of large models, known as foundation models, which can be easily adapted to a wide range of downstream tasks. The current paradigm for…

Machine Learning · Computer Science 2024-01-09 Brandon Duderstadt , Hayden S. Helm , Carey E. Priebe

Kernel Mean Embedding of Probability Measures and its Applications to Functional Data Analysis

This study intends to introduce kernel mean embedding of probability measures over infinite-dimensional separable Hilbert spaces induced by functional response statistical models. The embedded function represents the concentration of…

Statistics Theory · Mathematics 2020-11-05 Saeed Hayati , Kenji Fukumizu , Afshin Parvardeh

A Note on Optimizing Distributions using Kernel Mean Embeddings

Kernel mean embeddings are a popular tool that consists in representing probability measures by their infinite-dimensional mean embeddings in a reproducing kernel Hilbert space. When the kernel is characteristic, mean embeddings can be used…

Machine Learning · Computer Science 2021-06-29 Boris Muzellec , Francis Bach , Alessandro Rudi

Bayesian Kernel Two-Sample Testing

In modern data analysis, nonparametric measures of discrepancies between random variables are particularly important. The subject is well-studied in the frequentist literature, while the development in the Bayesian setting is limited where…

Methodology · Statistics 2022-01-25 Qinyi Zhang , Veit Wild , Sarah Filippi , Seth Flaxman , Dino Sejdinovic

Sparse Approximation of a Kernel Mean

Kernel means are frequently used to represent probability distributions in machine learning problems. In particular, the well known kernel density estimator and the kernel mean embedding both have the form of a kernel mean. Unfortunately,…

Machine Learning · Statistics 2015-03-03 E. Cruz Cortés , C. Scott

Kernel Mean Embedding of Distributions: A Review and Beyond

A Hilbert space embedding of a distribution---in short, a kernel mean embedding---has recently emerged as a powerful tool for machine learning and inference. The basic idea behind this framework is to map distributions into a reproducing…

Machine Learning · Statistics 2020-12-15 Krikamol Muandet , Kenji Fukumizu , Bharath Sriperumbudur , Bernhard Schölkopf

Towards an Explainable Comparison and Alignment of Feature Embeddings

While several feature embedding models have been developed in the literature, comparisons of these embeddings have largely focused on their numerical performance in classification-related downstream applications. However, an interpretable…

Machine Learning · Computer Science 2025-08-19 Mohammad Jalali , Bahar Dibaei Nia , Farzan Farnia

Kernel-Based Generalized Median Computation for Consensus Learning

Computing a consensus object from a set of given objects is a core problem in machine learning and pattern recognition. One popular approach is to formulate it as an optimization problem using the generalized median. Previous methods like…

Computer Vision and Pattern Recognition · Computer Science 2022-09-22 Andreas Nienkötter , Xiaoyi Jiang

A Measure-Theoretic Approach to Kernel Conditional Mean Embeddings

We present an operator-free, measure-theoretic approach to the conditional mean embedding (CME) as a random variable taking values in a reproducing kernel Hilbert space. While the kernel mean embedding of unconditional distributions has…

Machine Learning · Computer Science 2021-01-11 Junhyung Park , Krikamol Muandet

Model-Free Kernel Conformal Depth Measures Algorithm for Uncertainty Quantification in Regression Models in Separable Hilbert Spaces

Depth measures are powerful tools for defining level sets in emerging, non--standard, and complex random objects such as high-dimensional multivariate data, functional data, and random graphs. Despite their favorable theoretical properties,…

Machine Learning · Statistics 2025-06-11 Marcos Matabuena , Rahul Ghosal , Pavlo Mozharovskyi , Oscar Hernan Madrid Padilla , Jukka-Pekka Onnela

A Dictionary of Closed-Form Kernel Mean Embeddings

Kernel mean embeddings -- integrals of a kernel with respect to a probability distribution -- are essential in Bayesian quadrature, but also widely used in other computational tools for numerical integration or for statistical inference…

Machine Learning · Statistics 2025-04-29 François-Xavier Briol , Alexandra Gessner , Toni Karvonen , Maren Mahsereci

Quantum Mean Embedding of Probability Distributions

The kernel mean embedding of probability distributions is commonly used in machine learning as an injective mapping from distributions to functions in an infinite dimensional Hilbert space. It allows us, for example, to define a distance…

Quantum Physics · Physics 2019-12-24 Jonas M. Kübler , Krikamol Muandet , Bernhard Schölkopf

Variance-Aware Estimation of Kernel Mean Embedding

An important feature of kernel mean embeddings (KME) is that the rate of convergence of the empirical KME to the true distribution KME can be bounded independently of the dimension of the space, properties of the distribution and smoothness…

Statistics Theory · Mathematics 2025-04-17 Geoffrey Wolfer , Pierre Alquier

Likelihood Ratio Tests by Kernel Gaussian Embedding

We propose a novel kernel-based nonparametric two-sample test, employing the combined use of kernel mean and kernel covariance embedding. Our test builds on recent results showing how such combined embeddings map distinct probability…

Machine Learning · Statistics 2025-09-16 Leonardo V. Santoro , Victor M. Panaretos

Keep it Tighter -- A Story on Analytical Mean Embeddings

Kernel techniques are among the most popular and flexible approaches in data science allowing to represent probability measures without loss of information under mild conditions. The resulting mapping called mean embedding gives rise to a…

Machine Learning · Statistics 2024-11-27 Linda Chamakh , Zoltan Szabo

The Exact Equivalence of Distance and Kernel Methods for Hypothesis Testing

Distance-based tests, also called "energy statistics", are leading methods for two-sample and independence tests from the statistics community. Kernel-based tests, developed from "kernel mean embeddings", are leading methods for two-sample…

Machine Learning · Statistics 2024-06-27 Cencheng Shen , Joshua T. Vogelstein

Nystr\"om Kernel Mean Embeddings

Kernel mean embeddings are a powerful tool to represent probability distributions over arbitrary spaces as single points in a Hilbert space. Yet, the cost of computing and storing such embeddings prohibits their direct use in large-scale…

Machine Learning · Statistics 2022-06-16 Antoine Chatalic , Nicolas Schreuder , Alessandro Rudi , Lorenzo Rosasco

Composite Goodness-of-fit Tests with Kernels

Model misspecification can create significant challenges for the implementation of probabilistic models, and this has led to development of a range of robust methods which directly account for this issue. However, whether these more…

Machine Learning · Statistics 2025-04-22 Oscar Key , Arthur Gretton , François-Xavier Briol , Tamara Fernandez

Exact Distribution-Free Hypothesis Tests for the Regression Function of Binary Classification via Conditional Kernel Mean Embeddings

In this paper we suggest two statistical hypothesis tests for the regression function of binary classification based on conditional kernel mean embeddings. The regression function is a fundamental object in classification as it determines…

Machine Learning · Statistics 2022-06-22 Ambrus Tamás , Balázs Csanád Csáji