Related papers: Generalized Kernel Thinning

Supervised Kernel Thinning

The kernel thinning algorithm of Dwivedi & Mackey (2024) provides a better-than-i.i.d. compression of a generic set of points. By generating high-fidelity coresets of size significantly smaller than the input points, KT is known to speed up…

Machine Learning · Computer Science 2025-01-16 Albert Gong , Kyuseong Choi , Raaz Dwivedi

Kernel Thinning

We introduce kernel thinning, a new procedure for compressing a distribution $\mathbb{P}$ more effectively than i.i.d. sampling or standard thinning. Given a suitable reproducing kernel $\mathbf{k}_{\star}$ and $O(n^2)$ time, kernel…

Machine Learning · Statistics 2024-05-14 Raaz Dwivedi , Lester Mackey

Debiased Distribution Compression

Modern compression methods can summarize a target distribution $\mathbb{P}$ more succinctly than i.i.d. sampling but require access to a low-bias input sequence like a Markov chain converging quickly to $\mathbb{P}$. We introduce a new…

Machine Learning · Statistics 2024-08-02 Lingxiao Li , Raaz Dwivedi , Lester Mackey

RFN: A Random-Feature Based Newton Method for Empirical Risk Minimization in Reproducing Kernel Hilbert Spaces

In supervised learning using kernel methods, we often encounter a large-scale finite-sum minimization over a reproducing kernel Hilbert space (RKHS). Large-scale finite-sum problems can be solved using efficient variants of Newton method,…

Machine Learning · Computer Science 2022-06-07 Ting-Jui Chang , Shahin Shahrampour

A short note on extension theorems and their connection to universal consistency in machine learning

Statistical machine learning plays an important role in modern statistics and computer science. One main goal of statistical machine learning is to provide universally consistent algorithms, i.e., the estimator converges in probability or…

Machine Learning · Statistics 2016-04-18 Andreas Christmann , Florian Dumpert , Dao-Hong Xiang

Coefficient-based Regularized Distribution Regression

In this paper, we consider the coefficient-based regularized distribution regression which aims to regress from probability measures to real-valued responses over a reproducing kernel Hilbert space (RKHS), where the regularization is put on…

Machine Learning · Statistics 2022-08-29 Yuan Mao , Lei Shi , Zheng-Chu Guo

The Fast Kernel Transform

Kernel methods are a highly effective and widely used collection of modern machine learning algorithms. A fundamental limitation of virtually all such methods are computations involving the kernel matrix that naively scale quadratically…

Machine Learning · Computer Science 2021-06-09 John Paul Ryan , Sebastian Ament , Carla P. Gomes , Anil Damle

Kernel Mean Embedding of Distributions: A Review and Beyond

A Hilbert space embedding of a distribution---in short, a kernel mean embedding---has recently emerged as a powerful tool for machine learning and inference. The basic idea behind this framework is to map distributions into a reproducing…

Machine Learning · Statistics 2020-12-15 Krikamol Muandet , Kenji Fukumizu , Bharath Sriperumbudur , Bernhard Schölkopf

Monte Carlo with kernel-based Gibbs measures: Guarantees for probabilistic herding

Kernel herding belongs to a family of deterministic quadratures that seek to minimize the worst-case integration error over a reproducing kernel Hilbert space (RKHS). These quadrature rules come with strong experimental evidence that this…

Machine Learning · Computer Science 2025-08-12 Martin Rouault , Rémi Bardenet , Mylène Maïda

Distribution Compression in Near-linear Time

In distribution compression, one aims to accurately summarize a probability distribution $\mathbb{P}$ using a small number of representative points. Near-optimal thinning procedures achieve this goal by sampling $n$ points from a Markov…

Machine Learning · Statistics 2022-10-19 Abhishek Shetty , Raaz Dwivedi , Lester Mackey

Kernel Quantile Embeddings and Associated Probability Metrics

Embedding probability distributions into reproducing kernel Hilbert spaces (RKHS) has enabled powerful nonparametric methods such as the maximum mean discrepancy (MMD), a statistical distance with strong theoretical and computational…

Machine Learning · Statistics 2025-05-28 Masha Naslidnyk , Siu Lun Chau , François-Xavier Briol , Krikamol Muandet

Kernel Truncated Randomized Ridge Regression: Optimal Rates and Low Noise Acceleration

In this paper, we consider the nonparametric least square regression in a Reproducing Kernel Hilbert Space (RKHS). We propose a new randomized algorithm that has optimal generalization error bounds with respect to the square loss, closing a…

Machine Learning · Computer Science 2019-05-28 Kwang-Sung Jun , Ashok Cutkosky , Francesco Orabona

Reproducing kernel methods for machine learning, PDEs, and statistics

This monograph develops a unified, application-driven framework for kernel methods grounded in reproducing kernel Hilbert spaces (RKHS) and optimal transport (OT). Part I lays the theoretical and numerical foundations on positive-definite…

Numerical Analysis · Mathematics 2025-10-07 Philippe G. LeFloch , Jean-Marc Mercier , Shohruh Miryusupov

A Forward Backward Greedy approach for Sparse Multiscale Learning

Multiscale Models are known to be successful in uncovering and analyzing the structures in data at different resolutions. In the current work we propose a feature driven Reproducing Kernel Hilbert space (RKHS), for which the associated…

Machine Learning · Computer Science 2022-08-24 Prashant Shekhar , Abani Patra

Kernel Treelets

A new method for hierarchical clustering is presented. It combines treelets, a particular multiscale decomposition of data, with a projection on a reproducing kernel Hilbert space. The proposed approach, called kernel treelets (KT),…

Machine Learning · Statistics 2019-07-24 Hedi Xia , Hector D. Ceniceros

Kernel Distribution Embeddings: Universal Kernels, Characteristic Kernels and Kernel Metrics on Distributions

Kernel mean embeddings have recently attracted the attention of the machine learning community. They map measures $\mu$ from some set $M$ to functions in a reproducing kernel Hilbert space (RKHS) with kernel $k$. The RKHS distance of two…

Machine Learning · Statistics 2019-12-18 Carl-Johann Simon-Gabriel , Bernhard Schölkopf

A Note on Optimizing Distributions using Kernel Mean Embeddings

Kernel mean embeddings are a popular tool that consists in representing probability measures by their infinite-dimensional mean embeddings in a reproducing kernel Hilbert space. When the kernel is characteristic, mean embeddings can be used…

Machine Learning · Computer Science 2021-06-29 Boris Muzellec , Francis Bach , Alessandro Rudi

Keep it Tighter -- A Story on Analytical Mean Embeddings

Kernel techniques are among the most popular and flexible approaches in data science allowing to represent probability measures without loss of information under mild conditions. The resulting mapping called mean embedding gives rise to a…

Machine Learning · Statistics 2024-11-27 Linda Chamakh , Zoltan Szabo

Spectrally-truncated kernel ridge regression and its free lunch

Kernel ridge regression (KRR) is a well-known and popular nonparametric regression approach with many desirable properties, including minimax rate-optimality in estimating functions that belong to common reproducing kernel Hilbert spaces…

Machine Learning · Statistics 2019-10-15 Arash A. Amini

Notes on Kernel Methods in Machine Learning

These notes provide a self-contained introduction to kernel methods and their geometric foundations in machine learning. Starting from the construction of Hilbert spaces, we develop the theory of positive definite kernels, reproducing…

Machine Learning · Computer Science 2025-11-19 Diego Armando Pérez-Rosero , Danna Valentina Salazar-Dubois , Juan Camilo Lugo-Rojas , Andrés Marino Álvarez-Meza , Germán Castellanos-Dominguez