Related papers: Generalization Guarantees for Sparse Kernel Approx…

Gaussian Quadrature for Kernel Features

Kernel methods have recently attracted resurgent interest, showing performance competitive with deep neural networks in tasks such as speech recognition. The random Fourier features map is a technique commonly used to scale up kernel…

Machine Learning · Computer Science 2018-02-01 Tri Dao , Christopher De Sa , Christopher Ré

Kernel Thinning

We introduce kernel thinning, a new procedure for compressing a distribution $\mathbb{P}$ more effectively than i.i.d. sampling or standard thinning. Given a suitable reproducing kernel $\mathbf{k}_{\star}$ and $O(n^2)$ time, kernel…

Machine Learning · Statistics 2024-05-14 Raaz Dwivedi , Lester Mackey

Improved Convergence Rates for Sparse Approximation Methods in Kernel-Based Learning

Kernel-based models such as kernel ridge regression and Gaussian processes are ubiquitous in machine learning applications for regression and optimization. It is well known that a major downside for kernel-based models is the high…

Machine Learning · Computer Science 2022-06-22 Sattar Vakili , Jonathan Scarlett , Da-shan Shiu , Alberto Bernacchia

Learning with Optimized Random Features: Exponential Speedup by Quantum Machine Learning without Sparsity and Low-Rank Assumptions

Kernel methods augmented with random features give scalable algorithms for learning from big data. But it has been computationally hard to sample random features according to a probability distribution that is optimized for the data, so as…

Quantum Physics · Physics 2021-11-02 Hayata Yamasaki , Sathyawageeswar Subramanian , Sho Sonoda , Masato Koashi

Sparse Approximation of a Kernel Mean

Kernel means are frequently used to represent probability distributions in machine learning problems. In particular, the well known kernel density estimator and the kernel mean embedding both have the form of a kernel mean. Unfortunately,…

Machine Learning · Statistics 2015-03-03 E. Cruz Cortés , C. Scott

Near Input Sparsity Time Kernel Embeddings via Adaptive Sampling

To accelerate kernel methods, we propose a near input sparsity time algorithm for sampling the high-dimensional feature space implicitly defined by a kernel transformation. Our main contribution is an importance sampling method for…

Data Structures and Algorithms · Computer Science 2020-07-15 David P. Woodruff , Amir Zandieh

Optimal Rates for Random Fourier Features

Kernel methods represent one of the most powerful tools in machine learning to tackle problems expressed in terms of function values and derivatives due to their capability to represent and model complex relations. While these methods show…

Statistics Theory · Mathematics 2015-11-06 Bharath K. Sriperumbudur , Zoltan Szabo

On Kernel Derivative Approximation with Random Fourier Features

Random Fourier features (RFF) represent one of the most popular and wide-spread techniques in machine learning to scale up kernel algorithms. Despite the numerous successful applications of RFFs, unfortunately, quite little is understood…

Machine Learning · Statistics 2019-02-12 Zoltan Szabo , Bharath K. Sriperumbudur

Exact Gaussian Processes for Massive Datasets via Non-Stationary Sparsity-Discovering Kernels

A Gaussian Process (GP) is a prominent mathematical framework for stochastic function approximation in science and engineering applications. This success is largely attributed to the GP's analytical tractability, robustness, non-parametric…

Machine Learning · Statistics 2022-05-19 Marcus M. Noack , Harinarayan Krishnan , Mark D. Risser , Kristofer G. Reyes

Sparse Feature Selection in Kernel Discriminant Analysis via Optimal Scoring

We consider the two-group classification problem and propose a kernel classifier based on the optimal scoring framework. Unlike previous approaches, we provide theoretical guarantees on the expected risk consistency of the method. We also…

Machine Learning · Statistics 2021-04-01 Alexander F. Lapanowski , Irina Gaynanova

Low-Precision Random Fourier Features for Memory-Constrained Kernel Approximation

We investigate how to train kernel approximation methods that generalize well under a memory budget. Building on recent theoretical work, we define a measure of kernel approximation error which we find to be more predictive of the empirical…

Machine Learning · Computer Science 2019-03-21 Jian Zhang , Avner May , Tri Dao , Christopher Ré

Learning Bounds for Greedy Approximation with Explicit Feature Maps from Multiple Kernels

Nonlinear kernels can be approximated using finite-dimensional feature maps for efficient risk minimization. Due to the inherent trade-off between the dimension of the (mapped) feature space and the approximation accuracy, the key problem…

Machine Learning · Computer Science 2018-10-10 Shahin Shahrampour , Vahid Tarokh

Random Fourier Features for Kernel Ridge Regression: Approximation Bounds and Statistical Guarantees

Random Fourier features is one of the most popular techniques for scaling up kernel methods, such as kernel ridge regression. However, despite impressive empirical results, the statistical properties of random Fourier features are still not…

Machine Learning · Computer Science 2018-05-22 Haim Avron , Michael Kapralov , Cameron Musco , Christopher Musco , Ameya Velingker , Amir Zandieh

Denoising Score Matching with Random Fourier Features

The density estimation is one of the core problems in statistics. Despite this, existing techniques like maximum likelihood estimation are computationally inefficient due to the intractability of the normalizing constant. For this reason an…

Machine Learning · Computer Science 2021-01-14 Tsimboy Olga , Yermek Kapushev , Evgeny Burnaev , Ivan Oseledets

On the Error of Random Fourier Features

Kernel methods give powerful, flexible, and theoretically grounded approaches to solving many problems in machine learning. The standard approach, however, requires pairwise evaluations of a kernel function, which can lead to scalability…

Machine Learning · Computer Science 2021-04-08 Danica J. Sutherland , Jeff Schneider

On the Equivalence between Kernel Quadrature Rules and Random Feature Expansions

We show that kernel-based quadrature rules for computing integrals can be seen as a special case of random feature expansions for positive definite kernels, for a particular decomposition that always exists for such kernels. We provide a…

Machine Learning · Computer Science 2015-11-10 Francis Bach

Orthogonal Random Features

We present an intriguing discovery related to Random Fourier Features: in Gaussian kernel approximation, replacing the random Gaussian matrix by a properly scaled random orthogonal matrix significantly decreases kernel approximation error.…

Machine Learning · Computer Science 2016-10-31 Felix X. Yu , Ananda Theertha Suresh , Krzysztof Choromanski , Daniel Holtmann-Rice , Sanjiv Kumar

Nystr\"om Kernel Mean Embeddings

Kernel mean embeddings are a powerful tool to represent probability distributions over arbitrary spaces as single points in a Hilbert space. Yet, the cost of computing and storing such embeddings prohibits their direct use in large-scale…

Machine Learning · Statistics 2022-06-16 Antoine Chatalic , Nicolas Schreuder , Alessandro Rudi , Lorenzo Rosasco

Entropy of Overcomplete Kernel Dictionaries

In signal analysis and synthesis, linear approximation theory considers a linear decomposition of any given signal in a set of atoms, collected into a so-called dictionary. Relevant sparse representations are obtained by relaxing the…

Information Theory · Computer Science 2014-11-04 Paul Honeine

Generalization Bounds for Sparse Random Feature Expansions

Random feature methods have been successful in various machine learning tasks, are easy to compute, and come with theoretical accuracy bounds. They serve as an alternative approach to standard neural networks since they can represent…

Machine Learning · Statistics 2026-01-21 Abolfazl Hashemi , Hayden Schaeffer , Robert Shi , Ufuk Topcu , Giang Tran , Rachel Ward