Related papers: Fast Randomized Kernel Methods With Statistical Gu…

Fast Statistical Leverage Score Approximation in Kernel Ridge Regression

Nyström approximation is a fast randomized method that rapidly solves kernel ridge regression (KRR) problems through sub-sampling the n-by-n empirical kernel matrix appearing in the objective function. However, the performance of such a…

Machine Learning · Statistics 2021-03-10 Yifan Chen, Yun Yang

On Fast Leverage Score Sampling and Optimal Learning

Leverage score sampling provides an appealing way to perform approximate computations for large matrices. Indeed, it allows to derive faithful approximations with a complexity adapted to the problem at hand. Yet, performing leverage scores…

Machine Learning · Statistics 2019-01-25 Alessandro Rudi, Daniele Calandriello, Luigi Carratino, Lorenzo Rosasco

Sharp analysis of low-rank kernel matrix approximations

We consider supervised learning problems within the positive-definite kernel framework, such as kernel ridge regression, kernel logistic regression or the support vector machine. With kernels leading to infinite-dimensional feature spaces,…

Machine Learning · Computer Science 2013-05-23 Francis Bach

Random Fourier Features for Kernel Ridge Regression: Approximation Bounds and Statistical Guarantees

Random Fourier features is one of the most popular techniques for scaling up kernel methods, such as kernel ridge regression. However, despite impressive empirical results, the statistical properties of random Fourier features are still not…

Machine Learning · Computer Science 2018-05-22 Haim Avron, Michael Kapralov, Cameron Musco, Christopher Musco, Ameya Velingker, Amir Zandieh

Approximate Kernel PCA Using Random Features: Computational vs. Statistical Trade-off

Kernel methods are powerful learning methodologies that allow to perform non-linear data analysis. Despite their popularity, they suffer from poor scalability in big data scenarios. Various approximation methods, including random feature…

Machine Learning · Statistics 2022-06-14 Bharath Sriperumbudur, Nicholas Sterge

Randomized sketches for kernels: Fast and optimal non-parametric regression

Kernel ridge regression (KRR) is a standard method for performing non-parametric regression over reproducing kernel Hilbert spaces. Given $n$ samples, the time and space complexity of computing the KRR estimate scale as $\mathcal{O}(n^3)$…

Machine Learning · Statistics 2015-01-27 Yun Yang, Mert Pilanci, Martin J. Wainwright

Sketch In, Sketch Out: Accelerating both Learning and Inference for Structured Prediction with Kernels

Leveraging the kernel trick in both the input and output spaces, surrogate kernel methods are a flexible and theoretically grounded solution to structured output prediction. If they provide state-of-the-art performance on complex data sets…

Machine Learning · Statistics 2024-05-07 Tamim El Ahmad, Luc Brogat-Motte, Pierre Laforgue, Florence d'Alché-Buc

Learning with Optimized Random Features: Exponential Speedup by Quantum Machine Learning without Sparsity and Low-Rank Assumptions

Kernel methods augmented with random features give scalable algorithms for learning from big data. But it has been computationally hard to sample random features according to a probability distribution that is optimized for the data, so as…

Quantum Physics · Physics 2021-11-02 Hayata Yamasaki, Sathyawageeswar Subramanian, Sho Sonoda, Masato Koashi

Fast Kernel Methods for Generic Lipschitz Losses via $p$-Sparsified Sketches

Kernel methods are learning algorithms that enjoy solid theoretical foundations while suffering from important computational limitations. Sketching, which consists in looking for solutions among a subspace of reduced dimension, is a well…

Machine Learning · Statistics 2023-11-07 Tamim El Ahmad, Pierre Laforgue, Florence d'Alché-Buc

Random Features for Kernel Approximation: A Survey on Algorithms, Theory, and Beyond

Random features is one of the most popular techniques to speed up kernel methods in large-scale problems. Related works have been recognized by the NeurIPS Test-of-Time award in 2017 and the ICML Best Paper Finalist in 2019. The body of…

Machine Learning · Statistics 2021-07-13 Fanghui Liu, Xiaolin Huang, Yudong Chen, Johan A. K. Suykens

Data Dependent Kernel Approximation using Pseudo Random Fourier Features

Kernel methods are a powerful and flexible approach to solve many problems in machine learning. Due to the pairwise evaluations in kernel methods, the complexity of kernel computation grows as the data size increases; thus the applicability…

Machine Learning · Computer Science 2017-11-28 Bharath Bhushan Damodaran, Nicolas Courty, Philippe-Henri Gosselin

A Statistical Perspective on Algorithmic Leveraging

One popular method for dealing with large-scale data sets is sampling. For example, by using the empirical statistical leverage scores as an importance sampling distribution, the method of algorithmic leveraging samples and rescales…

Methodology · Statistics 2013-06-25 Ping Ma, Michael W. Mahoney, Bin Yu

Random Fourier Features via Fast Surrogate Leverage Weighted Sampling

In this paper, we propose a fast surrogate leverage weighted sampling strategy to generate refined random Fourier features for kernel approximation. Compared to the current state-of-the-art method that uses the leverage weighted scheme…

Machine Learning · Computer Science 2019-11-22 Fanghui Liu, Xiaolin Huang, Yudong Chen, Jie Yang, Johan A. K. Suykens

On Sampling Random Features From Empirical Leverage Scores: Implementation and Theoretical Guarantees

Random features provide a practical framework for large-scale kernel approximation and supervised learning. It has been shown that data-dependent sampling of random features using leverage scores can significantly reduce the number of…

Machine Learning · Computer Science 2019-03-21 Shahin Shahrampour, Soheil Kolouri

Randomized algorithms for matrices and data

Randomized algorithms for very large matrix problems have received a great deal of attention in recent years. Much of this work was motivated by problems in large-scale data analysis, and this work was performed by individuals from many…

Data Structures and Algorithms · Computer Science 2011-11-16 Michael W. Mahoney

Optimal Rates for Random Fourier Features

Kernel methods represent one of the most powerful tools in machine learning to tackle problems expressed in terms of function values and derivatives due to their capability to represent and model complex relations. While these methods show…

Statistics Theory · Mathematics 2015-11-06 Bharath K. Sriperumbudur, Zoltan Szabo

Random Machines: A bagged-weighted support vector model with free kernel choice

Improvement of statistical learning models in order to increase efficiency in solving classification or regression problems is still a goal pursued by the scientific community. In this way, the support vector machine model is one of the…

Machine Learning · Statistics 2019-11-22 Anderson Ara, Mateus Maia, Samuel Macêdo, Francisco Louzada

Sub-quadratic Algorithms for Kernel Matrices via Kernel Density Estimation

Kernel matrices, as well as weighted graphs represented by them, are ubiquitous objects in machine learning, statistics and other related fields. The main drawback of using kernel methods (learning and inference using kernel matrices) is…

Machine Learning · Computer Science 2022-12-02 Ainesh Bakshi, Piotr Indyk, Praneeth Kacham, Sandeep Silwal, Samson Zhou

Simultaneous Model Selection and Optimization through Parameter-free Stochastic Learning

Stochastic gradient descent algorithms for training linear and kernel predictors are gaining more and more importance, thanks to their scalability. While various methods have been proposed to speed up their convergence, the model selection…

Machine Learning · Computer Science 2014-06-17 Francesco Orabona

A Framework for Statistical Inference via Randomized Algorithms

Randomized algorithms, such as randomized sketching or stochastic optimization, are a promising approach to ease the computational burden in analyzing large datasets. However, randomized algorithms also produce non-deterministic outputs,…

Methodology · Statistics 2025-05-13 Zhixiang Zhang, Sokbae Lee, Edgar Dobriban