Related papers: Distributed Adaptive Sampling for Kernel Matrix Ap…

Pack only the essentials: Adaptive dictionary learning for kernel ridge regression

One of the major limits of kernel ridge regression (KRR) is that storing and manipulating the kernel matrix K_n for n samples requires O(n^2) space, which rapidly becomes unfeasible for large n. Nystrom approximations reduce the space…

Machine Learning · Statistics 2026-04-27 Daniele Calandriello , Alessandro Lazaric , Michal Valko

Scalable Kernel Clustering: Approximate Kernel k-means

Kernel-based clustering algorithms have the ability to capture the non-linear structure in real world data. Among various kernel-based clustering algorithms, kernel k-means has gained popularity due to its simple iterative nature and ease…

Computer Vision and Pattern Recognition · Computer Science 2014-02-18 Radha Chitta , Rong Jin , Timothy C. Havens , Anil K. Jain

Is Input Sparsity Time Possible for Kernel Low-Rank Approximation?

Low-rank approximation is a common tool used to accelerate kernel methods: the $n \times n$ kernel matrix $K$ is approximated via a rank-$k$ matrix $\tilde K$ which can be stored in much less space and processed more quickly. In this work…

Data Structures and Algorithms · Computer Science 2017-11-07 Cameron Musco , David P. Woodruff

oASIS: Adaptive Column Sampling for Kernel Matrix Approximation

Kernel matrices (e.g. Gram or similarity matrices) are essential for many state-of-the-art approaches to classification, clustering, and dimensionality reduction. For large datasets, the cost of forming and factoring such kernel matrices…

Machine Learning · Statistics 2015-05-21 Raajen Patel , Thomas A. Goldstein , Eva L. Dyer , Azalia Mirhoseini , Richard G. Baraniuk

Multiresolution Kernel Approximation for Gaussian Process Regression

Gaussian process regression generally does not scale to beyond a few thousands data points without applying some sort of kernel approximation method. Most approximations focus on the high eigenvalue part of the spectrum of the kernel…

Machine Learning · Statistics 2018-01-31 Yi Ding , Risi Kondor , Jonathan Eskreis-Winkler

New efficient algorithms for multiple change-point detection with kernels

Several statistical approaches based on reproducing kernels have been proposed to detect abrupt changes arising in the full distribution of the observations and not only in the mean or variance. Some of these approaches enjoy good…

Statistics Theory · Mathematics 2017-10-13 Alain Celisse , Guillemette Marot , Morgane Pierre-Jean , Guillem Rigaill

Efficient Dataset Distillation Using Random Feature Approximation

Dataset distillation compresses large datasets into smaller synthetic coresets which retain performance with the aim of reducing the storage and computational burden of processing the entire dataset. Today's best-performing algorithm,…

Machine Learning · Computer Science 2022-10-24 Noel Loo , Ramin Hasani , Alexander Amini , Daniela Rus

Linear-time Learning on Distributions with Approximate Kernel Embeddings

Many interesting machine learning problems are best posed by considering instances that are distributions, or sample sets drawn from distributions. Previous work devoted to machine learning tasks with distributional inputs has done so…

Machine Learning · Statistics 2021-01-15 Danica J. Sutherland , Junier B. Oliva , Barnabás Póczos , Jeff Schneider

Recursive Sampling for the Nystr\"om Method

We give the first algorithm for kernel Nystr\"om approximation that runs in *linear time in the number of training points* and is provably accurate for all kernel matrices, without dependence on regularity or incoherence conditions. The…

Machine Learning · Computer Science 2017-11-06 Cameron Musco , Christopher Musco

Fast Randomized Kernel Methods With Statistical Guarantees

One approach to improving the running time of kernel-based machine learning methods is to build a small sketch of the input and use it in lieu of the full kernel matrix in the machine learning task of interest. Here, we describe a version…

Machine Learning · Statistics 2015-11-10 Ahmed El Alaoui , Michael W. Mahoney

Fast Approximation of Similarity Graphs with Kernel Density Estimation

Constructing a similarity graph from a set $X$ of data points in $\mathbb{R}^d$ is the first step of many modern clustering algorithms. However, typical constructions of a similarity graph have high time complexity, and a quadratic space…

Data Structures and Algorithms · Computer Science 2023-10-24 Peter Macgregor , He Sun

Sparse Approximation of a Kernel Mean

Kernel means are frequently used to represent probability distributions in machine learning problems. In particular, the well known kernel density estimator and the kernel mean embedding both have the form of a kernel mean. Unfortunately,…

Machine Learning · Statistics 2015-03-03 E. Cruz Cortés , C. Scott

Fast Kernel k-means Clustering Using Incomplete Cholesky Factorization

Kernel-based clustering algorithm can identify and capture the non-linear structure in datasets, and thereby it can achieve better performance than linear clustering. However, computing and storing the entire kernel matrix occupy so large…

Machine Learning · Computer Science 2020-02-10 Li Chen , Shuisheng Zhou , Jiajun Ma

NNK-Means: Data summarization using dictionary learning with non-negative kernel regression

An increasing number of systems are being designed by gathering significant amounts of data and then optimizing the system parameters directly using the obtained data. Often this is done without analyzing the dataset structure. As task…

Machine Learning · Computer Science 2022-06-14 Sarath Shekkizhar , Antonio Ortega

Efficient Geometric-based Computation of the String Subsequence Kernel

Kernel methods are powerful tools in machine learning. They have to be computationally efficient. In this paper, we present a novel Geometric-based approach to compute efficiently the string subsequence kernel (SSK). Our main idea is that…

Machine Learning · Computer Science 2015-03-02 Slimane Bellaouar , Hadda Cherroun , Djelloul Ziadi

Debiased Distribution Compression

Modern compression methods can summarize a target distribution $\mathbb{P}$ more succinctly than i.i.d. sampling but require access to a low-bias input sequence like a Markov chain converging quickly to $\mathbb{P}$. We introduce a new…

Machine Learning · Statistics 2024-08-02 Lingxiao Li , Raaz Dwivedi , Lester Mackey

FALKON: An Optimal Large Scale Kernel Method

Kernel methods provide a principled way to perform non linear, nonparametric learning. They rely on solid functional analytic foundations and enjoy optimal statistical properties. However, at least in their basic form, they have limited…

Machine Learning · Statistics 2018-02-01 Alessandro Rudi , Luigi Carratino , Lorenzo Rosasco

Learning the kernel matrix via predictive low-rank approximations

Efficient and accurate low-rank approximations of multiple data sources are essential in the era of big data. The scaling of kernel-based learning algorithms to large datasets is limited by the O(n^2) computation and storage complexity of…

Machine Learning · Computer Science 2020-12-10 Martin Stražar , Tomaž Curk

Linear Time Kernel Matrix Approximation via Hyperspherical Harmonics

We propose a new technique for constructing low-rank approximations of matrices that arise in kernel methods for machine learning. Our approach pairs a novel automatically constructed analytic expansion of the underlying kernel function…

Machine Learning · Computer Science 2022-02-09 John Paul Ryan , Anil Damle

Sub-quadratic Algorithms for Kernel Matrices via Kernel Density Estimation

Kernel matrices, as well as weighted graphs represented by them, are ubiquitous objects in machine learning, statistics and other related fields. The main drawback of using kernel methods (learning and inference using kernel matrices) is…

Machine Learning · Computer Science 2022-12-02 Ainesh Bakshi , Piotr Indyk , Praneeth Kacham , Sandeep Silwal , Samson Zhou