Related papers: Improved Convergence Rates for Sparse Approximatio…

Connections and Equivalences between the Nystr\"om Method and Sparse Variational Gaussian Processes

We investigate the connections between sparse approximation methods for making kernel methods and Gaussian processes (GPs) scalable to large-scale data, focusing on the Nystr\"om method and the Sparse Variational Gaussian Processes (SVGP).…

Machine Learning · Statistics 2023-02-09 Veit Wild , Motonobu Kanagawa , Dino Sejdinovic

Rates of Convergence for Sparse Variational Gaussian Process Regression

Excellent variational approximations to Gaussian process posteriors have been developed which avoid the $\mathcal{O}\left(N^3\right)$ scaling with dataset size $N$. They reduce the computational cost to $\mathcal{O}\left(NM^2\right)$, with…

Machine Learning · Statistics 2019-09-05 David R. Burt , Carl E. Rasmussen , Mark van der Wilk

Convergence of Sparse Variational Inference in Gaussian Processes Regression

Gaussian processes are distributions over functions that are versatile and mathematically convenient priors in Bayesian modelling. However, their use is often impeded for data with large numbers of observations, $N$, due to the cubic (in…

Machine Learning · Statistics 2020-08-04 David R. Burt , Carl Edward Rasmussen , Mark van der Wilk

A Bound on the Maximal Marginal Degrees of Freedom

Kernel ridge regression, in general, is expensive in memory allocation and computation time. This paper addresses low rank approximations and surrogates for kernel ridge regression, which bridge these difficulties. The fundamental…

Machine Learning · Statistics 2025-01-07 Paul Dommel

Fast Approximate Multi-output Gaussian Processes

Gaussian processes regression models are an appealing machine learning method as they learn expressive non-linear models from exemplar data with minimal parameter tuning and estimate both the mean and covariance of unseen points. However,…

Machine Learning · Computer Science 2020-08-25 Vladimir Joukov , Dana Kulić

Convergence of Gaussian process regression: Optimality, robustness, and relationship with kernel ridge regression

In this work, we investigate Gaussian process regression used to recover a function based on noisy observations. We derive upper and lower error bounds for Gaussian process regression with possibly misspecified correlation functions. The…

Statistics Theory · Mathematics 2022-07-20 Wenjia Wang , Bing-Yi Jing

Efficient Hyperparameter Tuning for Large Scale Kernel Ridge Regression

Kernel methods provide a principled approach to nonparametric learning. While their basic implementations scale poorly to large problems, recent advances showed that approximate solvers can efficiently handle massive datasets. A shortcoming…

Machine Learning · Computer Science 2022-01-19 Giacomo Meanti , Luigi Carratino , Ernesto De Vito , Lorenzo Rosasco

Nystr\"om Kernel Mean Embeddings

Kernel mean embeddings are a powerful tool to represent probability distributions over arbitrary spaces as single points in a Hilbert space. Yet, the cost of computing and storing such embeddings prohibits their direct use in large-scale…

Machine Learning · Statistics 2022-06-16 Antoine Chatalic , Nicolas Schreuder , Alessandro Rudi , Lorenzo Rosasco

Fast Statistical Leverage Score Approximation in Kernel Ridge Regression

Nystr\"om approximation is a fast randomized method that rapidly solves kernel ridge regression (KRR) problems through sub-sampling the n-by-n empirical kernel matrix appearing in the objective function. However, the performance of such a…

Machine Learning · Statistics 2021-03-10 Yifan Chen , Yun Yang

A Perturbation-Based Kernel Approximation Framework

Kernel methods are powerful tools in various data analysis tasks. Yet, in many cases, their time and space complexity render them impractical for large datasets. Various kernel approximation methods were proposed to overcome this issue,…

Machine Learning · Computer Science 2022-05-24 Roy Mitz , Yoel Shkolnisky

Efficient online learning with kernels for adversarial large scale problems

We are interested in a framework of online learning with kernels for low-dimensional but large-scale and potentially adversarial datasets. We study the computational and theoretical performance of online variations of kernel Ridge…

Machine Learning · Statistics 2019-05-30 Rémi Jézéquel , Pierre Gaillard , Alessandro Rudi

Sparse Approximation of a Kernel Mean

Kernel means are frequently used to represent probability distributions in machine learning problems. In particular, the well known kernel density estimator and the kernel mean embedding both have the form of a kernel mean. Unfortunately,…

Machine Learning · Statistics 2015-03-03 E. Cruz Cortés , C. Scott

Supervising Nystr\"om Methods via Negative Margin Support Vector Selection

The Nystr\"om methods have been popular techniques for scalable kernel based learning. They approximate explicit, low-dimensional feature mappings for kernel functions from the pairwise comparisons with the training data. However, Nystr\"om…

Machine Learning · Computer Science 2018-05-21 Mert Al , Thee Chanyaswad , Sun-Yuan Kung

Gaussian Process Regression under Computational and Epistemic Misspecification

Gaussian process regression is a classical kernel method for function estimation and data interpolation. In large data applications, computational costs can be reduced using low-rank or sparse approximations of the kernel. This paper…

Numerical Analysis · Mathematics 2024-10-04 Daniel Sanz-Alonso , Ruiyi Yang

Kernel PCA with the Nystr\"om method

The Nystr\"om method is one of the most popular techniques for improving the scalability of kernel methods. However, it has not yet been derived for kernel PCA in line with classical PCA. In this paper we derive kernel PCA with the…

Machine Learning · Statistics 2022-08-22 Fredrik Hallgren

Recursive Sampling for the Nystr\"om Method

We give the first algorithm for kernel Nystr\"om approximation that runs in *linear time in the number of training points* and is provably accurate for all kernel matrices, without dependence on regularity or incoherence conditions. The…

Machine Learning · Computer Science 2017-11-06 Cameron Musco , Christopher Musco

Scalable Kernel K-Means Clustering with Nystrom Approximation: Relative-Error Bounds

Kernel $k$-means clustering can correctly identify and extract a far more varied collection of cluster structures than the linear $k$-means clustering algorithm. However, kernel $k$-means clustering is computationally expensive when the…

Machine Learning · Computer Science 2019-02-12 Shusen Wang , Alex Gittens , Michael W. Mahoney

Statistical Optimality and Computational Efficiency of Nystr\"om Kernel PCA

Kernel methods provide an elegant framework for developing nonlinear learning algorithms from simple linear methods. Though these methods have superior empirical performance in several real data applications, their usefulness is inhibited…

Machine Learning · Statistics 2021-05-20 Nicholas Sterge , Bharath Sriperumbudur

Less is More: Nystr\"om Computational Regularization

We study Nystr\"om type subsampling approaches to large scale kernel methods, and prove learning bounds in the statistical learning setting, where random sampling and high probability estimates are considered. In particular, we prove that…

Machine Learning · Statistics 2016-03-18 Alessandro Rudi , Raffaello Camoriano , Lorenzo Rosasco

Near Input Sparsity Time Kernel Embeddings via Adaptive Sampling

To accelerate kernel methods, we propose a near input sparsity time algorithm for sampling the high-dimensional feature space implicitly defined by a kernel transformation. Our main contribution is an importance sampling method for…

Data Structures and Algorithms · Computer Science 2020-07-15 David P. Woodruff , Amir Zandieh