Related papers: Large Scale Kernel Learning using Block Coordinate…

Faster Low-Rank Approximation and Kernel Ridge Regression via the Block-Nystr\"om Method

The Nystr\"om method is a popular low-rank approximation technique for large matrices that arise in kernel methods and convex optimization. Yet, when the data exhibits heavy-tailed spectral decay, the effective dimension of the problem…

Data Structures and Algorithms · Computer Science 2025-07-22 Sachin Garg , Michał Dereziński

Scalable Kernel Logistic Regression with Nystr\"om Approximation: Theoretical Analysis and Application to Discrete Choice Modelling

The application of kernel-based Machine Learning (ML) techniques to discrete choice modelling using large datasets often faces challenges due to memory requirements and the considerable number of parameters involved in these models. This…

Machine Learning · Computer Science 2024-12-04 José Ángel Martín-Baos , Ricardo García-Ródenas , Luis Rodriguez-Benitez , Michel Bierlaire

Supervising Nystr\"om Methods via Negative Margin Support Vector Selection

The Nystr\"om methods have been popular techniques for scalable kernel based learning. They approximate explicit, low-dimensional feature mappings for kernel functions from the pairwise comparisons with the training data. However, Nystr\"om…

Machine Learning · Computer Science 2018-05-21 Mert Al , Thee Chanyaswad , Sun-Yuan Kung

Hierarchically Compositional Kernels for Scalable Nonparametric Learning

We propose a novel class of kernels to alleviate the high computational cost of large-scale nonparametric learning with kernel methods. The proposed kernel is defined based on a hierarchical partitioning of the underlying data domain, where…

Machine Learning · Computer Science 2017-08-15 Jie Chen , Haim Avron , Vikas Sindhwani

A Distributed Algorithm for Training Nonlinear Kernel Machines

This paper concerns the distributed training of nonlinear kernel machines on Map-Reduce. We show that a re-formulation of Nystr\"om approximation based solution which is solved using gradient based techniques is well suited for this,…

Machine Learning · Computer Science 2014-05-20 Dhruv Mahajan , S. Sathiya Keerthi , S. Sundararajan

Less is More: Nystr\"om Computational Regularization

We study Nystr\"om type subsampling approaches to large scale kernel methods, and prove learning bounds in the statistical learning setting, where random sampling and high probability estimates are considered. In particular, we prove that…

Machine Learning · Statistics 2016-03-18 Alessandro Rudi , Raffaello Camoriano , Lorenzo Rosasco

Data-dependent compression of random features for large-scale kernel approximation

Kernel methods offer the flexibility to learn complex relationships in modern, large data sets while enjoying strong theoretical guarantees on quality. Unfortunately, these methods typically require cubic running time in the data set size,…

Machine Learning · Statistics 2019-03-01 Raj Agrawal , Trevor Campbell , Jonathan H. Huggins , Tamara Broderick

Randomized Clustered Nystrom for Large-Scale Kernel Machines

The Nystrom method has been popular for generating the low-rank approximation of kernel matrices that arise in many machine learning problems. The approximation quality of the Nystrom method depends crucially on the number of selected…

Machine Learning · Statistics 2016-12-21 Farhad Pourkamali-Anaraki , Stephen Becker

Learning Theory of Distributed Regression with Bias Corrected Regularization Kernel Network

Distributed learning is an effective way to analyze big data. In distributed regression, a typical approach is to divide the big data into multiple blocks, apply a base regression algorithm on each of them, and then simply average the…

Machine Learning · Computer Science 2017-08-08 Zhengchu Guo , Lei Shi , Qiang Wu

Scalable Kernel K-Means Clustering with Nystrom Approximation: Relative-Error Bounds

Kernel $k$-means clustering can correctly identify and extract a far more varied collection of cluster structures than the linear $k$-means clustering algorithm. However, kernel $k$-means clustering is computationally expensive when the…

Machine Learning · Computer Science 2019-02-12 Shusen Wang , Alex Gittens , Michael W. Mahoney

A Randomized Mirror Descent Algorithm for Large Scale Multiple Kernel Learning

We consider the problem of simultaneously learning to linearly combine a very large number of kernels and learn a good predictor based on the learnt kernel. When the number of kernels $d$ to be combined is very large, multiple kernel…

Machine Learning · Computer Science 2015-03-20 Arash Afkanpour , András György , Csaba Szepesvári , Michael Bowling

Diversity sampling is an implicit regularization for kernel methods

Kernel methods have achieved very good performance on large scale regression and classification problems, by using the Nystr\"om method and preconditioning techniques. The Nystr\"om approximation -- based on a subset of landmarks -- gives a…

Machine Learning · Computer Science 2020-02-21 Michaël Fanuel , Joachim Schreurs , Johan A. K. Suykens

Kernel PCA with the Nystr\"om method

The Nystr\"om method is one of the most popular techniques for improving the scalability of kernel methods. However, it has not yet been derived for kernel PCA in line with classical PCA. In this paper we derive kernel PCA with the…

Machine Learning · Statistics 2022-08-22 Fredrik Hallgren

Improved Fixed-Rank Nystr\"om Approximation via QR Decomposition: Practical and Theoretical Aspects

The Nystrom method is a popular technique that uses a small number of landmark points to compute a fixed-rank approximation of large kernel matrices that arise in machine learning problems. In practice, to ensure high quality…

Machine Learning · Statistics 2019-12-04 Farhad Pourkamali-Anaraki , Stephen Becker

Recursive Sampling for the Nystr\"om Method

We give the first algorithm for kernel Nystr\"om approximation that runs in *linear time in the number of training points* and is provably accurate for all kernel matrices, without dependence on regularity or incoherence conditions. The…

Machine Learning · Computer Science 2017-11-06 Cameron Musco , Christopher Musco

On the Feasibility of Distributed Kernel Regression for Big Data

In modern scientific research, massive datasets with huge numbers of observations are frequently encountered. To facilitate the computational process, a divide-and-conquer scheme is often used for the analysis of big data. In such a…

Machine Learning · Statistics 2015-05-06 Chen Xu , Yongquan Zhang , Runze Li

Distributed Coordinate Descent for Generalized Linear Models with Regularization

Generalized linear model with $L_1$ and $L_2$ regularization is a widely used technique for solving classification, class probability estimation and regression problems. With the numbers of both features and examples growing rapidly in the…

Machine Learning · Statistics 2017-06-28 Ilya Trofimov , Alexander Genkin

Reducing training time by efficient localized kernel regression

We study generalization properties of kernel regularized least squares regression based on a partitioning approach. We show that optimal rates of convergence are preserved if the number of local sets grows sufficiently slowly with the…

Statistics Theory · Mathematics 2019-02-26 Nicole Mücke

Fast Statistical Leverage Score Approximation in Kernel Ridge Regression

Nystr\"om approximation is a fast randomized method that rapidly solves kernel ridge regression (KRR) problems through sub-sampling the n-by-n empirical kernel matrix appearing in the objective function. However, the performance of such a…

Machine Learning · Statistics 2021-03-10 Yifan Chen , Yun Yang

Scalable Learning in Reproducing Kernel Krein Spaces

We provide the first mathematically complete derivation of the Nystr\"om method for low-rank approximation of indefinite kernels and propose an efficient method for finding an approximate eigendecomposition of such kernel matrices. Building…

Machine Learning · Statistics 2019-06-03 Dino Oglic , Thomas Gärtner