Related papers: Sparse solutions of the kernel herding algorithm b…

On the Equivalence between Herding and Conditional Gradient Algorithms

We show that the herding procedure of Welling (2009) takes exactly the form of a standard convex optimization algorithm--namely a conditional gradient algorithm minimizing a quadratic moment discrepancy. This link enables us to invoke…

Machine Learning · Computer Science 2012-09-12 Francis Bach , Simon Lacoste-Julien , Guillaume Obozinski

Sparse Multiple Kernel Learning with Geometric Convergence Rate

In this paper, we study the problem of sparse multiple kernel learning (MKL), where the goal is to efficiently learn a combination of a fixed small number of kernels from a large pool that could lead to a kernel classifier with a small…

Machine Learning · Computer Science 2013-02-05 Rong Jin , Tianbao Yang , Mehrdad Mahdavi

Sparse Approximation of a Kernel Mean

Kernel means are frequently used to represent probability distributions in machine learning problems. In particular, the well known kernel density estimator and the kernel mean embedding both have the form of a kernel mean. Unfortunately,…

Machine Learning · Statistics 2015-03-03 E. Cruz Cortés , C. Scott

Sparse multiresolution representations with adaptive kernels

Reproducing kernel Hilbert spaces (RKHSs) are key elements of many non-parametric tools successfully used in signal processing, statistics, and machine learning. In this work, we aim to address three issues of the classical RKHS based…

Signal Processing · Electrical Eng. & Systems 2019-05-09 Maria Peifer , Luiz. F. O. Chamon , Santiago Paternain , Alejandro Ribeiro

Fully symmetric kernel quadrature

Kernel quadratures and other kernel-based approximation methods typically suffer from prohibitive cubic time and quadratic space complexity in the number of function evaluations. The problem arises because a system of linear equations needs…

Numerical Analysis · Mathematics 2018-01-09 Toni Karvonen , Simo Särkkä

Monte Carlo with kernel-based Gibbs measures: Guarantees for probabilistic herding

Kernel herding belongs to a family of deterministic quadratures that seek to minimize the worst-case integration error over a reproducing kernel Hilbert space (RKHS). These quadrature rules come with strong experimental evidence that this…

Machine Learning · Computer Science 2025-08-12 Martin Rouault , Rémi Bardenet , Mylène Maïda

Sparser Kernel Herding with Pairwise Conditional Gradients without Swap Steps

The Pairwise Conditional Gradients (PCG) algorithm is a powerful extension of the Frank-Wolfe algorithm leading to particularly sparse solutions, which makes PCG very appealing for problems such as sparse signal recovery, sparse regression,…

Optimization and Control · Mathematics 2022-02-09 Kazuma Tsuji , Ken'ichiro Tanaka , Sebastian Pokutta

Approximate Gradient Coding via Sparse Random Graphs

Distributed algorithms are often beset by the straggler effect, where the slowest compute nodes in the system dictate the overall running time. Coding-theoretic techniques have been recently proposed to mitigate stragglers via algorithmic…

Machine Learning · Statistics 2017-11-21 Zachary Charles , Dimitris Papailiopoulos , Jordan Ellenberg

Convergence rates of Kernel Conjugate Gradient for random design regression

We prove statistical rates of convergence for kernel-based least squares regression from i.i.d. data using a conjugate gradient algorithm, where regularization against overfitting is obtained by early stopping. This method is related to…

Statistics Theory · Mathematics 2016-07-11 Gilles Blanchard , Nicole Krämer

Balance is Essence: Accelerating Sparse Training via Adaptive Gradient Correction

Despite impressive performance, deep neural networks require significant memory and computation costs, prohibiting their application in resource-constrained scenarios. Sparse training is one of the most common techniques to reduce these…

Machine Learning · Computer Science 2023-12-06 Bowen Lei , Dongkuan Xu , Ruqi Zhang , Shuren He , Bani K. Mallick

Efficient Numerical Integration in Reproducing Kernel Hilbert Spaces via Leverage Scores Sampling

In this work we consider the problem of numerical integration, i.e., approximating integrals with respect to a target probability measure using only pointwise evaluations of the integrand. We focus on the setting in which the target…

Machine Learning · Statistics 2025-06-17 Antoine Chatalic , Nicolas Schreuder , Ernesto De Vito , Lorenzo Rosasco

Convolutional Sparse Coding Fast Approximation with Application to Seismic Reflectivity Estimation

In sparse coding, we attempt to extract features of input vectors, assuming that the data is inherently structured as a sparse superposition of basic building blocks. Similarly, neural networks perform a given task by learning features of…

Machine Learning · Computer Science 2022-02-16 Deborah Pereg , Israel Cohen , Anthony A. Vassiliou

Super-Samples from Kernel Herding

We extend the herding algorithm to continuous spaces by using the kernel trick. The resulting "kernel herding" algorithm is an infinite memory deterministic process that learns to approximate a PDF with a collection of samples. We show that…

Machine Learning · Computer Science 2012-03-19 Yutian Chen , Max Welling , Alex Smola

RFN: A Random-Feature Based Newton Method for Empirical Risk Minimization in Reproducing Kernel Hilbert Spaces

In supervised learning using kernel methods, we often encounter a large-scale finite-sum minimization over a reproducing kernel Hilbert space (RKHS). Large-scale finite-sum problems can be solved using efficient variants of Newton method,…

Machine Learning · Computer Science 2022-06-07 Ting-Jui Chang , Shahin Shahrampour

Near Input Sparsity Time Kernel Embeddings via Adaptive Sampling

To accelerate kernel methods, we propose a near input sparsity time algorithm for sampling the high-dimensional feature space implicitly defined by a kernel transformation. Our main contribution is an importance sampling method for…

Data Structures and Algorithms · Computer Science 2020-07-15 David P. Woodruff , Amir Zandieh

Kernel quadrature with randomly pivoted Cholesky

This paper presents new quadrature rules for functions in a reproducing kernel Hilbert space using nodes drawn by a sampling algorithm known as randomly pivoted Cholesky. The resulting computational procedure compares favorably to previous…

Numerical Analysis · Mathematics 2023-12-08 Ethan N. Epperly , Elvira Moreno

A Forward Backward Greedy approach for Sparse Multiscale Learning

Multiscale Models are known to be successful in uncovering and analyzing the structures in data at different resolutions. In the current work we propose a feature driven Reproducing Kernel Hilbert space (RKHS), for which the associated…

Machine Learning · Computer Science 2022-08-24 Prashant Shekhar , Abani Patra

Partial Gradient Optimal Thresholding Algorithms for a Class of Sparse Optimization Problems

The optimization problems with a sparsity constraint is a class of important global optimization problems. A typical type of thresholding algorithms for solving such a problem adopts the traditional full steepest descent direction or…

Optimization and Control · Mathematics 2021-07-20 Nan Meng , Yun-Bin Zhao , Michal Kocvara

Convergence guarantees for kernel-based quadrature rules in misspecified settings

Kernel-based quadrature rules are becoming important in machine learning and statistics, as they achieve super-$\sqrt{n}$ convergence rates in numerical integration, and thus provide alternatives to Monte Carlo integration in challenging…

Machine Learning · Statistics 2016-10-31 Motonobu Kanagawa , Bharath K. Sriperumbudur , Kenji Fukumizu

Efficient kernel-based variable selection with sparsistency

Variable selection is central to high-dimensional data analysis, and various algorithms have been developed. Ideally, a variable selection algorithm shall be flexible, scalable, and with theoretical guarantee, yet most existing algorithms…

Machine Learning · Statistics 2021-02-04 Xin He , Junhui Wang , Shaogao Lv