English
Related papers

Related papers: Sparse solutions of the kernel herding algorithm b…

200 papers

We show that the herding procedure of Welling (2009) takes exactly the form of a standard convex optimization algorithm--namely a conditional gradient algorithm minimizing a quadratic moment discrepancy. This link enables us to invoke…

Machine Learning · Computer Science 2012-09-12 Francis Bach , Simon Lacoste-Julien , Guillaume Obozinski

In this paper, we study the problem of sparse multiple kernel learning (MKL), where the goal is to efficiently learn a combination of a fixed small number of kernels from a large pool that could lead to a kernel classifier with a small…

Machine Learning · Computer Science 2013-02-05 Rong Jin , Tianbao Yang , Mehrdad Mahdavi

Kernel means are frequently used to represent probability distributions in machine learning problems. In particular, the well known kernel density estimator and the kernel mean embedding both have the form of a kernel mean. Unfortunately,…

Machine Learning · Statistics 2015-03-03 E. Cruz Cortés , C. Scott

Reproducing kernel Hilbert spaces (RKHSs) are key elements of many non-parametric tools successfully used in signal processing, statistics, and machine learning. In this work, we aim to address three issues of the classical RKHS based…

Signal Processing · Electrical Eng. & Systems 2019-05-09 Maria Peifer , Luiz. F. O. Chamon , Santiago Paternain , Alejandro Ribeiro

Kernel quadratures and other kernel-based approximation methods typically suffer from prohibitive cubic time and quadratic space complexity in the number of function evaluations. The problem arises because a system of linear equations needs…

Numerical Analysis · Mathematics 2018-01-09 Toni Karvonen , Simo Särkkä

Kernel herding belongs to a family of deterministic quadratures that seek to minimize the worst-case integration error over a reproducing kernel Hilbert space (RKHS). These quadrature rules come with strong experimental evidence that this…

Machine Learning · Computer Science 2025-08-12 Martin Rouault , Rémi Bardenet , Mylène Maïda

The Pairwise Conditional Gradients (PCG) algorithm is a powerful extension of the Frank-Wolfe algorithm leading to particularly sparse solutions, which makes PCG very appealing for problems such as sparse signal recovery, sparse regression,…

Optimization and Control · Mathematics 2022-02-09 Kazuma Tsuji , Ken'ichiro Tanaka , Sebastian Pokutta

Distributed algorithms are often beset by the straggler effect, where the slowest compute nodes in the system dictate the overall running time. Coding-theoretic techniques have been recently proposed to mitigate stragglers via algorithmic…

Machine Learning · Statistics 2017-11-21 Zachary Charles , Dimitris Papailiopoulos , Jordan Ellenberg

We prove statistical rates of convergence for kernel-based least squares regression from i.i.d. data using a conjugate gradient algorithm, where regularization against overfitting is obtained by early stopping. This method is related to…

Statistics Theory · Mathematics 2016-07-11 Gilles Blanchard , Nicole Krämer

Despite impressive performance, deep neural networks require significant memory and computation costs, prohibiting their application in resource-constrained scenarios. Sparse training is one of the most common techniques to reduce these…

Machine Learning · Computer Science 2023-12-06 Bowen Lei , Dongkuan Xu , Ruqi Zhang , Shuren He , Bani K. Mallick

In this work we consider the problem of numerical integration, i.e., approximating integrals with respect to a target probability measure using only pointwise evaluations of the integrand. We focus on the setting in which the target…

Machine Learning · Statistics 2025-06-17 Antoine Chatalic , Nicolas Schreuder , Ernesto De Vito , Lorenzo Rosasco

In sparse coding, we attempt to extract features of input vectors, assuming that the data is inherently structured as a sparse superposition of basic building blocks. Similarly, neural networks perform a given task by learning features of…

Machine Learning · Computer Science 2022-02-16 Deborah Pereg , Israel Cohen , Anthony A. Vassiliou

We extend the herding algorithm to continuous spaces by using the kernel trick. The resulting "kernel herding" algorithm is an infinite memory deterministic process that learns to approximate a PDF with a collection of samples. We show that…

Machine Learning · Computer Science 2012-03-19 Yutian Chen , Max Welling , Alex Smola

In supervised learning using kernel methods, we often encounter a large-scale finite-sum minimization over a reproducing kernel Hilbert space (RKHS). Large-scale finite-sum problems can be solved using efficient variants of Newton method,…

Machine Learning · Computer Science 2022-06-07 Ting-Jui Chang , Shahin Shahrampour

To accelerate kernel methods, we propose a near input sparsity time algorithm for sampling the high-dimensional feature space implicitly defined by a kernel transformation. Our main contribution is an importance sampling method for…

Data Structures and Algorithms · Computer Science 2020-07-15 David P. Woodruff , Amir Zandieh

This paper presents new quadrature rules for functions in a reproducing kernel Hilbert space using nodes drawn by a sampling algorithm known as randomly pivoted Cholesky. The resulting computational procedure compares favorably to previous…

Numerical Analysis · Mathematics 2023-12-08 Ethan N. Epperly , Elvira Moreno

Multiscale Models are known to be successful in uncovering and analyzing the structures in data at different resolutions. In the current work we propose a feature driven Reproducing Kernel Hilbert space (RKHS), for which the associated…

Machine Learning · Computer Science 2022-08-24 Prashant Shekhar , Abani Patra

The optimization problems with a sparsity constraint is a class of important global optimization problems. A typical type of thresholding algorithms for solving such a problem adopts the traditional full steepest descent direction or…

Optimization and Control · Mathematics 2021-07-20 Nan Meng , Yun-Bin Zhao , Michal Kocvara

Kernel-based quadrature rules are becoming important in machine learning and statistics, as they achieve super-$\sqrt{n}$ convergence rates in numerical integration, and thus provide alternatives to Monte Carlo integration in challenging…

Machine Learning · Statistics 2016-10-31 Motonobu Kanagawa , Bharath K. Sriperumbudur , Kenji Fukumizu

Variable selection is central to high-dimensional data analysis, and various algorithms have been developed. Ideally, a variable selection algorithm shall be flexible, scalable, and with theoretical guarantee, yet most existing algorithms…

Machine Learning · Statistics 2021-02-04 Xin He , Junhui Wang , Shaogao Lv
‹ Prev 1 2 3 10 Next ›