Related papers: Beyond Cross-Validation: Adaptive Parameter Select…

Adaptive Stopping Rule for Kernel-based Gradient Descent Algorithms

In this paper, we propose an adaptive stopping rule for kernel-based gradient descent (KGD) algorithms. We introduce the empirical effective dimension to quantify the increments of iterations in KGD and derive an implementable early…

Machine Learning · Computer Science 2023-06-14 Xiangyu Chang , Shao-Bo Lin

Adaptive Kernel Selection for Stein Variational Gradient Descent

A central challenge in Bayesian inference is efficiently approximating posterior distributions. Stein Variational Gradient Descent (SVGD) is a popular variational inference method which transports a set of particles to approximate a target…

Machine Learning · Statistics 2025-12-05 Moritz Melcher , Simon Weissmann , Ashia C. Wilson , Jakob Zech

Simultaneous Model Selection and Optimization through Parameter-free Stochastic Learning

Stochastic gradient descent algorithms for training linear and kernel predictors are gaining more and more importance, thanks to their scalability. While various methods have been proposed to speed up their convergence, the model selection…

Machine Learning · Computer Science 2014-06-17 Francesco Orabona

A Computable Measure of Suboptimality for Entropy-Regularised Variational Objectives

Several emerging post-Bayesian methods target a probability distribution for which an entropy-regularised variational objective is minimised. This increased flexibility introduces a computational challenge, as one loses access to an…

Computation · Statistics 2025-12-17 Clémentine Chazal , Heishiro Kanagawa , Zheyang Shen , Anna Korba , Chris. J. Oates

Stein Variational Gradient Descent with Multiple Kernel

Stein variational gradient descent (SVGD) and its variants have shown promising successes in approximate inference for complex distributions. In practice, we notice that the kernel used in SVGD-based methods has a decisive effect on the…

Machine Learning · Computer Science 2022-11-29 Qingzhong Ai , Shiyu Liu , Lirong He , Zenglin Xu

On the geometry of Stein variational gradient descent

Bayesian inference problems require sampling or approximating high-dimensional probability distributions. The focus of this paper is on the recently introduced Stein variational gradient descent methodology, a class of algorithms that rely…

Machine Learning · Statistics 2023-02-14 A. Duncan , N. Nuesken , L. Szpruch

Towards understanding Accelerated Stein Variational Gradient Flow -- Analysis of Generalized Bilinear Kernels for Gaussian target distributions

Stein variational gradient descent (SVGD) is a kernel-based and non-parametric particle method for sampling from a target distribution, such as in Bayesian inference and other machine learning tasks. Different from other particle methods,…

Optimization and Control · Mathematics 2025-10-02 Viktor Stein , Wuchen Li

Truncated Kernel Stochastic Gradient Descent with General Losses and Spherical Radial Basis Functions

In this paper, we propose a novel kernel stochastic gradient descent (SGD) algorithm for large-scale supervised learning with general losses. Compared to traditional kernel SGD, our algorithm improves efficiency and scalability through an…

Machine Learning · Computer Science 2026-04-28 Jinhui Bai , Andreas Christmann , Lei Shi

A Gradient-based Kernel Optimization Approach for Parabolic Distributed Parameter Control Systems

This paper proposes a new gradient-based optimization approach for designing optimal feedback kernels for parabolic distributed parameter systems with boundary control. Unlike traditional kernel optimization methods for parabolic systems,…

Optimization and Control · Mathematics 2016-03-16 Zhigang Ren , Chao Xu , Qun Lin , Ryan Loxton

Gradient-based kernel dimension reduction for supervised learning

This paper proposes a novel kernel approach to linear dimension reduction for supervised learning. The purpose of the dimension reduction is to find directions in the input space to explain the output as effectively as possible. The…

Machine Learning · Statistics 2011-09-05 Kenji Fukumizu , Chenlei Leng

Stein Variational Gradient Descent With Matrix-Valued Kernels

Stein variational gradient descent (SVGD) is a particle-based inference algorithm that leverages gradient information for efficient approximate inference. In this work, we enhance SVGD by leveraging preconditioning matrices, such as the…

Machine Learning · Statistics 2019-11-06 Dilin Wang , Ziyang Tang , Chandrajit Bajaj , Qiang Liu

Adaptive Parameter Selection for Kernel Ridge Regression

This paper focuses on parameter selection issues of kernel ridge regression (KRR). Due to special spectral properties of KRR, we find that delicate subdivision of the parameter interval shrinks the difference between two successive KRR…

Machine Learning · Computer Science 2023-12-12 Shao-Bo Lin

Stochastic Gradient Descent for Two-layer Neural Networks

This paper presents a comprehensive study on the convergence rates of the stochastic gradient descent (SGD) algorithm when applied to overparameterized two-layer neural networks. Our approach combines the Neural Tangent Kernel (NTK)…

Machine Learning · Statistics 2024-07-11 Dinghao Cao , Zheng-Chu Guo , Lei Shi

Generalized Gaussian Kernel Adaptive Filtering

The present paper proposes generalized Gaussian kernel adaptive filtering, where the kernel parameters are adaptive and data-driven. The Gaussian kernel is parametrized by a center vector and a symmetric positive definite (SPD) precision…

Machine Learning · Computer Science 2021-05-20 Tomoya Wada , Kosuke Fukumori , Toshihisa Tanaka , Simone Fiori

FastPart: Over-Parameterized Stochastic Gradient Descent for Sparse optimisation on Measures

This paper presents a novel algorithm that leverages Stochastic Gradient Descent strategies in conjunction with Random Features to augment the scalability of Conic Particle Gradient Descent (CPGD) specifically tailored for solving sparse…

Optimization and Control · Mathematics 2025-09-05 Yohann De Castro , Sébastien Gadat , Clément Marteau

Fast Bounded Online Gradient Descent Algorithms for Scalable Kernel-Based Online Learning

Kernel-based online learning has often shown state-of-the-art performance for many online learning tasks. It, however, suffers from a major shortcoming, that is, the unbounded number of support vectors, making it non-scalable and unsuitable…

Machine Learning · Computer Science 2012-06-22 Peilin Zhao , Jialei Wang , Pengcheng Wu , Rong Jin , Steven C. H. Hoi

Guided parallelized stochastic gradient descent for delay compensation

Stochastic gradient descent (SGD) algorithm and its variations have been effectively used to optimize neural network models. However, with the rapid growth of big data and deep learning, SGD is no longer the most suitable choice due to its…

Machine Learning · Computer Science 2024-02-13 Anuraganand Sharma

Improving the Convergence Rates of Forward Gradient Descent with Repeated Sampling

Forward gradient descent (FGD) has been proposed as a biologically more plausible alternative of gradient descent as it can be computed without backward pass. Considering the linear model with $d$ parameters, previous work has found that…

Statistics Theory · Mathematics 2024-11-27 Niklas Dexheimer , Johannes Schmidt-Hieber

Convergence of Implicit Gradient Descent for Training Two-Layer Physics-Informed Neural Networks

The optimization algorithms are crucial in training physics-informed neural networks (PINNs), as unsuitable methods may lead to poor solutions. Compared to the common gradient descent (GD) algorithm, implicit gradient descent (IGD)…

Machine Learning · Computer Science 2025-08-04 Xianliang Xu , Ting Du , Wang Kong , Bin Shan , Ye Li , Zhongyi Huang

Stochastic Gradient Descent Meets Distribution Regression

Stochastic gradient descent (SGD) provides a simple and efficient way to solve a broad range of machine learning problems. Here, we focus on distribution regression (DR), involving two stages of sampling: Firstly, we regress from…

Machine Learning · Statistics 2021-03-08 Nicole Mücke