Related papers: Preconditioning for Scalable Gaussian Process Hype…

Fast and Accurate Gaussian Kernel Ridge Regression Using Matrix Decompositions for Preconditioning

This paper presents a method for building a preconditioner for a kernel ridge regression problem, where the preconditioner is not only effective in its ability to reduce the condition number substantially, but also efficient in its…

Numerical Analysis · Mathematics 2021-04-07 Gil Shabat , Era Choshen , Dvir Ben Or , Nadav Carmel

A Solution to the Ill-Conditioning of Gradient-Enhanced Covariance Matrices for Gaussian Processes

Gaussian processes provide probabilistic surrogates for various applications including classification, uncertainty quantification, and optimization. Using a gradient-enhanced covariance matrix can be beneficial since it provides a more…

Optimization and Control · Mathematics 2023-07-13 André L. Marchildon , David W. Zingg

Preconditioning Kernel Matrices

The computational and storage complexity of kernel machines presents the primary barrier to their scaling to large, modern, datasets. A common way to tackle the scalability issue is to use the conjugate gradient algorithm, which relieves…

Machine Learning · Statistics 2016-05-26 Kurt Cutajar , Michael A. Osborne , John P. Cunningham , Maurizio Filippone

Improving Linear System Solvers for Hyperparameter Optimisation in Iterative Gaussian Processes

Scaling hyperparameter optimisation to very large datasets remains an open problem in the Gaussian process community. This paper focuses on iterative methods, which use linear system solvers, like conjugate gradients, alternating…

Machine Learning · Computer Science 2025-01-14 Jihao Andreas Lin , Shreyas Padhy , Bruno Mlodozeniec , Javier Antorán , José Miguel Hernández-Lobato

Accelerating Posterior sampling for Scalable Gaussian Process model

This Paper conducts a thorough simulation study to assess the effectiveness of various acceleration techniques designed to enhance the conjugate gradient algorithm, which is used for solving large linear systems to accelerate Bayesian…

Computation · Statistics 2025-05-06 Zhihao Zhou

Scalable Gaussian Processes: Advances in Iterative Methods and Pathwise Conditioning

Gaussian processes are a powerful framework for uncertainty-aware function approximation and sequential decision-making. Unfortunately, their classical formulation does not scale gracefully to large amounts of data and modern hardware for…

Machine Learning · Computer Science 2025-07-10 Jihao Andreas Lin

Tighter Bounds on the Log Marginal Likelihood of Gaussian Process Regression Using Conjugate Gradients

We propose a lower bound on the log marginal likelihood of Gaussian process regression models that can be computed without matrix factorisation of the full kernel matrix. We show that approximate maximum likelihood learning of model…

Machine Learning · Statistics 2021-02-17 Artem Artemev , David R. Burt , Mark van der Wilk

A quantum gradient descent algorithm for optimizing Gaussian Process models

Gaussian Process Regression (GPR) is a nonparametric supervised learning method, widely valued for its ability to quantify uncertainty. Despite its advantages and broad applications, classical GPR implementations face significant…

Quantum Physics · Physics 2025-03-25 Junpeng Hu , Jinglai Li , Lei Zhang , Shi Jin

How to optimize preconditioners for the conjugate gradient method: a stochastic approach

The conjugate gradient method (CG) is typically used with a preconditioner which improves efficiency and robustness of the method. Many preconditioners include parameters and a proper choice of a preconditioner and its parameters is often…

Numerical Analysis · Mathematics 2019-06-04 Alexandr Katrutsa , Mike Botchev , George Ovchinnikov , Ivan Oseledets

Active Probabilistic Inference on Matrices for Pre-Conditioning in Stochastic Optimization

Pre-conditioning is a well-known concept that can significantly improve the convergence of optimization algorithms. For noise-free problems, where good pre-conditioners are not known a priori, iterative linear algebra methods offer one way…

Machine Learning · Computer Science 2019-02-21 Filip de Roos , Philipp Hennig

Deep Learning-Enhanced Preconditioning for Efficient Conjugate Gradient Solvers in Large-Scale PDE Systems

Preconditioning techniques are crucial for enhancing the efficiency of solving large-scale linear equation systems that arise from partial differential equation (PDE) discretization. These techniques, such as Incomplete Cholesky…

Machine Learning · Computer Science 2024-12-11 Rui Li , Song Wang , Chen Wang

Iterative Methods for Full-Scale Gaussian Process Approximations for Large Spatial Data

Gaussian processes are flexible probabilistic regression models which are widely used in statistics and machine learning. However, a drawback is their limited scalability to large data sets. To alleviate this, full-scale approximations…

Methodology · Statistics 2026-01-13 Tim Gyger , Reinhard Furrer , Fabio Sigrist

Improving Iterative Gaussian Processes via Warm Starting Sequential Posteriors

Scalable Gaussian process (GP) inference is essential for sequential decision-making tasks, yet improving GP scalability remains a challenging problem with many open avenues of research. This paper focuses on iterative GPs, where iterative…

Machine Learning · Computer Science 2025-11-21 Alan Yufei Dong , Jihao Andreas Lin , José Miguel Hernández-Lobato

Fast and accurate conditioning for large-scale and online Gaussian process prediction problems

Gaussian Process (GP) models provide a flexible framework for prediction and uncertainty quantification. For most covariance functions, however, exact GP prediction with $n$ points scales as $\mathcal{O}(n^3)$, making it prohibitively…

Computation · Statistics 2026-05-29 Samanyu Arora , Christopher J. Geoga

Scalable Gaussian Process Hyperparameter Optimization via Coverage Regularization

Gaussian processes (GPs) are Bayesian non-parametric models popular in a variety of applications due to their accuracy and native uncertainty quantification (UQ). Tuning GP hyperparameters is critical to ensure the validity of prediction…

Machine Learning · Computer Science 2022-11-03 Killian Wood , Alec M. Dunton , Amanda Muyskens , Benjamin W. Priest

Learning Preconditioner for Conjugate Gradient PDE Solvers

Efficient numerical solvers for partial differential equations empower science and engineering. One of the commonly employed numerical solvers is the preconditioned conjugate gradient (PCG) algorithm which can solve large systems to a given…

Numerical Analysis · Mathematics 2023-09-07 Yichen Li , Peter Yichen Chen , Tao Du , Wojciech Matusik

An Efficient Scaled spectral preconditioner for sequences of symmetric positive definite linear systems

We explore a scaled spectral preconditioner for the efficient solution of sequences of symmetric and positive-definite linear systems. We design the scaled preconditioner not only as an approximation of the inverse of the linear system but…

Numerical Analysis · Mathematics 2024-10-04 Youssef Diouane , Selime Gürol , Oussama Mouhtal , Dominique Orban

Efficient Marginal Likelihood Computation for Gaussian Process Regression

In a Bayesian learning setting, the posterior distribution of a predictive model arises from a trade-off between its prior distribution and the conditional likelihood of observed data. Such distribution functions usually rely on additional…

Machine Learning · Statistics 2011-11-01 Andrea Schirru , Simone Pampuri , Giuseppe De Nicolao , Sean McLoone

Scalable Log Determinants for Gaussian Process Kernel Learning

For applications as varied as Bayesian neural networks, determinantal point processes, elliptical graphical models, and kernel learning for Gaussian processes (GPs), one must compute a log determinant of an $n \times n$ positive definite…

Machine Learning · Statistics 2017-11-10 Kun Dong , David Eriksson , Hannes Nickisch , David Bindel , Andrew Gordon Wilson

Gaussian Process Inference Using Mini-batch Stochastic Gradient Descent: Convergence Guarantees and Empirical Benefits

Stochastic gradient descent (SGD) and its variants have established themselves as the go-to algorithms for large-scale machine learning problems with independent samples due to their generalization performance and intrinsic computational…

Machine Learning · Statistics 2025-08-25 Hao Chen , Lili Zheng , Raed Al Kontar , Garvesh Raskutti