Related papers: Gaussian Process Optimization with Mutual Informat…

Regret Analysis for Randomized Gaussian Process Upper Confidence Bound

Gaussian process upper confidence bound (GP-UCB) is a theoretically established algorithm for Bayesian optimization (BO), where we assume the objective function $f$ follows a GP. One notable drawback of GP-UCB is that the theoretical…

Machine Learning · Computer Science 2025-11-10 Shion Takeno , Yu Inatsu , Masayuki Karasuyama

Improved Regret Bounds for Gaussian Process Upper Confidence Bound in Bayesian Optimization

This paper addresses the Bayesian optimization problem (also referred to as the Bayesian setting of the Gaussian process bandit), where the learner seeks to minimize the regret under a function drawn from a known Gaussian process (GP).…

Machine Learning · Computer Science 2025-12-12 Shogo Iwazaki

Regret bounds for meta Bayesian optimization with an unknown Gaussian process prior

Bayesian optimization usually assumes that a Bayesian prior is given. However, the strong theoretical guarantees in Bayesian optimization are often regrettably compromised in practice because of unknown parameters in the prior. In this…

Machine Learning · Computer Science 2018-11-26 Zi Wang , Beomjoon Kim , Leslie Pack Kaelbling

Parallel Gaussian Process Optimization with Upper Confidence Bound and Pure Exploration

In this paper, we consider the challenge of maximizing an unknown function f for which evaluations are noisy and are acquired with high cost. An iterative procedure uses the previous measures to actively select the next estimation of f…

Machine Learning · Computer Science 2013-09-03 Emile Contal , David Buffoni , Alexandre Robicquet , Nicolas Vayatis

Gaussian Process Optimization in the Bandit Setting: No Regret and Experimental Design

Many applications require optimizing an unknown, noisy function that is expensive to evaluate. We formalize this task as a multi-armed bandit problem, where the payoff function is either sampled from a Gaussian process (GP) or has low RKHS…

Machine Learning · Computer Science 2015-03-13 Niranjan Srinivas , Andreas Krause , Sham M. Kakade , Matthias Seeger

On Improved Regret Bounds In Bayesian Optimization with Gaussian Noise

Bayesian optimization (BO) with Gaussian process (GP) surrogate models is a powerful black-box optimization method. Acquisition functions are a critical part of a BO algorithm as they determine how the new samples are selected. Some of the…

Machine Learning · Computer Science 2024-12-30 Jingyi Wang , Haowei Wang , Cosmin G. Petra , Nai-Yuan Chiang

Randomized Gaussian Process Upper Confidence Bound with Tighter Bayesian Regret Bounds

Gaussian process upper confidence bound (GP-UCB) is a theoretically promising approach for black-box optimization; however, the confidence parameter $\beta$ is considerably large in the theorem and chosen heuristically in practice. Then,…

Machine Learning · Computer Science 2023-06-13 Shion Takeno , Yu Inatsu , Masayuki Karasuyama

Optimization for Gaussian Processes via Chaining

In this paper, we consider the problem of stochastic optimization under a bandit feedback model. We generalize the GP-UCB algorithm [Srinivas and al., 2012] to arbitrary kernels and search spaces. To do so, we use a notion of localized…

Machine Learning · Statistics 2015-10-20 Emile Contal , Cédric Malherbe , Nicolas Vayatis

Optimization as Estimation with Gaussian Processes in Bandit Settings

Recently, there has been rising interest in Bayesian optimization -- the optimization of an unknown function with assumptions usually expressed by a Gaussian Process (GP) prior. We study an optimization strategy that directly uses an…

Machine Learning · Statistics 2018-08-14 Zi Wang , Bolei Zhou , Stefanie Jegelka

Regret Bounds for Expected Improvement Algorithms in Gaussian Process Bandit Optimization

The expected improvement (EI) algorithm is one of the most popular strategies for optimization under uncertainty due to its simplicity and efficiency. Despite its popularity, the theoretical aspects of this algorithm have not been properly…

Machine Learning · Computer Science 2026-04-28 Hung Tran-The , Sunil Gupta , Santu Rana , Svetha Venkatesh

Regret Bounds for Gaussian-Process Optimization in Large Domains

The goal of this paper is to characterize Gaussian-Process optimization in the setting where the function domain is large relative to the number of admissible function evaluations, i.e., where it is impossible to find the global optimum. We…

Machine Learning · Computer Science 2022-01-26 Manuel Wüthrich , Bernhard Schölkopf , Andreas Krause

Time-Varying Gaussian Process Bandit Optimization

We consider the sequential Bayesian optimization problem with bandit feedback, adopting a formulation that allows for the reward function to vary with time. We model the reward function using a Gaussian process whose evolution obeys a…

Machine Learning · Statistics 2016-01-26 Ilija Bogunovic , Jonathan Scarlett , Volkan Cevher

Randomised Gaussian Process Upper Confidence Bound for Bayesian Optimisation

In order to improve the performance of Bayesian optimisation, we develop a modified Gaussian process upper confidence bound (GP-UCB) acquisition function. This is done by sampling the exploration-exploitation trade-off parameter from a…

Machine Learning · Computer Science 2020-06-09 Julian Berk , Sunil Gupta , Santu Rana , Svetha Venkatesh

On the Suboptimality of GP-UCB under Polynomial Effective Optimism

Gaussian process upper confidence bound (GP-UCB) is widely used for sequential optimization of expensive black-box functions. Although many upper bounds on its cumulative regret have been established in the literature, whether GP-UCB is…

Machine Learning · Computer Science 2026-05-21 Wenjia Wang , Xiaowei Zhang

Regret and Belief Complexity Trade-off in Gaussian Process Bandits via Information Thresholding

Bayesian optimization is a framework for global search via maximum a posteriori updates rather than simulated annealing, and has gained prominence for decision-making under uncertainty. In this work, we cast Bayesian optimization as a…

Machine Learning · Computer Science 2022-03-24 Amrit Singh Bedi , Dheeraj Peddireddy , Vaneet Aggarwal , Brian M. Sadler , Alec Koppel

On Information Gain and Regret Bounds in Gaussian Process Bandits

Consider the sequential optimization of an expensive to evaluate and possibly non-convex objective function $f$ from noisy feedback, that can be considered as a continuum-armed bandit problem. Upper bounds on the regret performance of…

Machine Learning · Statistics 2021-03-11 Sattar Vakili , Kia Khezeli , Victor Picheny

Bayesian Optimization with Expected Improvement: No Regret and the Choice of Incumbent

Expected improvement (EI) is one of the most widely used acquisition functions in Bayesian optimization (BO). Despite its proven empirical success in applications, the cumulative regret upper bound of EI remains an open question. In this…

Machine Learning · Statistics 2025-08-22 Jingyi Wang , Haowei Wang , Szu Hui Ng , Cosmin G. Petra

Stochastic Process Bandits: Upper Confidence Bounds Algorithms via Generic Chaining

The paper considers the problem of global optimization in the setup of stochastic process bandits. We introduce an UCB algorithm which builds a cascade of discretization trees based on generic chaining in order to render possible his…

Machine Learning · Statistics 2016-02-22 Emile Contal , Nicolas Vayatis

Bayesian Analysis of Combinatorial Gaussian Process Bandits

We consider the combinatorial volatile Gaussian process (GP) semi-bandit problem. Each round, an agent is provided a set of available base arms and must select a subset of them to maximize the long-term cumulative reward. We study the…

Machine Learning · Computer Science 2025-02-13 Jack Sandberg , Niklas Åkerblom , Morteza Haghir Chehreghani

Regret Bounds for Safe Gaussian Process Bandit Optimization

Many applications require a learner to make sequential decisions given uncertainty regarding both the system's payoff function and safety constraints. In safety-critical systems, it is paramount that the learner's actions do not violate the…

Machine Learning · Computer Science 2020-05-06 Sanae Amani , Mahnoosh Alizadeh , Christos Thrampoulidis