English
Related papers

Related papers: Gaussian Process Optimization with Mutual Informat…

200 papers

Gaussian process upper confidence bound (GP-UCB) is a theoretically established algorithm for Bayesian optimization (BO), where we assume the objective function $f$ follows a GP. One notable drawback of GP-UCB is that the theoretical…

Machine Learning · Computer Science 2025-11-10 Shion Takeno , Yu Inatsu , Masayuki Karasuyama

This paper addresses the Bayesian optimization problem (also referred to as the Bayesian setting of the Gaussian process bandit), where the learner seeks to minimize the regret under a function drawn from a known Gaussian process (GP).…

Machine Learning · Computer Science 2025-12-12 Shogo Iwazaki

Bayesian optimization usually assumes that a Bayesian prior is given. However, the strong theoretical guarantees in Bayesian optimization are often regrettably compromised in practice because of unknown parameters in the prior. In this…

Machine Learning · Computer Science 2018-11-26 Zi Wang , Beomjoon Kim , Leslie Pack Kaelbling

In this paper, we consider the challenge of maximizing an unknown function f for which evaluations are noisy and are acquired with high cost. An iterative procedure uses the previous measures to actively select the next estimation of f…

Machine Learning · Computer Science 2013-09-03 Emile Contal , David Buffoni , Alexandre Robicquet , Nicolas Vayatis

Many applications require optimizing an unknown, noisy function that is expensive to evaluate. We formalize this task as a multi-armed bandit problem, where the payoff function is either sampled from a Gaussian process (GP) or has low RKHS…

Machine Learning · Computer Science 2015-03-13 Niranjan Srinivas , Andreas Krause , Sham M. Kakade , Matthias Seeger

Bayesian optimization (BO) with Gaussian process (GP) surrogate models is a powerful black-box optimization method. Acquisition functions are a critical part of a BO algorithm as they determine how the new samples are selected. Some of the…

Machine Learning · Computer Science 2024-12-30 Jingyi Wang , Haowei Wang , Cosmin G. Petra , Nai-Yuan Chiang

Gaussian process upper confidence bound (GP-UCB) is a theoretically promising approach for black-box optimization; however, the confidence parameter $\beta$ is considerably large in the theorem and chosen heuristically in practice. Then,…

Machine Learning · Computer Science 2023-06-13 Shion Takeno , Yu Inatsu , Masayuki Karasuyama

In this paper, we consider the problem of stochastic optimization under a bandit feedback model. We generalize the GP-UCB algorithm [Srinivas and al., 2012] to arbitrary kernels and search spaces. To do so, we use a notion of localized…

Machine Learning · Statistics 2015-10-20 Emile Contal , Cédric Malherbe , Nicolas Vayatis

Recently, there has been rising interest in Bayesian optimization -- the optimization of an unknown function with assumptions usually expressed by a Gaussian Process (GP) prior. We study an optimization strategy that directly uses an…

Machine Learning · Statistics 2018-08-14 Zi Wang , Bolei Zhou , Stefanie Jegelka

The expected improvement (EI) algorithm is one of the most popular strategies for optimization under uncertainty due to its simplicity and efficiency. Despite its popularity, the theoretical aspects of this algorithm have not been properly…

Machine Learning · Computer Science 2026-04-28 Hung Tran-The , Sunil Gupta , Santu Rana , Svetha Venkatesh

The goal of this paper is to characterize Gaussian-Process optimization in the setting where the function domain is large relative to the number of admissible function evaluations, i.e., where it is impossible to find the global optimum. We…

Machine Learning · Computer Science 2022-01-26 Manuel Wüthrich , Bernhard Schölkopf , Andreas Krause

We consider the sequential Bayesian optimization problem with bandit feedback, adopting a formulation that allows for the reward function to vary with time. We model the reward function using a Gaussian process whose evolution obeys a…

Machine Learning · Statistics 2016-01-26 Ilija Bogunovic , Jonathan Scarlett , Volkan Cevher

In order to improve the performance of Bayesian optimisation, we develop a modified Gaussian process upper confidence bound (GP-UCB) acquisition function. This is done by sampling the exploration-exploitation trade-off parameter from a…

Machine Learning · Computer Science 2020-06-09 Julian Berk , Sunil Gupta , Santu Rana , Svetha Venkatesh

Gaussian process upper confidence bound (GP-UCB) is widely used for sequential optimization of expensive black-box functions. Although many upper bounds on its cumulative regret have been established in the literature, whether GP-UCB is…

Machine Learning · Computer Science 2026-05-21 Wenjia Wang , Xiaowei Zhang

Bayesian optimization is a framework for global search via maximum a posteriori updates rather than simulated annealing, and has gained prominence for decision-making under uncertainty. In this work, we cast Bayesian optimization as a…

Machine Learning · Computer Science 2022-03-24 Amrit Singh Bedi , Dheeraj Peddireddy , Vaneet Aggarwal , Brian M. Sadler , Alec Koppel

Consider the sequential optimization of an expensive to evaluate and possibly non-convex objective function $f$ from noisy feedback, that can be considered as a continuum-armed bandit problem. Upper bounds on the regret performance of…

Machine Learning · Statistics 2021-03-11 Sattar Vakili , Kia Khezeli , Victor Picheny

Expected improvement (EI) is one of the most widely used acquisition functions in Bayesian optimization (BO). Despite its proven empirical success in applications, the cumulative regret upper bound of EI remains an open question. In this…

Machine Learning · Statistics 2025-08-22 Jingyi Wang , Haowei Wang , Szu Hui Ng , Cosmin G. Petra

The paper considers the problem of global optimization in the setup of stochastic process bandits. We introduce an UCB algorithm which builds a cascade of discretization trees based on generic chaining in order to render possible his…

Machine Learning · Statistics 2016-02-22 Emile Contal , Nicolas Vayatis

We consider the combinatorial volatile Gaussian process (GP) semi-bandit problem. Each round, an agent is provided a set of available base arms and must select a subset of them to maximize the long-term cumulative reward. We study the…

Machine Learning · Computer Science 2025-02-13 Jack Sandberg , Niklas Åkerblom , Morteza Haghir Chehreghani

Many applications require a learner to make sequential decisions given uncertainty regarding both the system's payoff function and safety constraints. In safety-critical systems, it is paramount that the learner's actions do not violate the…

Machine Learning · Computer Science 2020-05-06 Sanae Amani , Mahnoosh Alizadeh , Christos Thrampoulidis
‹ Prev 1 2 3 10 Next ›