Related papers: Collaborative Learning in Kernel-based Bandits for…

Order-Optimal Regret in Distributed Kernel Bandits using Uniform Sampling with Shared Randomness

We consider distributed kernel bandits where $N$ agents aim to collaboratively maximize an unknown reward function that lies in a reproducing kernel Hilbert space. Each agent sequentially queries the function to obtain noisy observations at…

Machine Learning · Computer Science 2024-02-21 Nikola Pavlovic , Sudeep Salgia , Qing Zhao

Federated Linear Contextual Bandits with Heterogeneous Clients

The demand for collaborative and private bandit learning across multiple agents is surging due to the growing quantity of data generated from distributed systems. Federated bandit learning has emerged as a promising framework for private,…

Machine Learning · Computer Science 2024-03-04 Ethan Blaser , Chuanhao Li , Hongning Wang

Distributed Optimization via Kernelized Multi-armed Bandits

Multi-armed bandit algorithms provide solutions for sequential decision-making where learning takes place by interacting with the environment. In this work, we model a distributed optimization problem as a multi-agent kernelized multi-armed…

Machine Learning · Computer Science 2023-12-11 Ayush Rai , Shaoshuai Mou

Efficient kernelized bandit algorithms via exploration distributions

We consider a kernelized bandit problem with a compact arm set ${X} \subset \mathbb{R}^d $ and a fixed but unknown reward function $f^*$ with a finite norm in some Reproducing Kernel Hilbert Space (RKHS). We propose a class of…

Machine Learning · Computer Science 2025-06-13 Bingshan Hu , Zheng He , Danica J. Sutherland

Distributed Online Learning via Cooperative Contextual Bandits

In this paper we propose a novel framework for decentralized, online learning by many learners. At each moment of time, an instance characterized by a certain context may arrive to each learner; based on the context, the learner can select…

Machine Learning · Computer Science 2015-03-24 Cem Tekin , Mihaela van der Schaar

Adversarial Contextual Bandits Go Kernelized

We study a generalization of the problem of online learning in adversarial linear contextual bandits by incorporating loss functions that belong to a reproducing kernel Hilbert space, which allows for a more flexible modeling of complex…

Machine Learning · Statistics 2023-10-04 Gergely Neu , Julia Olkhovskaya , Sattar Vakili

Communication Efficient Federated Learning for Generalized Linear Bandits

Contextual bandit algorithms have been recently studied under the federated learning setting to satisfy the demand of keeping data decentralized and pushing the learning of bandit models to the client side. But limited by the required…

Machine Learning · Computer Science 2022-10-14 Chuanhao Li , Hongning Wang

Kernel Methods for Cooperative Multi-Agent Contextual Bandits

Cooperative multi-agent decision making involves a group of agents cooperatively solving learning problems while communicating over a network with delays. In this paper, we consider the kernelised contextual bandit problem, where the reward…

Machine Learning · Computer Science 2020-08-17 Abhimanyu Dubey , Alex Pentland

On Kernelized Multi-armed Bandits

We consider the stochastic bandit problem with a continuous set of arms, with the expected reward function over the arms assumed to be fixed but unknown. We provide two new Gaussian process-based algorithms for continuous bandit…

Machine Learning · Computer Science 2017-05-18 Sayak Ray Chowdhury , Aditya Gopalan

On the Sublinear Regret of GP-UCB

In the kernelized bandit problem, a learner aims to sequentially compute the optimum of a function lying in a reproducing kernel Hilbert space given only noisy evaluations at sequentially chosen points. In particular, the learner aims to…

Machine Learning · Computer Science 2023-08-15 Justin Whitehouse , Zhiwei Steven Wu , Aaditya Ramdas

On Information Gain and Regret Bounds in Gaussian Process Bandits

Consider the sequential optimization of an expensive to evaluate and possibly non-convex objective function $f$ from noisy feedback, that can be considered as a continuum-armed bandit problem. Upper bounds on the regret performance of…

Machine Learning · Statistics 2021-03-11 Sattar Vakili , Kia Khezeli , Victor Picheny

Distributed Kernel Regression: An Algorithm for Training Collaboratively

This paper addresses the problem of distributed learning under communication constraints, motivated by distributed signal processing in wireless sensor networks and data mining with distributed databases. After formalizing a general model…

Machine Learning · Computer Science 2016-11-15 Joel B. Predd , Sanjeev R. Kulkarni , H. Vincent Poor

Decentralized Online Learning with Kernels

We consider multi-agent stochastic optimization problems over reproducing kernel Hilbert spaces (RKHS). In this setting, a network of interconnected agents aims to learn decision functions, i.e., nonlinear statistical models, that are…

Optimization and Control · Mathematics 2018-07-04 Alec Koppel , Santiago Paternain , Cedric Richard , Alejandro Ribeiro

Federated Linear Contextual Bandits with User-level Differential Privacy

This paper studies federated linear contextual bandits under the notion of user-level differential privacy (DP). We first introduce a unified federated bandits framework that can accommodate various definitions of DP in the sequential…

Machine Learning · Computer Science 2023-06-14 Ruiquan Huang , Huanyu Zhang , Luca Melis , Milan Shen , Meisam Hajzinia , Jing Yang

Federated $\mathcal{X}$-armed Bandit with Flexible Personalisation

This paper introduces a novel approach to personalised federated learning within the $\mathcal{X}$-armed bandit framework, addressing the challenge of optimising both local and global objectives in a highly heterogeneous environment. Our…

Machine Learning · Statistics 2024-09-12 Ali Arabzadeh , James A. Grant , David S. Leslie

Federated Learning for Heterogeneous Bandits with Unobserved Contexts

We study the problem of federated stochastic multi-arm contextual bandits with unknown contexts, in which M agents are faced with different bandits and collaborate to learn. The communication model consists of a central server and the…

Machine Learning · Computer Science 2024-01-31 Jiabin Lin , Shana Moothedath

Online Learning in Kernelized Markov Decision Processes

We consider online learning for minimizing regret in unknown, episodic Markov decision processes (MDPs) with continuous states and actions. We develop variants of the UCRL and posterior sampling algorithms that employ nonparametric Gaussian…

Machine Learning · Computer Science 2019-01-04 Sayak Ray Chowdhury , Aditya Gopalan

Asynchronous Upper Confidence Bound Algorithms for Federated Linear Bandits

Linear contextual bandit is a popular online learning problem. It has been mostly studied in centralized learning settings. With the surging demand of large-scale decentralized model learning, e.g., federated learning, how to retain regret…

Machine Learning · Computer Science 2021-10-05 Chuanhao Li , Hongning Wang

Differential Privacy in Kernelized Contextual Bandits via Random Projections

We consider the problem of contextual kernel bandits with stochastic contexts, where the underlying reward function belongs to a known Reproducing Kernel Hilbert Space. We study this problem under an additional constraint of Differential…

Machine Learning · Statistics 2025-07-21 Nikola Pavlovic , Sudeep Salgia , Qing Zhao

Corruption-Tolerant Gaussian Process Bandit Optimization

We consider the problem of optimizing an unknown (typically non-convex) function with a bounded norm in some Reproducing Kernel Hilbert Space (RKHS), based on noisy bandit feedback. We consider a novel variant of this problem in which the…

Machine Learning · Statistics 2020-03-05 Ilija Bogunovic , Andreas Krause , Jonathan Scarlett