Related papers: Collaborative Learning in Kernel-based Bandits for…

200 papers

We consider distributed kernel bandits where $N$ agents aim to collaboratively maximize an unknown reward function that lies in a reproducing kernel Hilbert space. Each agent sequentially queries the function to obtain noisy observations at…

Machine Learning · Computer Science 2024-02-21 Nikola Pavlovic, Sudeep Salgia, Qing Zhao

The demand for collaborative and private bandit learning across multiple agents is surging due to the growing quantity of data generated from distributed systems. Federated bandit learning has emerged as a promising framework for private,…

Machine Learning · Computer Science 2024-03-04 Ethan Blaser, Chuanhao Li, Hongning Wang

Multi-armed bandit algorithms provide solutions for sequential decision-making where learning takes place by interacting with the environment. In this work, we model a distributed optimization problem as a multi-agent kernelized multi-armed…

Machine Learning · Computer Science 2023-12-11 Ayush Rai, Shaoshuai Mou

We consider a kernelized bandit problem with a compact arm set $X \subset \mathbb{R}^d$ and a fixed but unknown reward function $f^*$ with a finite norm in some Reproducing Kernel Hilbert Space (RKHS). We propose a class of…

Machine Learning · Computer Science 2025-06-13 Bingshan Hu, Zheng He, Danica J. Sutherland

In this paper we propose a novel framework for decentralized, online learning by many learners. At each moment of time, an instance characterized by a certain context may arrive to each learner; based on the context, the learner can select…

Machine Learning · Computer Science 2015-03-24 Cem Tekin, Mihaela van der Schaar

We study a generalization of the problem of online learning in adversarial linear contextual bandits by incorporating loss functions that belong to a reproducing kernel Hilbert space, which allows for a more flexible modeling of complex…

Machine Learning · Statistics 2023-10-04 Gergely Neu, Julia Olkhovskaya, Sattar Vakili

Contextual bandit algorithms have been recently studied under the federated learning setting to satisfy the demand of keeping data decentralized and pushing the learning of bandit models to the client side. But limited by the required…

Machine Learning · Computer Science 2022-10-14 Chuanhao Li, Hongning Wang

Cooperative multi-agent decision making involves a group of agents cooperatively solving learning problems while communicating over a network with delays. In this paper, we consider the kernelised contextual bandit problem, where the reward…

Machine Learning · Computer Science 2020-08-17 Abhimanyu Dubey, Alex Pentland

We consider the stochastic bandit problem with a continuous set of arms, with the expected reward function over the arms assumed to be fixed but unknown. We provide two new Gaussian process-based algorithms for continuous bandit…

Machine Learning · Computer Science 2017-05-18 Sayak Ray Chowdhury, Aditya Gopalan

In the kernelized bandit problem, a learner aims to sequentially compute the optimum of a function lying in a reproducing kernel Hilbert space given only noisy evaluations at sequentially chosen points. In particular, the learner aims to…

Machine Learning · Computer Science 2023-08-15 Justin Whitehouse, Zhiwei Steven Wu, Aaditya Ramdas

Consider the sequential optimization of an expensive to evaluate and possibly non-convex objective function $f$ from noisy feedback, that can be considered as a continuum-armed bandit problem. Upper bounds on the regret performance of…

Machine Learning · Statistics 2021-03-11 Sattar Vakili, Kia Khezeli, Victor Picheny

This paper addresses the problem of distributed learning under communication constraints, motivated by distributed signal processing in wireless sensor networks and data mining with distributed databases. After formalizing a general model…

Machine Learning · Computer Science 2016-11-15 Joel B. Predd, Sanjeev R. Kulkarni, H. Vincent Poor

We consider multi-agent stochastic optimization problems over reproducing kernel Hilbert spaces (RKHS). In this setting, a network of interconnected agents aims to learn decision functions, i.e., nonlinear statistical models, that are…

Optimization and Control · Mathematics 2018-07-04 Alec Koppel, Santiago Paternain, Cedric Richard, Alejandro Ribeiro

This paper studies federated linear contextual bandits under the notion of user-level differential privacy (DP). We first introduce a unified federated bandits framework that can accommodate various definitions of DP in the sequential…

Machine Learning · Computer Science 2023-06-14 Ruiquan Huang, Huanyu Zhang, Luca Melis, Milan Shen, Meisam Hajzinia, Jing Yang

This paper introduces a novel approach to personalised federated learning within the $\mathcal{X}$-armed bandit framework, addressing the challenge of optimising both local and global objectives in a highly heterogeneous environment. Our…

Machine Learning · Statistics 2024-09-12 Ali Arabzadeh, James A. Grant, David S. Leslie

We study the problem of federated stochastic multi-arm contextual bandits with unknown contexts, in which $M$ agents are faced with different bandits and collaborate to learn. The communication model consists of a central server and the…

Machine Learning · Computer Science 2024-01-31 Jiabin Lin, Shana Moothedath

We consider online learning for minimizing regret in unknown, episodic Markov decision processes (MDPs) with continuous states and actions. We develop variants of the UCRL and posterior sampling algorithms that employ nonparametric Gaussian…

Machine Learning · Computer Science 2019-01-04 Sayak Ray Chowdhury, Aditya Gopalan

Linear contextual bandit is a popular online learning problem. It has been mostly studied in centralized learning settings. With the surging demand of large-scale decentralized model learning, e.g., federated learning, how to retain regret…

Machine Learning · Computer Science 2021-10-05 Chuanhao Li, Hongning Wang

We consider the problem of contextual kernel bandits with stochastic contexts, where the underlying reward function belongs to a known Reproducing Kernel Hilbert Space. We study this problem under an additional constraint of Differential…

Machine Learning · Statistics 2025-07-21 Nikola Pavlovic, Sudeep Salgia, Qing Zhao

We consider the problem of optimizing an unknown (typically non-convex) function with a bounded norm in some Reproducing Kernel Hilbert Space (RKHS), based on noisy bandit feedback. We consider a novel variant of this problem in which the…

Machine Learning · Statistics 2020-03-05 Ilija Bogunovic, Andreas Krause, Jonathan Scarlett