Related papers: Online Network Source Optimization with Graph-Kern…

Multi-armed Bandit Learning on a Graph

The multi-armed bandit (MAB) problem is a simple yet powerful framework that has been extensively studied in the context of decision-making under uncertainty. In many real-world applications, such as robotic applications, selecting an arm…

Machine Learning · Computer Science 2023-03-21 Tianpeng Zhang, Kasper Johansson, Na Li

A Multi-Armed Bandit Approach to Online Selection and Evaluation of Generative Models

Existing frameworks for evaluating and comparing generative models consider an offline setting, where the evaluator has access to large batches of data produced by the models. However, in practical scenarios, the goal is often to identify…

Machine Learning · Computer Science 2025-03-12 Xiaoyan Hu, Ho-fung Leung, Farzan Farnia

Exploring Partially Observed Networks with Nonparametric Bandits

Real-world networks such as social and communication networks are too large to be observed entirely. Such networks are often partially observed such that network size, network topology, and nodes of the original network are unknown. In this…

Machine Learning · Statistics 2018-04-20 Kaushalya Madhawa, Tsuyoshi Murata

Graph Signal Sampling via Reinforcement Learning

We formulate the problem of sampling and recovering a clustered graph signal as a multi-armed bandit (MAB) problem. This formulation lends itself naturally to learning sampling strategies using the well-known gradient MAB algorithm. In particular,…

Machine Learning · Statistics 2018-05-16 Oleksii Abramenko, Alexander Jung

PAK-UCB Contextual Bandit: An Online Learning Approach to Prompt-Aware Selection of Generative Models and LLMs

Selecting a sample generation scheme from multiple prompt-based generative models, including large language models (LLMs) and prompt-guided image and video generation models, is typically addressed by choosing the model that maximizes an…

Machine Learning · Computer Science 2025-09-05 Xiaoyan Hu, Ho-fung Leung, Farzan Farnia

Nearly Optimal Adaptive Procedure with Change Detection for Piecewise-Stationary Bandit

Multi-armed bandit (MAB) is a class of online learning problems where a learning agent aims to maximize its expected cumulative reward while repeatedly selecting to pull arms with unknown reward distributions. We consider a scenario where…

Machine Learning · Statistics 2019-01-25 Yang Cao, Zheng Wen, Branislav Kveton, Yao Xie

Combinatorial Rising Bandits

Combinatorial online learning is a fundamental task for selecting the optimal action (or super arm) as a combination of base arms in sequential interactions with systems providing stochastic rewards. It is applicable to diverse domains such…

Machine Learning · Computer Science 2026-03-04 Seockbean Song, Youngsik Yoon, Siwei Wang, Wei Chen, Jungseul Ok

Online Graph Learning under Smoothness Priors

The growing success of graph signal processing (GSP) approaches relies heavily on prior identification of a graph over which network data admit certain regularity. However, adaptation to increasingly dynamic environments as well as demands…

Machine Learning · Computer Science 2021-03-08 Seyed Saman Saboksayr, Gonzalo Mateos, Mujdat Cetin

An Online Algorithm for Computation Offloading in Non-Stationary Environments

We consider the latency minimization problem in a task-offloading scenario, where multiple servers are available to the user equipment for outsourcing computational tasks. To account for the temporally dynamic nature of the wireless links…

Signal Processing · Electrical Eng. & Systems 2020-06-23 Aniq Ur Rahman, Gourab Ghatak, Antonio De Domenico

Neural Bandit with Arm Group Graph

Contextual bandits aim to identify among a set of arms the optimal one with the highest reward based on their contextual information. Motivated by the fact that the arms usually exhibit group behaviors and the mutual impacts exist among…

Machine Learning · Computer Science 2022-06-13 Yunzhe Qi, Yikun Ban, Jingrui He

Learning and Fairness in Energy Harvesting: A Maximin Multi-Armed Bandits Approach

Recent advances in wireless radio frequency (RF) energy harvesting allow sensor nodes to increase their lifespan by remotely charging their batteries. The amount of energy harvested by the nodes varies depending on their ambient…

Machine Learning · Computer Science 2020-06-17 Debamita Ghosh, Arun Verma, Manjesh K. Hanawal

Graph Neural Bandits

Contextual bandit algorithms aim to choose the optimal arm with the highest reward out of a set of candidates based on the contextual information. Various bandit algorithms have been applied to real-world applications due to their ability…

Machine Learning · Computer Science 2023-08-22 Yunzhe Qi, Yikun Ban, Jingrui He

Decentralized Contextual Bandits with Network Adaptivity

We consider contextual linear bandits over networks, a class of sequential decision-making problems where learning occurs simultaneously across multiple locations and the reward distributions share structural similarities while also…

Machine Learning · Computer Science 2025-08-26 Chuyun Deng, Huiwen Jia

Distributed Optimization via Kernelized Multi-armed Bandits

Multi-armed bandit algorithms provide solutions for sequential decision-making where learning takes place by interacting with the environment. In this work, we model a distributed optimization problem as a multi-agent kernelized multi-armed…

Machine Learning · Computer Science 2023-12-11 Ayush Rai, Shaoshuai Mou

Multi-Objective Generalized Linear Bandits

In this paper, we study the multi-objective bandits (MOB) problem, where a learner repeatedly selects one arm to play and then receives a reward vector consisting of multiple objectives. MOB has found many real-world applications as varied…

Machine Learning · Computer Science 2019-05-31 Shiyin Lu, Guanghui Wang, Yao Hu, Lijun Zhang

Multi-Armed Bandit Learning in IoT Networks: Learning helps even in non-stationary settings

Setting up future Internet of Things (IoT) networks will require supporting more and more communicating devices. We prove that intelligent devices in unlicensed bands can use Multi-Armed Bandit (MAB) learning algorithms to improve…

Networking and Internet Architecture · Computer Science 2018-07-03 Rémi Bonnefoi, Lilian Besson, Christophe Moy, Emilie Kaufmann, Jacques Palicot

Online Network Inference from Graph-Stationary Signals with Hidden Nodes

Graph learning is the fundamental task of estimating unknown graph connectivity from available data. Typical approaches assume that not only is all information available simultaneously but also that all nodes can be observed. However, in…

Machine Learning · Computer Science 2024-09-16 Andrei Buciulea, Madeline Navarro, Samuel Rey, Santiago Segarra, Antonio G. Marques

Bandit Sampling for Multiplex Networks

Graph neural networks have gained prominence due to their excellent performance in many classification and prediction tasks. In particular, they are used for node classification and link prediction which have a wide range of applications in…

Machine Learning · Computer Science 2022-02-09 Cenk Baykal, Vamsi K. Potluru, Sameena Shah, Manuela M. Veloso

GraB-sampler: Optimal Permutation-based SGD Data Sampler for PyTorch

The online Gradient Balancing (GraB) algorithm, which greedily chooses the example ordering by solving the herding problem using per-sample gradients, is proved to be the theoretically optimal solution that is guaranteed to outperform Random…

Machine Learning · Computer Science 2023-10-02 Guanghao Wei

Learning the Optimal Path and DNN Partition for Collaborative Edge Inference

Recent advancements in Deep Neural Networks (DNNs) have catalyzed the development of numerous intelligent mobile applications and services. However, they also introduce significant computational challenges for resource-constrained mobile…

Machine Learning · Computer Science 2024-10-04 Yin Huang, Letian Zhang, Jie Xu