Related papers: Graph-Based Recommendation System

Graph Signal Sampling via Reinforcement Learning

We formulate the problem of sampling and recovering clustered graph signal as a multi-armed bandit (MAB) problem. This formulation lends naturally to learning sampling strategies using the well-known gradient MAB algorithm. In particular,…

Machine Learning · Statistics 2018-05-16 Oleksii Abramenko , Alexander Jung

Local Clustering in Contextual Multi-Armed Bandits

We study identifying user clusters in contextual multi-armed bandits (MAB). Contextual MAB is an effective tool for many real applications, such as content recommendation and online advertisement. In practice, user dependency plays an…

Machine Learning · Computer Science 2023-03-27 Yikun Ban , Jingrui He

Graph Clustering Bandits for Recommendation

We investigate an efficient context-dependent clustering technique for recommender systems based on exploration-exploitation strategies through multi-armed bandits over multiple users. Our algorithm dynamically groups users based on their…

Machine Learning · Statistics 2016-05-03 Shuai Li , Claudio Gentile , Alexandros Karatzoglou

Multi-armed Bandit Learning on a Graph

The multi-armed bandit(MAB) problem is a simple yet powerful framework that has been extensively studied in the context of decision-making under uncertainty. In many real-world applications, such as robotic applications, selecting an arm…

Machine Learning · Computer Science 2023-03-21 Tianpeng Zhang , Kasper Johansson , Na Li

A Field Test of Bandit Algorithms for Recommendations: Understanding the Validity of Assumptions on Human Preferences in Multi-armed Bandits

Personalized recommender systems suffuse modern life, shaping what media we read and what products we consume. Algorithms powering such systems tend to consist of supervised learning-based heuristics, such as latent factor models with a…

Information Retrieval · Computer Science 2023-04-19 Liu Leqi , Giulio Zhou , Fatma Kılınç-Karzan , Zachary C. Lipton , Alan L. Montgomery

Online Clustering of Dueling Bandits

The contextual multi-armed bandit (MAB) is a widely used framework for problems requiring sequential decision-making under uncertainty, such as recommendation systems. In applications involving a large number of users, the performance of…

Machine Learning · Computer Science 2025-02-05 Zhiyong Wang , Jiahang Sun , Mingze Kong , Jize Xie , Qinghua Hu , John C. S. Lui , Zhongxiang Dai

Graph Neural Bandits

Contextual bandits algorithms aim to choose the optimal arm with the highest reward out of a set of candidates based on the contextual information. Various bandit algorithms have been applied to real-world applications due to their ability…

Machine Learning · Computer Science 2023-08-22 Yunzhe Qi , Yikun Ban , Jingrui He

Max-Utility Based Arm Selection Strategy For Sequential Query Recommendations

We consider the query recommendation problem in closed loop interactive learning settings like online information gathering and exploratory analytics. The problem can be naturally modelled using the Multi-Armed Bandits (MAB) framework with…

Machine Learning · Computer Science 2024-03-29 Shameem A. Puthiya Parambath , Christos Anagnostopoulos , Roderick Murray-Smith , Sean MacAvaney , Evangelos Zervas

Context-Aware Bandits

We propose an efficient Context-Aware clustering of Bandits (CAB) algorithm, which can capture collaborative effects. CAB can be easily deployed in a real-world recommendation system, where multi-armed bandits have been shown to perform…

Machine Learning · Computer Science 2017-02-28 Shuai Li , Purushottam Kar

On Context-Dependent Clustering of Bandits

We investigate a novel cluster-of-bandit algorithm CAB for collaborative recommendation tasks that implements the underlying feedback sharing mechanism by estimating the neighborhood of users in a context-dependent manner. CAB makes sharp…

Machine Learning · Computer Science 2017-02-28 Claudio Gentile , Shuai Li , Purushottam Kar , Alexandros Karatzoglou , Evans Etrue , Giovanni Zappella

Federated Online Clustering of Bandits

Contextual multi-armed bandit (MAB) is an important sequential decision-making problem in recommendation systems. A line of works, called the clustering of bandits (CLUB), utilize the collaborative effect over users and dramatically improve…

Machine Learning · Computer Science 2022-09-01 Xutong Liu , Haoru Zhao , Tong Yu , Shuai Li , John C. S. Lui

Task Recommendation in Crowdsourcing Based on Learning Preferences and Reliabilities

Workers participating in a crowdsourcing platform can have a wide range of abilities and interests. An important problem in crowdsourcing is the task recommendation problem, in which tasks that best match a particular worker's preferences…

Human-Computer Interaction · Computer Science 2018-07-30 Qiyu Kang , Wee Peng Tay

A Survey on Practical Applications of Multi-Armed and Contextual Bandits

In recent years, multi-armed bandit (MAB) framework has attracted a lot of attention in various applications, from recommender systems and information retrieval to healthcare and finance, due to its stellar performance combined with certain…

Machine Learning · Computer Science 2019-04-24 Djallel Bouneffouf , Irina Rish

Accelerated learning from recommender systems using multi-armed bandit

Recommendation systems are a vital component of many online marketplaces, where there are often millions of items to potentially present to users who have a wide variety of wants or needs. Evaluating recommender system algorithms is a hard…

Information Retrieval · Computer Science 2019-08-20 Meisam Hejazinia , Kyler Eastman , Shuqin Ye , Abbas Amirabadi , Ravi Divvela

Contextual Multi-armed Bandit Algorithm for Semiparametric Reward Model

Contextual multi-armed bandit (MAB) algorithms have been shown promising for maximizing cumulative rewards in sequential decision tasks such as news article recommendation systems, web page ad placement algorithms, and mobile health.…

Machine Learning · Statistics 2019-02-01 Gi-Soo Kim , Myunghee Cho Paik

Demystifying Online Clustering of Bandits: Enhanced Exploration Under Stochastic and Smoothed Adversarial Contexts

The contextual multi-armed bandit (MAB) problem is crucial in sequential decision-making. A line of research, known as online clustering of bandits, extends contextual MAB by grouping similar users into clusters, utilizing shared features…

Machine Learning · Computer Science 2025-01-03 Zhuohua Li , Maoli Liu , Xiangxiang Dai , John C. S. Lui

Distributed Cooperative Decision-Making in Multiarmed Bandits: Frequentist and Bayesian Algorithms

We study distributed cooperative decision-making under the explore-exploit tradeoff in the multiarmed bandit (MAB) problem. We extend the state-of-the-art frequentist and Bayesian algorithms for single-agent MAB problems to cooperative…

Systems and Control · Computer Science 2019-09-18 Peter Landgren , Vaibhav Srivastava , Naomi Ehrich Leonard

A Gang of Bandits

Multi-armed bandit problems are receiving a great deal of attention because they adequately formalize the exploration-exploitation trade-offs arising in several industrially relevant applications, such as online advertisement and, more…

Machine Learning · Computer Science 2013-11-05 Nicolò Cesa-Bianchi , Claudio Gentile , Giovanni Zappella

Multi-Armed Bandits in Brain-Computer Interfaces

The multi-armed bandit (MAB) problem models a decision-maker that optimizes its actions based on current and acquired new knowledge to maximize its reward. This type of online decision is prominent in many procedures of Brain-Computer…

Artificial Intelligence · Computer Science 2022-11-10 Frida Heskebeck , Carolina Bergeling , Bo Bernhardsson

A Contextual Bandits Approach for Personalization of Hand Gesture Recognition

In human-computer interaction applications like hand gesture recognition, supervised learning models are often trained on a large population of users to achieve high task accuracy. However, due to individual variability in sensor signals…

Human-Computer Interaction · Computer Science 2025-09-12 Duke Lin , Michael Paskett , Ying Yang