Related papers: Personalized Web Search

The Use of Bandit Algorithms in Intelligent Interactive Recommender Systems

In today's business marketplace, many high-tech Internet enterprises constantly explore innovative ways to provide optimal online user experiences for gaining competitive advantages. The great needs of developing intelligent interactive…

Information Retrieval · Computer Science 2021-07-02 Qing Wang

A Gang of Bandits

Multi-armed bandit problems are receiving a great deal of attention because they adequately formalize the exploration-exploitation trade-offs arising in several industrially relevant applications, such as online advertisement and, more…

Machine Learning · Computer Science 2013-11-05 Nicolò Cesa-Bianchi , Claudio Gentile , Giovanni Zappella

Getting too personal(ized): The importance of feature choice in online adaptive algorithms

Digital educational technologies offer the potential to customize students' experiences and learn what works for which students, enhancing the technology as more students interact with it. We consider whether and when attempting to discover…

Artificial Intelligence · Computer Science 2023-09-07 ZhaoBin Li , Luna Yee , Nathaniel Sauerberg , Irene Sakson , Joseph Jay Williams , Anna N. Rafferty

Hierarchical Exploration for Accelerating Contextual Bandits

Contextual bandit learning is an increasingly popular approach to optimizing recommender systems via user feedback, but can be slow to converge in practice due to the need for exploring a large feature space. In this paper, we propose a…

Machine Learning · Computer Science 2012-07-03 Yisong Yue , Sue Ann Hong , Carlos Guestrin

Identifiable Latent Bandits: Leveraging observational data for personalized decision-making

Sequential decision-making algorithms such as multi-armed bandits can find optimal personalized decisions, but are notoriously sample-hungry. In personalized medicine, for example, training a bandit from scratch for every patient is…

Machine Learning · Computer Science 2026-05-12 Ahmet Zahid Balcıoğlu , Newton Mwai , Emil Carlsson , Fredrik D. Johansson

Exploration in Interactive Personalized Music Recommendation: A Reinforcement Learning Approach

Current music recommender systems typically act in a greedy fashion by recommending songs with the highest user ratings. Greedy recommendation, however, is suboptimal over the long term: it does not actively gather information on user…

Multimedia · Computer Science 2013-11-26 Xinxi Wang , Yi Wang , David Hsu , Ye Wang

Learning to Rank in the Position Based Model with Bandit Feedback

Personalization is a crucial aspect of many online experiences. In particular, content ranking is often a key component in delivering sophisticated personalization results. Commonly, supervised learning-to-rank methods are applied, which…

Machine Learning · Computer Science 2020-04-29 Beyza Ermis , Patrick Ernst , Yannik Stein , Giovanni Zappella

A Contextual-Bandit Approach to Personalized News Article Recommendation

Personalized web services strive to adapt their services (advertisements, news articles, etc) to individual users by making use of both content and user information. Despite a few recent advances, this problem remains challenging for at…

Machine Learning · Computer Science 2012-03-05 Lihong Li , Wei Chu , John Langford , Robert E. Schapire

Player Modeling via Multi-Armed Bandits

This paper focuses on building personalized player models solely from player behavior in the context of adaptive games. We present two main contributions: The first is a novel approach to player modeling based on multi-armed bandits (MABs).…

Artificial Intelligence · Computer Science 2021-02-11 Robert C. Gray , Jichen Zhu , Dannielle Arigo , Evan Forman , Santiago Ontañón

Provably Efficient Multi-Objective Bandit Algorithms under Preference-Centric Customization

Multi-objective multi-armed bandit (MO-MAB) problems traditionally aim to achieve Pareto optimality. However, real-world scenarios often involve users with varying preferences across objectives, resulting in a Pareto-optimal arm that may…

Machine Learning · Computer Science 2025-11-18 Linfeng Cao , Ming Shi , Ness B. Shroff

Enhancing Evolutionary Conversion Rate Optimization via Multi-armed Bandit Algorithms

Conversion rate optimization means designing web interfaces such that more visitors perform a desired action (such as register or purchase) on the site. One promising approach, implemented in Sentient Ascend, is to optimize the design using…

Neural and Evolutionary Computing · Computer Science 2018-11-19 Xin Qiu , Risto Miikkulainen

Learning to Personalize for Web Search Sessions

The task of session search focuses on using interaction data to improve relevance for the user's next query at the session level. In this paper, we formulate session search as a personalization task under the framework of learning to rank.…

Information Retrieval · Computer Science 2020-09-18 Saad Aloteibi , Stephen Clark

Graph Clustering Bandits for Recommendation

We investigate an efficient context-dependent clustering technique for recommender systems based on exploration-exploitation strategies through multi-armed bandits over multiple users. Our algorithm dynamically groups users based on their…

Machine Learning · Statistics 2016-05-03 Shuai Li , Claudio Gentile , Alexandros Karatzoglou

Learning Robust Search Strategies Using a Bandit-Based Approach

Effective solving of constraint problems often requires choosing good or specific search heuristics. However, choosing or designing a good search heuristic is non-trivial and is often a manual process. In this paper, rather than manually…

Artificial Intelligence · Computer Science 2018-05-11 Wei Xia , Roland H. C. Yap

A Survey of Online Experiment Design with the Stochastic Multi-Armed Bandit

Adaptive and sequential experiment design is a well-studied area in numerous domains. We survey and synthesize the work of the online statistical learning paradigm referred to as multi-armed bandits integrating the existing research as a…

Machine Learning · Statistics 2015-11-04 Giuseppe Burtini , Jason Loeppky , Ramon Lawrence

Selectively Contextual Bandits

Contextual bandits are widely used in industrial personalization systems. These online learning frameworks learn a treatment assignment policy in the presence of treatment effects that vary with the observed contextual features of the…

Machine Learning · Computer Science 2022-05-11 Claudia Roberts , Maria Dimakopoulou , Qifeng Qiao , Ashok Chandrashekhar , Tony Jebara

Multidimensional User Data Model for Web Personalization

Personalization is being applied to great extend in many systems. This paper presents a multi-dimensional user data model and its application in web search. Online and Offline activities of the user are tracked for creating the user model.…

Information Retrieval · Computer Science 2013-06-20 Nithin K. Anil , Sharath Basil Kurian , Aby Abahai T , Surekha Mariam Varghese

Survey: Multi-Armed Bandits Meet Large Language Models

Bandit algorithms and Large Language Models (LLMs) have emerged as powerful tools in artificial intelligence, each addressing distinct yet complementary challenges in decision-making and natural language processing. This survey explores the…

Artificial Intelligence · Computer Science 2025-10-01 Djallel Bouneffouf , Raphael Feraud

Clustered Bandits

We consider a multi-armed bandit setting that is inspired by real-world applications in e-commerce. In our setting, there are a few types of users, each with a specific response to the different arms. When a user enters the system, his type…

Machine Learning · Computer Science 2015-03-20 Loc Bui , Ramesh Johari , Shie Mannor

Personalize Web Searching Strategies Classification and Comparison

Personalization is becoming very important direction in semantic web search for the users that needs to find appropriate information. In this paper, a classification of web personalization is proposed and semantic web search tools are…

Information Retrieval · Computer Science 2022-03-28 Mariya Evtimova-Gardair , Ivan Momtchev