Related papers: Multi-level Feedback Web Links Selection Problem: …

Beyond the Click-Through Rate: Web Link Selection with Multi-level Feedback

The web link selection problem is to select a small subset of web links from a large web link pool, and to place the selected links on a web page that can only accommodate a limited number of links, e.g., advertisements, recommendations, or…

Machine Learning · Computer Science 2018-05-07 Kun Chen , Kechao Cai , Longbo Huang , John C. S. Lui

DCM Bandits: Learning to Rank with Multiple Clicks

A search engine recommends to the user a list of web pages. The user examines this list, from the first page to the last, and clicks on all attractive pages until the user is satisfied. This behavior of the user can be described by the…

Machine Learning · Computer Science 2016-06-02 Sumeet Katariya , Branislav Kveton , Csaba Szepesvári , Zheng Wen

Online Multi-LLM Selection via Contextual Bandits under Unstructured Context Evolution

Large language models (LLMs) exhibit diverse response behaviors, costs, and strengths, making it challenging to select the most suitable LLM for a given user query. We study the problem of adaptive multi-LLM selection in an online setting,…

Machine Learning · Computer Science 2025-06-24 Manhin Poon , XiangXiang Dai , Xutong Liu , Fang Kong , John C. S. Lui , Jinhang Zuo

Top-K Ranking Deep Contextual Bandits for Information Selection Systems

In today's technology environment, information is abundant, dynamic, and heterogeneous in nature. Automated filtering and prioritization of information is based on the distinction between whether the information adds substantial value…

Machine Learning · Computer Science 2022-02-01 Jade Freeman , Michael Rawson

LinkGPT: Teaching Large Language Models To Predict Missing Links

Large Language Models (LLMs) have shown promising results on various language and vision tasks. Recently, there has been growing interest in applying LLMs to graph-based tasks, particularly on Text-Attributed Graphs (TAGs). However, most…

Machine Learning · Computer Science 2024-06-10 Zhongmou He , Jing Zhu , Shengyi Qian , Joyce Chai , Danai Koutra

Automatic Link Selection in Multi-Channel Multiple Access with Link Failures

This paper focuses on the problem of automatic link selection in multi-channel multiple access control using bandit feedback. In particular, a controller assigns multiple users to multiple channels in a time-slotted system, where in each…

Systems and Control · Electrical Eng. & Systems 2026-03-09 Mevan Wijewardena , Michael J. Neely , Haipeng Luo

Neural Dueling Bandits: Preference-Based Optimization with Human Feedback

Contextual dueling bandit is used to model the bandit problems, where a learner's goal is to find the best arm for a given context using observed noisy human preference feedback over the selected arms for the past contexts. However,…

Machine Learning · Computer Science 2025-04-17 Arun Verma , Zhongxiang Dai , Xiaoqiang Lin , Patrick Jaillet , Bryan Kian Hsiang Low

Cascading Hybrid Bandits: Online Learning to Rank for Relevance and Diversity

Relevance ranking and result diversification are two core areas in modern recommender systems. Relevance ranking aims at building a ranked list sorted in decreasing order of item relevance, while result diversification focuses on generating…

Machine Learning · Computer Science 2020-08-13 Chang Li , Haoyun Feng , Maarten de Rijke

Learning-to-Rank with Nested Feedback

Many platforms on the web present ranked lists of content to users, typically optimized for engagement-, satisfaction- or retention- driven metrics. Advances in the Learning-to-Rank (LTR) research literature have enabled rapid growth in…

Information Retrieval · Computer Science 2024-01-09 Hitesh Sagtani , Olivier Jeunen , Aleksei Ustimenko

Learning Context-Aware Representations of Subtrees

This thesis tackles the problem of learning efficient representations of complex, structured data with a natural application to web page and element classification. We hypothesise that the context around the element inside the web page is…

Machine Learning · Computer Science 2021-11-09 Cedric Cook

Literature Review Of Attribute Level And Structure Level Data Linkage Techniques

Data Linkage is an important step that can provide valuable insights for evidence-based decision making, especially for crucial events. Performing sensible queries across heterogeneous databases containing millions of records is a complex…

Databases · Computer Science 2015-10-09 Mohammed Gollapalli

Unbiased Learning-to-Rank with Biased Feedback

Implicit feedback (e.g., clicks, dwell times, etc.) is an abundant source of data in human-interactive systems. While implicit feedback has many advantages (e.g., it is inexpensive to collect, user centric, and timely), its inherent biases…

Information Retrieval · Computer Science 2016-08-17 Thorsten Joachims , Adith Swaminathan , Tobias Schnabel

A Study of Context Dependencies in Multi-page Product Search

In product search, users tend to browse results on multiple search result pages (SERPs) (e.g., for queries on clothing and shoes) before deciding which item to purchase. Users' clicks can be considered as implicit feedback which indicates…

Information Retrieval · Computer Science 2020-01-10 Keping Bi , Choon Hui Teo , Yesh Dattatreya , Vijai Mohan , W. Bruce Croft

Sequential ranking under random semi-bandit feedback

In many web applications, a recommendation is not a single item suggested to a user but a list of possibly interesting contents that may be ranked in some contexts. The combinatorial bandit problem has been studied quite extensively these…

Data Structures and Algorithms · Computer Science 2016-05-27 Hossein Vahabi , Paul Lagrée , Claire Vernade , Olivier Cappé

A Multilingual Entity Linking System for Wikipedia with a Machine-in-the-Loop Approach

Hyperlinks constitute the backbone of the Web; they enable user navigation, information discovery, content ranking, and many other crucial services on the Internet. In particular, hyperlinks found within Wikipedia allow the readers to…

Computers and Society · Computer Science 2021-06-01 Martin Gerlach , Marshall Miller , Rita Ho , Kosta Harlan , Djellel Difallah

Bernoulli Rank-$1$ Bandits for Click Feedback

The probability that a user will click a search result depends both on its relevance and its position on the results page. The position based model explains this behavior by ascribing to every item an attraction probability, and to every…

Machine Learning · Computer Science 2017-03-21 Sumeet Katariya , Branislav Kveton , Csaba Szepesvári , Claire Vernade , Zheng Wen

Efficient Contextual Bandits with Uninformed Feedback Graphs

Bandits with feedback graphs are powerful online learning models that interpolate between the full information and classic bandit problems, capturing many real-life applications. A recent work by Zhang et al. (2023) studies the contextual…

Machine Learning · Computer Science 2024-02-14 Mengxiao Zhang , Yuheng Zhang , Haipeng Luo , Paul Mineiro

When Do We Need LLMs? A Diagnostic for Language-Driven Bandits

We study Contextual Multi-Armed Bandits (CMABs) for non-episodic sequential decision making problems where the context includes both textual and numerical information (e.g., recommendation systems, dynamic portfolio adjustments, offer…

Artificial Intelligence · Computer Science 2026-04-08 Uljad Berdica , Fernando Acero , Anton Ipsen , Parisa Zehtabi , Michael Cashmore , Manuela Veloso

PageLLM: A Multi-Grained Reward Framework for Whole-Page Optimization with Large Language Models

Whole-page optimization (WPO) decides how search and recommendation results are surfaced to users, and large language models (LLMs) open a new route to it by treating page generation as sequence generation. Adapting LLMs to web-scale WPO,…

Machine Learning · Computer Science 2026-05-26 Xinyuan Wang , Liang Wu , Dongjie Wang , Yanjie Fu

A Contextual-Bandit Approach to Personalized News Article Recommendation

Personalized web services strive to adapt their services (advertisements, news articles, etc) to individual users by making use of both content and user information. Despite a few recent advances, this problem remains challenging for at…

Machine Learning · Computer Science 2012-03-05 Lihong Li , Wei Chu , John Langford , Robert E. Schapire