Related papers: Incentive Compatible Active Learning

On the Pros and Cons of Active Learning for Moral Preference Elicitation

Computational preference elicitation methods are tools used to learn people's preferences quantitatively in a given context. Recent works on preference elicitation advocate for active learning as an efficient method to iteratively construct…

Human-Computer Interaction · Computer Science 2024-07-29 Vijay Keswani , Vincent Conitzer , Hoda Heidari , Jana Schaich Borg , Walter Sinnott-Armstrong

Geometry Meets Incentives: Sample-Efficient Incentivized Exploration with Linear Contexts

In the incentivized exploration model, a principal aims to explore and learn over time by interacting with a sequence of self-interested agents. It has been recently understood that the main challenge in designing incentive-compatible…

Computer Science and Game Theory · Computer Science 2025-06-03 Benjamin Schiffer , Mark Sellke

Active Algorithms For Preference Learning Problems with Multiple Populations

In this paper we model the problem of learning preferences of a population as an active learning problem. We propose an algorithm can adaptively choose pairs of items to show to users coming from a heterogeneous population, and use the…

Machine Learning · Statistics 2016-06-23 Aniruddha Bhargava , Ravi Ganti , Robert Nowak

Active Learning within Constrained Environments through Imitation of an Expert Questioner

Active learning agents typically employ a query selection algorithm which solely considers the agent's learning objectives. However, this may be insufficient in more realistic human domains. This work uses imitation learning to enable an…

Machine Learning · Computer Science 2019-07-02 Kalesha Bullard , Yannick Schroecker , Sonia Chernova

Batch Active Preference-Based Learning of Reward Functions

Data generation and labeling are usually an expensive part of learning for robotics. While active learning methods are commonly used to tackle the former problem, preference-based learning is a concept that attempts to solve the latter by…

Machine Learning · Computer Science 2018-10-11 Erdem Bıyık , Dorsa Sadigh

Nearly Optimal Active Preference Learning and Its Application to LLM Alignment

Aligning large language models (LLMs) depends on high-quality datasets of human preference labels, which are costly to collect. Although active learning has been studied to improve sample efficiency relative to passive collection, many…

Machine Learning · Computer Science 2026-02-03 Yao Zhao , Kwang-Sung Jun

Active Learning for Matching Problems

Effective learning of user preferences is critical to easing user burden in various types of matching problems. Equally important is active query selection to further reduce the amount of preference information users must provide. We…

Machine Learning · Computer Science 2012-06-22 Laurent Charlin , Rich Zemel , Craig Boutilier

Admissible Policy Teaching through Reward Design

We study reward design strategies for incentivizing a reinforcement learning agent to adopt a policy from a set of admissible policies. The goal of the reward designer is to modify the underlying reward function cost-efficiently while…

Machine Learning · Computer Science 2022-01-07 Kiarash Banihashem , Adish Singla , Jiarui Gan , Goran Radanovic

Incentive-Compatible Experimental Design

We consider the design of experiments to evaluate treatments that are administered by self-interested agents, each seeking to achieve the highest evaluation and win the experiment. For example, in an advertising experiment, a company wishes…

Methodology · Statistics 2015-09-18 Panos Toulis , David C. Parkes , Elery Pfeffer , James Zou

Adapting Behaviour via Intrinsic Reward: A Survey and Empirical Study

Learning about many things can provide numerous benefits to a reinforcement learning system. For example, learning many auxiliary value functions, in addition to optimizing the environmental reward, appears to improve both exploration and…

Machine Learning · Computer Science 2020-08-25 Cam Linke , Nadia M. Ady , Martha White , Thomas Degris , Adam White

Active Preference Learning using Maximum Regret

We study active preference learning as a framework for intuitively specifying the behaviour of autonomous robots. In active preference learning, a user chooses the preferred behaviour from a set of alternatives, from which the robot learns…

Robotics · Computer Science 2020-09-30 Nils Wilde , Dana Kulic , Stephen L. Smith

Towards Improving Exploration in Self-Imitation Learning using Intrinsic Motivation

Reinforcement Learning has emerged as a strong alternative to solve optimization tasks efficiently. The use of these algorithms highly depends on the feedback signals provided by the environment in charge of informing about how good (or…

Machine Learning · Computer Science 2022-12-01 Alain Andres , Esther Villar-Rodriguez , Javier Del Ser

Incentivized Learning in Principal-Agent Bandit Games

This work considers a repeated principal-agent bandit game, where the principal can only interact with her environment through the agent. The principal and the agent have misaligned objectives and the choice of action is only left to the…

Machine Learning · Statistics 2024-03-07 Antoine Scheid , Daniil Tiapkin , Etienne Boursier , Aymeric Capitaine , El Mahdi El Mhamdi , Eric Moulines , Michael I. Jordan , Alain Durmus

Intuitiveness in Active Teaching

While Machine learning gives rise to astonishing results in automated systems, it is usually at the cost of large data requirements. This makes many successful algorithms from machine learning unsuitable for human-machine interaction, where…

Human-Computer Interaction · Computer Science 2021-09-30 Jan Philip Göpfert , Ulrike Kuhl , Lukas Hindemith , Heiko Wersing , Barbara Hammer

Active learning algorithm through the lens of rejection arguments

Active learning is a paradigm of machine learning which aims at reducing the amount of labeled data needed to train a classifier. Its overall principle is to sequentially select the most informative data points, which amounts to determining…

Statistics Theory · Mathematics 2022-09-01 Christophe Denis , Mohamed Hebiri , Boris Ndjia Njike , Xavier Siebert

Adaptive Incentive Design with Multi-Agent Meta-Gradient Reinforcement Learning

Critical sectors of human society are progressing toward the adoption of powerful artificial intelligence (AI) agents, which are trained individually on behalf of self-interested principals but deployed in a shared environment. Short of…

Multiagent Systems · Computer Science 2021-12-22 Jiachen Yang , Ethan Wang , Rakshit Trivedi , Tuo Zhao , Hongyuan Zha

Constrained Exploration and Recovery from Experience Shaping

We consider the problem of reinforcement learning under safety requirements, in which an agent is trained to complete a given task, typically formalized as the maximization of a reward signal over time, while concurrently avoiding…

Machine Learning · Computer Science 2018-09-25 Tu-Hoa Pham , Giovanni De Magistris , Don Joven Agravante , Subhajit Chaudhury , Asim Munawar , Ryuki Tachibana

Active Preference Inference using Language Models and Probabilistic Reasoning

Actively inferring user preferences, for example by asking good questions, is important for any human-facing decision-making system. Active inference allows such systems to adapt and personalize themselves to nuanced individual preferences.…

Computation and Language · Computer Science 2024-06-27 Wasu Top Piriyakulkij , Volodymyr Kuleshov , Kevin Ellis

Imitating Cost-Constrained Behaviors in Reinforcement Learning

Complex planning and scheduling problems have long been solved using various optimization or heuristic approaches. In recent years, imitation learning that aims to learn from expert demonstrations has been proposed as a viable alternative…

Machine Learning · Computer Science 2024-05-24 Qian Shao , Pradeep Varakantham , Shih-Fen Cheng

Learning to Actively Learn: A Robust Approach

This work proposes a procedure for designing algorithms for specific adaptive data collection tasks like active learning and pure-exploration multi-armed bandits. Unlike the design of traditional adaptive algorithms that rely on…

Machine Learning · Computer Science 2025-03-11 Jifan Zhang , Lalit Jain , Kevin Jamieson