Related papers: Active Preference Learning using Maximum Regret

Towards Preference Learning for Autonomous Ground Robot Navigation Tasks

We are interested in the design of autonomous robot behaviors that learn the preferences of users over continued interactions, with the goal of efficiently executing navigation behaviors in a way that the user expects. In this paper, we…

Robotics · Computer Science 2020-11-06 Cory Hayes , Matthew Marge

Improving User Specifications for Robot Behavior through Active Preference Learning: Framework and Evaluation

An important challenge in human-robot interaction (HRI) is enabling non-expert users to specify complex tasks for autonomous robots. Recently, active preference learning has been applied in HRI to interactively shape a robot's behavior. We…

Robotics · Computer Science 2020-03-19 Nils Wilde , Alexandru Blidaru , Stephen L. Smith , Dana Kulić

Batch Active Preference-Based Learning of Reward Functions

Data generation and labeling are usually an expensive part of learning for robotics. While active learning methods are commonly used to tackle the former problem, preference-based learning is a concept that attempts to solve the latter by…

Machine Learning · Computer Science 2018-10-11 Erdem Bıyık , Dorsa Sadigh

A Generalized Acquisition Function for Preference-based Reward Learning

Preference-based reward learning is a popular technique for teaching robots and autonomous systems how a human user wants them to perform a task. Previous works have shown that actively synthesizing preference queries to maximize…

Robotics · Computer Science 2024-03-12 Evan Ellis , Gaurav R. Ghosal , Stuart J. Russell , Anca Dragan , Erdem Bıyık

Active Reward Learning from Online Preferences

Robot policies need to adapt to human preferences and/or new environments. Human experts may have the domain knowledge required to help robots achieve this adaptation. However, existing works often require costly offline re-training on…

Machine Learning · Computer Science 2023-02-28 Vivek Myers , Erdem Bıyık , Dorsa Sadigh

Learning Preferences for Manipulation Tasks from Online Coactive Feedback

We consider the problem of learning preferences over trajectories for mobile manipulators such as personal robots and assembly line robots. The preferences we learn are more intricate than simple geometric constraints on trajectories; they…

Robotics · Computer Science 2016-01-06 Ashesh Jain , Shikhar Sharma , Thorsten Joachims , Ashutosh Saxena

Bayesian Active Learning for Collaborative Task Specification Using Equivalence Regions

Specifying complex task behaviours while ensuring good robot performance may be difficult for untrained users. We study a framework for users to specify rules for acceptable behaviour in a shared environment such as industrial facilities.…

Robotics · Computer Science 2019-07-25 Nils Wilde , Dana Kulic , Stephen L. Smith

Online Learning and Profit Maximization from Revealed Preferences

We consider the problem of learning from revealed preferences in an online setting. In our framework, each period a consumer buys an optimal bundle of goods from a merchant according to her (linear) utility function and current prices,…

Data Structures and Algorithms · Computer Science 2014-12-02 Kareem Amin , Rachel Cummings , Lili Dworkin , Michael Kearns , Aaron Roth

Active Algorithms For Preference Learning Problems with Multiple Populations

In this paper we model the problem of learning preferences of a population as an active learning problem. We propose an algorithm can adaptively choose pairs of items to show to users coming from a heterogeneous population, and use the…

Machine Learning · Statistics 2016-06-23 Aniruddha Bhargava , Ravi Ganti , Robert Nowak

Learning Human Preferences Over Robot Behavior as Soft Planning Constraints

Preference learning has long been studied in Human-Robot Interaction (HRI) in order to adapt robot behavior to specific user needs and desires. Typically, human preferences are modeled as a scalar function; however, such a formulation…

Robotics · Computer Science 2024-04-01 Austin Narcomey , Nathan Tsoi , Ruta Desai , Marynel Vázquez

Batch Active Learning of Reward Functions from Human Preferences

Data generation and labeling are often expensive in robot learning. Preference-based learning is a concept that enables reliable labeling by querying users with preference questions. Active querying methods are commonly employed in…

Machine Learning · Computer Science 2024-02-27 Erdem Bıyık , Nima Anari , Dorsa Sadigh

Optimal Cost-Preference Trade-off Planning with Multiple Temporal Tasks

Autonomous robots are increasingly utilized in realistic scenarios with multiple complex tasks. In these scenarios, there may be a preferred way of completing all of the given tasks, but it is often in conflict with optimal execution.…

Robotics · Computer Science 2023-06-26 Peter Amorese , Morteza Lahijanian

Active Learning for Matching Problems

Effective learning of user preferences is critical to easing user burden in various types of matching problems. Equally important is active query selection to further reduce the amount of preference information users must provide. We…

Machine Learning · Computer Science 2012-06-22 Laurent Charlin , Rich Zemel , Craig Boutilier

Exploiting Prior Knowledge in Preferential Learning of Individualized Autonomous Vehicle Driving Styles

Trajectory planning for automated vehicles commonly employs optimization over a moving horizon - Model Predictive Control - where the cost function critically influences the resulting driving style. However, finding a suitable cost function…

Systems and Control · Electrical Eng. & Systems 2025-10-20 Lukas Theiner , Sebastian Hirt , Alexander Steinke , Rolf Findeisen

Online Learning with Preference Feedback

We propose a new online learning model for learning with preference feedback. The model is especially suited for applications like web search and recommender systems, where preference data is readily available from implicit user feedback…

Machine Learning · Computer Science 2011-11-04 Pannagadatta K. Shivaswamy , Thorsten Joachims

Bayes-Optimal Entropy Pursuit for Active Choice-Based Preference Learning

We analyze the problem of learning a single user's preferences in an active learning setting, sequentially and adaptively querying the user over a finite time horizon. Learning is conducted via choice-based queries, where the user selects…

Machine Learning · Statistics 2017-02-27 Stephen N. Pallone , Peter I. Frazier , Shane G. Henderson

Learning Trajectory Preferences for Manipulators via Iterative Improvement

We consider the problem of learning good trajectories for manipulation tasks. This is challenging because the criterion defining a good trajectory varies with users, tasks and environments. In this paper, we propose a co-active online…

Robotics · Computer Science 2015-01-30 Ashesh Jain , Brian Wojcik , Thorsten Joachims , Ashutosh Saxena

APReL: A Library for Active Preference-based Reward Learning Algorithms

Reward learning is a fundamental problem in human-robot interaction to have robots that operate in alignment with what their human user wants. Many preference-based learning algorithms and active querying techniques have been proposed as a…

Machine Learning · Computer Science 2022-01-05 Erdem Bıyık , Aditi Talati , Dorsa Sadigh

Incentive Compatible Active Learning

We consider active learning under incentive compatibility constraints. The main application of our results is to economic experiments, in which a learner seeks to infer the parameters of a subject's preferences: for example their attitudes…

Computer Science and Game Theory · Computer Science 2019-11-15 Federico Echenique , Siddharth Prasad

Asking Easy Questions: A User-Friendly Approach to Active Reward Learning

Robots can learn the right reward function by querying a human expert. Existing approaches attempt to choose questions where the robot is most uncertain about the human's response; however, they do not consider how easy it will be for the…

Robotics · Computer Science 2019-10-11 Erdem Bıyık , Malayandi Palan , Nicholas C. Landolfi , Dylan P. Losey , Dorsa Sadigh