Related papers: Learning Preferences for Interactive Autonomy

Learning Reward Functions from Diverse Sources of Human Feedback: Optimally Integrating Demonstrations and Preferences

Reward functions are a common way to specify the objective of a robot. As designing reward functions can be extremely challenging, a more promising approach is to directly learn reward functions from human teachers. Importantly, data from…

Robotics · Computer Science 2021-08-05 Erdem Bıyık , Dylan P. Losey , Malayandi Palan , Nicholas C. Landolfi , Gleb Shevchuk , Dorsa Sadigh

Preference-based Learning of Reward Function Features

Preference-based learning of reward functions, where the reward function is learned using comparison data, has been well studied for complex robotic tasks such as autonomous driving. Existing algorithms have focused on learning reward…

Robotics · Computer Science 2021-03-05 Sydney M. Katz , Amir Maleki , Erdem Bıyık , Mykel J. Kochenderfer

Learning Reward Functions by Integrating Human Demonstrations and Preferences

Our goal is to accurately and efficiently learn reward functions for autonomous robots. Current approaches to this problem include inverse reinforcement learning (IRL), which uses expert demonstrations, and preference-based learning, which…

Robotics · Computer Science 2019-06-24 Malayandi Palan , Nicholas C. Landolfi , Gleb Shevchuk , Dorsa Sadigh

Unified Learning from Demonstrations, Corrections, and Preferences during Physical Human-Robot Interaction

Humans can leverage physical interaction to teach robot arms. This physical interaction takes multiple forms depending on the task, the user, and what the robot has learned so far. State-of-the-art approaches focus on learning from a single…

Robotics · Computer Science 2024-01-11 Shaunak A. Mehta , Dylan P. Losey

Towards Learning Reward Functions from User Interactions

In the physical world, people have dynamic preferences, e.g., the same situation can lead to satisfaction for some humans and to frustration for others. Personalization is called for. The same observation holds for online behavior with…

Information Retrieval · Computer Science 2017-08-16 Ziming Li , Julia Kiseleva , Maarten de Rijke , Artem Grotov

Learning Reward Functions for Robotic Manipulation by Observing Humans

Observing a human demonstrator manipulate objects provides a rich, scalable and inexpensive source of data for learning robotic policies. However, transferring skills from human videos to a robotic manipulator poses several challenges, not…

Robotics · Computer Science 2023-03-08 Minttu Alakuijala , Gabriel Dulac-Arnold , Julien Mairal , Jean Ponce , Cordelia Schmid

Learning Human Rewards by Inferring Their Latent Intelligence Levels in Multi-Agent Games: A Theory-of-Mind Approach with Application to Driving Data

Reward function, as an incentive representation that recognizes humans' agency and rationalizes humans' actions, is particularly appealing for modeling human behavior in human-robot interaction. Inverse Reinforcement Learning is an…

Artificial Intelligence · Computer Science 2021-03-09 Ran Tian , Masayoshi Tomizuka , Liting Sun

Evaluating the Impact of Personalized Value Alignment in Human-Robot Interaction: Insights into Trust and Team Performance Outcomes

This paper examines the effect of real-time, personalized alignment of a robot's reward function to the human's values on trust and team performance. We present and compare three distinct robot interaction strategies: a non-learner strategy…

Robotics · Computer Science 2023-11-28 Shreyas Bhat , Joseph B. Lyons , Cong Shi , X. Jessie Yang

Reinforcement Learning from Multi-level and Episodic Human Feedback

Designing an effective reward function has long been a challenge in reinforcement learning, particularly for complex tasks in unstructured environments. To address this, various learning paradigms have emerged that leverage different forms…

Machine Learning · Computer Science 2025-04-29 Muhammad Qasim Elahi , Somtochukwu Oguchienti , Maheed H. Ahmed , Mahsa Ghasemi

Inducing Structure in Reward Learning by Learning Features

Reward learning enables robots to learn adaptable behaviors from human input. Traditional methods model the reward as a linear function of hand-crafted features, but that requires specifying all the relevant features a priori, which is…

Robotics · Computer Science 2022-01-19 Andreea Bobu , Marius Wiggert , Claire Tomlin , Anca D. Dragan

Feature Expansive Reward Learning: Rethinking Human Input

When a person is not satisfied with how a robot performs a task, they can intervene to correct it. Reward learning methods enable the robot to adapt its reward function online based on such human input, but they rely on handcrafted…

Robotics · Computer Science 2021-01-13 Andreea Bobu , Marius Wiggert , Claire Tomlin , Anca D. Dragan

Reinforcement Learning Approaches in Social Robotics

This article surveys reinforcement learning approaches in social robotics. Reinforcement learning is a framework for decision-making problems in which an agent interacts through trial-and-error with its environment to discover an optimal…

Robotics · Computer Science 2021-02-12 Neziha Akalin , Amy Loutfi

Effect of Adapting to Human Preferences on Trust in Human-Robot Teaming

We present the effect of adapting to human preferences on trust in a human-robot teaming task. The team performs a task in which the robot acts as an action recommender to the human. It is assumed that the behavior of the human and the…

Robotics · Computer Science 2023-09-12 Shreyas Bhat , Joseph B. Lyons , Cong Shi , X. Jessie Yang

Human Preference Modeling Using Visual Motion Prediction Improves Robot Skill Learning from Egocentric Human Video

We present an approach to robot learning from egocentric human videos by modeling human preferences in a reward function and optimizing robot behavior to maximize this reward. Prior work on reward learning from human videos attempts to…

Robotics · Computer Science 2026-02-13 Mrinal Verghese , Christopher G. Atkeson

Representation Alignment from Human Feedback for Cross-Embodiment Reward Learning from Mixed-Quality Demonstrations

We study the problem of cross-embodiment inverse reinforcement learning, where we wish to learn a reward function from video demonstrations in one or more embodiments and then transfer the learned reward to a different embodiment (e.g.,…

Robotics · Computer Science 2024-08-13 Connor Mattson , Anurag Aribandi , Daniel S. Brown

Explaining Reward Functions to Humans for Better Human-Robot Collaboration

Explainable AI techniques that describe agent reward functions can enhance human-robot collaboration in a variety of settings. One context where human understanding of agent reward functions is particularly beneficial is in the value…

Robotics · Computer Science 2021-10-11 Lindsay Sanneman , Julie Shah

Feedback-efficient Active Preference Learning for Socially Aware Robot Navigation

Socially aware robot navigation, where a robot is required to optimize its trajectory to maintain comfortable and compliant spatial interactions with humans in addition to reaching its goal without collisions, is a fundamental yet…

Robotics · Computer Science 2022-08-02 Ruiqi Wang , Weizheng Wang , Byung-Cheol Min

Intrinsically motivated reinforcement learning for human-robot interaction in the real-world

For a natural social human-robot interaction, it is essential for a robot to learn the human-like social skills. However, learning such skills is notoriously hard due to the limited availability of direct instructions from people to teach a…

Robotics · Computer Science 2018-04-17 Ahmed Hussain Qureshi , Yutaka Nakamura , Yuichiro Yoshikawa , Hiroshi Ishiguro

A Generalized Acquisition Function for Preference-based Reward Learning

Preference-based reward learning is a popular technique for teaching robots and autonomous systems how a human user wants them to perform a task. Previous works have shown that actively synthesizing preference queries to maximize…

Robotics · Computer Science 2024-03-12 Evan Ellis , Gaurav R. Ghosal , Stuart J. Russell , Anca Dragan , Erdem Bıyık

Pragmatic Feature Preferences: Learning Reward-Relevant Preferences from Human Input

Humans use social context to specify preferences over behaviors, i.e. their reward functions. Yet, algorithms for inferring reward models from preference data do not take this social learning view into account. Inspired by pragmatic human…

Machine Learning · Computer Science 2024-05-24 Andi Peng , Yuying Sun , Tianmin Shu , David Abel