Related papers: Supervised Reward Inference

200 papers

Inferring reward functions from human behavior is at the center of value alignment - aligning AI objectives with what we, humans, actually want. But doing so relies on models of how humans behave given their objectives. After decades of…

Machine Learning · Computer Science 2023-10-31 Joey Hong, Kush Bhatia, Anca Dragan

When a robot learns from human examples, most approaches assume that the human partner provides examples of optimal behavior. However, there are applications in which the robot learns from non-expert humans. We argue that the robot should…

Robotics · Computer Science 2020-11-10 Pamela Carreno-Medrano, Stephen L. Smith, Dana Kulic

Inferring intent from observed behavior has been studied extensively within the frameworks of Bayesian inverse planning and inverse reinforcement learning. These methods infer a goal or reward function that best explains the actions of the…

Machine Learning · Computer Science 2019-01-08 Siddharth Reddy, Anca D. Dragan, Sergey Levine

Robots can learn to imitate humans by inferring what the human is optimizing for. One common framework for this is Bayesian reward learning, where the robot treats the human's demonstrations and corrections as observations of their…

Robotics · Computer Science 2023-10-20 Joshua Hoegerman, Dylan P. Losey

The goal of inverse reinforcement learning (IRL) is to infer a reward function that explains the behavior of an agent performing a task. The assumption that most approaches make is that the demonstrated behavior is near-optimal. In many…

Machine Learning · Computer Science 2020-11-20 Luis Haug, Ivan Ovinnikov, Eugene Bykovets

When robots enter everyday human environments, they need to understand their tasks and how they should perform those tasks. To encode these, reward functions, which specify the objective of a robot, are employed. However, designing reward…

Robotics · Computer Science 2022-10-21 Erdem Bıyık

Our goal is for agents to optimize the right reward function, despite how difficult it is for us to specify what that is. Inverse Reinforcement Learning (IRL) enables us to infer reward functions from demonstrations, but it usually assumes…

Machine Learning · Computer Science 2019-06-25 Rohin Shah, Noah Gundotra, Pieter Abbeel, Anca D. Dragan

Reinforcement learning offers the promise of automating the acquisition of complex behavioral skills. However, compared to commonly used and well-understood supervised learning methods, reinforcement learning algorithms can be brittle,…

Machine Learning · Computer Science 2020-01-01 Aviral Kumar, Xue Bin Peng, Sergey Levine

In complex real-world tasks such as robotic manipulation and autonomous driving, collecting expert demonstrations is often more straightforward than specifying precise learning objectives and task descriptions. Learning from expert data can…

Robotics · Computer Science 2025-05-05 Daulet Baimukashev, Gokhan Alcan, Kevin Sebastian Luck, Ville Kyrki

Reward functions are a common way to specify the objective of a robot. As designing reward functions can be extremely challenging, a more promising approach is to directly learn reward functions from human teachers. Importantly, data from…

A misspecified reward can degrade sample efficiency and induce undesired behaviors in reinforcement learning (RL) problems. We propose symbolic reward machines for incorporating high-level task knowledge when specifying the reward signals.…

Artificial Intelligence · Computer Science 2022-04-22 Weichao Zhou, Wenchao Li

Reward function, as an incentive representation that recognizes humans' agency and rationalizes humans' actions, is particularly appealing for modeling human behavior in human-robot interaction. Inverse Reinforcement Learning is an…

Artificial Intelligence · Computer Science 2021-03-09 Ran Tian, Masayoshi Tomizuka, Liting Sun

Model-free reinforcement learning algorithms have exhibited great potential in solving single-task sequential decision-making problems with high-dimensional observations and long horizons, but are known to be hard to generalize across…

Machine Learning · Computer Science 2023-05-30 Boyuan Chen, Chuning Zhu, Pulkit Agrawal, Kaiqing Zhang, Abhishek Gupta

Inverse Reinforcement Learning infers a reward function from expert demonstrations, aiming to encode the behavior and intentions of the expert. Current approaches usually do this with generative and uni-modal models, meaning that they…

Machine Learning · Computer Science 2021-11-16 Niklas Freymuth, Philipp Becker, Gerhard Neumann

It is often difficult to hand-specify what the correct reward function is for a task, so researchers have instead aimed to learn reward functions from human behavior or feedback. The types of behavior interpreted as evidence of the reward…

Machine Learning · Computer Science 2020-12-14 Hong Jun Jeon, Smitha Milli, Anca D. Dragan

Endowing robots with the human ability to learn a growing set of skills over the course of a lifetime as opposed to mastering single tasks is an open problem in robot learning. While multi-task learning approaches have been proposed to…

Robotics · Computer Science 2023-09-19 Muhammad Burhan Hafez, Stefan Wermter

Preference-based reward learning is a popular technique for teaching robots and autonomous systems how a human user wants them to perform a task. Previous works have shown that actively synthesizing preference queries to maximize…

Robotics · Computer Science 2024-03-12 Evan Ellis, Gaurav R. Ghosal, Stuart J. Russell, Anca Dragan, Erdem Bıyık

Observing a human demonstrator manipulate objects provides a rich, scalable and inexpensive source of data for learning robotic policies. However, transferring skills from human videos to a robotic manipulator poses several challenges, not…

Robotics · Computer Science 2023-03-08 Minttu Alakuijala, Gabriel Dulac-Arnold, Julien Mairal, Jean Ponce, Cordelia Schmid

Traditional imitation learning provides a set of methods and algorithms to learn a reward function or policy from expert demonstrations. Learning from demonstration has been shown to be advantageous for navigation tasks as it allows for…

Robotics · Computer Science 2021-08-03 Christian Ellis, Maggie Wigness, John G. Rogers, Craig Lennon, Lance Fiondella

Designing an effective reward function has long been a challenge in reinforcement learning, particularly for complex tasks in unstructured environments. To address this, various learning paradigms have emerged that leverage different forms…

Machine Learning · Computer Science 2025-04-29 Muhammad Qasim Elahi, Somtochukwu Oguchienti, Maheed H. Ahmed, Mahsa Ghasemi