English
Related papers

Related papers: Learning Task Specifications from Demonstrations

200 papers

In many settings (e.g., robotics) demonstrations provide a natural way to specify tasks; however, most methods for learning from demonstrations either do not provide guarantees that the artifacts learned for the tasks, such as rewards or…

Machine Learning · Computer Science 2020-05-19 Marcell Vazquez-Chanlatte , Sanjit A. Seshia

Learning-from-demonstrations is an emerging paradigm to obtain effective robot control policies for complex tasks via reinforcement learning without the need to explicitly design reward functions. However, it is susceptible to imperfections…

Robotics · Computer Science 2021-02-16 Aniruddh G. Puranic , Jyotirmoy V. Deshmukh , Stefanos Nikolaidis

When observing task demonstrations, human apprentices are able to identify whether a given task is executed correctly long before they gain expertise in actually performing that task. Prior research into learning from demonstrations (LfD)…

Artificial Intelligence · Computer Science 2021-07-08 Ankit Shah , Pritish Kamath , Shen Li , Patrick Craven , Kevin Landers , Kevin Oden , Julie Shah

We extend the learning from demonstration paradigm by providing a method for learning unknown constraints shared across tasks, using demonstrations of the tasks, their cost functions, and knowledge of the system dynamics and control…

Robotics · Computer Science 2019-02-22 Glen Chou , Dmitry Berenson , Necmiye Ozay

Specifying tasks for robotic systems traditionally requires coding expertise, deep domain knowledge, and significant time investment. While learning from demonstration offers a promising alternative, existing methods often struggle with…

Robotics · Computer Science 2024-09-12 Mattijs Baert , Sam Leroux , Pieter Simoens

Learning reward functions from demonstrations assumes that demonstrations provide adequate supervision over all features -- or task-relevant aspects of behavior. In practice, demonstrations are often imperfect: humans may under-emphasize…

Robotics · Computer Science 2026-05-25 Helena Merker , Nick Walker , Andreea Bobu

Learning from Demonstration~(LfD) should capture not only how a task is executed, but also its high-level task structure that explains the demonstrated behavior. As robots become more autonomous, such task representations must be…

Robotics · Computer Science 2026-05-27 Oleh Borys , Karla Stepanova

In the learning from demonstration (LfD) paradigm, understanding and evaluating the demonstrated behaviors plays a critical role in extracting control policies for robots. Without this knowledge, a robot may infer incorrect reward functions…

Robotics · Computer Science 2022-12-20 Aniruddh G. Puranic , Jyotirmoy V. Deshmukh , Stefanos Nikolaidis

Most current methods for learning from demonstrations assume that those demonstrations alone are sufficient to learn the underlying task. This is often untrue, especially if extra safety specifications exist which were not present in the…

Machine Learning · Computer Science 2020-05-26 Craig Innes , Subramanian Ramamoorthy

This paper considers the problem of learning temporal task specifications, e.g. automata and temporal logic, from expert demonstrations. Task specifications are a class of sparse memory augmented rewards with explicit support for temporal…

Artificial Intelligence · Computer Science 2023-04-25 Marcell Vazquez-Chanlatte , Ameesh Shah , Gil Lederman , Sanjit A. Seshia

Reward engineering is crucial to high performance in reinforcement learning systems. Prior research into reward design has largely focused on Markovian functions representing the reward. While there has been research into expressing…

Robotics · Computer Science 2020-03-03 Ankit Shah , Shen Li , Julie Shah

The principle of maximum entropy is a broadly applicable technique for computing a distribution with the least amount of information possible while constrained to match empirically estimated feature expectations. However, in many real-world…

Machine Learning · Computer Science 2022-08-16 Kenneth Bogert , Yikang Gui , Prashant Doshi

Reinforcement learning is a promising approach for learning control policies for robot tasks. However, specifying complex tasks (e.g., with multiple objectives and safety constraints) can be challenging, since the user must design a reward…

Machine Learning · Computer Science 2020-10-30 Kishor Jothimurugan , Rajeev Alur , Osbert Bastani

Imitation learning (IL) algorithms use expert demonstrations to learn a specific task. Most of the existing approaches assume that all expert demonstrations are reliable and trustworthy, but what if there exist some adversarial…

Machine Learning · Computer Science 2021-01-06 Mostafa Hussein , Brendan Crowe , Marek Petrik , Momotaz Begum

Reward design is a fundamental problem in reinforcement learning (RL). A misspecified or poorly designed reward can result in low sample efficiency and undesired behaviors. In this paper, we propose the idea of programmatic reward design,…

Machine Learning · Computer Science 2022-01-10 Weichao Zhou , Wenchao Li

Imitation learning enables robots to learn from demonstrations. Previous imitation learning algorithms usually assume access to optimal expert demonstrations. However, in many real-world applications, this assumption is limiting. Most…

Machine Learning · Computer Science 2021-03-11 Zhangjie Cao , Dorsa Sadigh

Inverse reinforcement learning (IRL) infers a reward function from demonstrations, allowing for policy improvement and generalization. However, despite much recent interest in IRL, little work has been done to understand the minimum set of…

Machine Learning · Computer Science 2019-08-19 Daniel S. Brown , Scott Niekum

Multi-step manipulation tasks where robots interact with their environment and must apply process forces based on the perceived situation remain challenging to learn and prone to execution errors. Accurately simulating these tasks is also…

Robotics · Computer Science 2025-05-08 Christoph Willibald , Dongheui Lee

Temporal logic specifications play an important role in a wide range of software analysis tasks, such as model checking, automated synthesis, program comprehension, and runtime monitoring. Given a set of positive and negative examples,…

Software Engineering · Computer Science 2025-01-03 Changjian Zhang , Parv Kapoor , Ian Dardik , Leyi Cui , Romulo Meira-Goes , David Garlan , Eunsuk Kang

When an expert operates a perilous dynamic system, ideal constraint information is tacitly contained in their demonstrated trajectories and controls. The likelihood of these demonstrations can be computed, given the system dynamics and task…

Systems and Control · Electrical Eng. & Systems 2021-02-26 David L. McPherson , Kaylene C. Stocking , S. Shankar Sastry
‹ Prev 1 2 3 10 Next ›