Related papers: Learning from Demonstrations using Signal Temporal…

Signal Temporal Logic-Guided Apprenticeship Learning

Apprenticeship learning crucially depends on effectively learning rewards, and hence control policies, from user demonstrations. Of particular difficulty is the setting where the desired task consists of a number of sub-goals with temporal…

Robotics · Computer Science 2023-11-10 Aniruddh G. Puranic, Jyotirmoy V. Deshmukh, Stefanos Nikolaidis

Model-based Reinforcement Learning from Signal Temporal Logic Specifications

Techniques based on Reinforcement Learning (RL) are increasingly being used to design control policies for robotic systems. RL fundamentally relies on state-based reward functions to encode desired behavior of the robot and bad reward…

Robotics · Computer Science 2020-11-11 Parv Kapoor, Anand Balakrishnan, Jyotirmoy V. Deshmukh

Automata Guided Reinforcement Learning With Demonstrations

Tasks with complex temporal structures and long horizons pose a challenge for reinforcement learning agents due to the difficulty in specifying the tasks in terms of reward functions as well as large variances in the learning signals. We…

Artificial Intelligence · Computer Science 2018-09-27 Xiao Li, Yao Ma, Calin Belta

Safety-Critical Learning of Robot Control with Temporal Logic Specifications

Reinforcement learning (RL) is a promising approach. However, its success in real-world applications is limited, because ensuring safe exploration and facilitating adequate exploitation is a challenge for controlling robotic systems with…

Robotics · Computer Science 2022-08-29 Mingyu Cai, Cristian-Ioan Vasile

Maximum Causal Entropy Specification Inference from Demonstrations

In many settings (e.g., robotics) demonstrations provide a natural way to specify tasks; however, most methods for learning from demonstrations either do not provide guarantees that the artifacts learned for the tasks, such as rewards or…

Machine Learning · Computer Science 2020-05-19 Marcell Vazquez-Chanlatte, Sanjit A. Seshia

Elaborating on Learned Demonstrations with Temporal Logic Specifications

Most current methods for learning from demonstrations assume that those demonstrations alone are sufficient to learn the underlying task. This is often untrue, especially if extra safety specifications exist which were not present in the…

Machine Learning · Computer Science 2020-05-26 Craig Innes, Subramanian Ramamoorthy

Reinforcement Learning With Temporal Logic Rewards

Reinforcement learning (RL) depends critically on the choice of reward functions used to capture the desired behavior and constraints of a robot. Usually, these are handcrafted by an expert designer and represent heuristics for relatively…

Artificial Intelligence · Computer Science 2017-03-03 Xiao Li, Cristian-Ioan Vasile, Calin Belta

Learning Temporal Logic Predicates from Data with Statistical Guarantees

Temporal logic rules are often used in control and robotics to provide structured, human-interpretable descriptions of trajectory data. These rules have numerous applications including safety validation using formal methods, constraining…

Machine Learning · Computer Science 2025-04-29 Emi Soroka, Rohan Sinha, Sanjay Lall

Temporal Logic Guided Safe Reinforcement Learning Using Control Barrier Functions

Using reinforcement learning to learn control policies is a challenge when the task is complex with potentially long horizons. Ensuring adequate but safe exploration is also crucial for controlling physical systems. In this paper, we use…

Machine Learning · Computer Science 2019-03-26 Xiao Li, Calin Belta

Robust Satisfaction of Temporal Logic Specifications via Reinforcement Learning

We consider the problem of steering a system with unknown, stochastic dynamics to satisfy a rich, temporally layered task given as a signal temporal logic formula. We represent the system as a Markov decision process in which the states are…

Systems and Control · Computer Science 2015-10-23 Austin Jones, Derya Aksaray, Zhaodan Kong, Mac Schwager, Calin Belta

Temporal Logic Imitation: Learning Plan-Satisficing Motion Policies from Demonstrations

Learning from demonstration (LfD) has succeeded in tasks featuring a long time horizon. However, when the problem complexity also includes human-in-the-loop perturbations, state-of-the-art approaches do not guarantee the successful…

Robotics · Computer Science 2024-12-10 Yanwei Wang, Nadia Figueroa, Shen Li, Ankit Shah, Julie Shah

Learning from Imperfect Demonstrations via Temporal Behavior Tree-Guided Trajectory Repair

Learning robot control policies from demonstrations is a powerful paradigm, yet real-world data is often suboptimal, noisy, or otherwise imperfect, posing significant challenges for imitation and reinforcement learning. In this work, we…

Machine Learning · Computer Science 2026-04-07 Aniruddh G. Puranic, Sebastian Schirmer, John S. Baras, Calin Belta

Gaussian-Process-based Robot Learning from Demonstration

Endowed with higher levels of autonomy, robots are required to perform increasingly complex manipulation tasks. Learning from demonstration is arising as a promising paradigm for transferring skills to robots. It allows to implicitly learn…

Robotics · Computer Science 2023-02-24 Miguel Arduengo, Adrià Colomé, Joan Lobo-Prat, Luis Sentis, Carme Torras

On Tackling Complex Tasks with Reward Machines and Signal Temporal Logics

We propose a Reinforcement Learning (RL) based control design framework for handling complex tasks. The approach extends the concept of Reward Machines (RM) with Signal Temporal Logic (STL) formulas that can be used for event generation.…

Artificial Intelligence · Computer Science 2026-04-17 Ana María Gómez Ruiz, Thao Dang, Alexandre Donzé

Learning Performance Graphs from Demonstrations via Task-Based Evaluations

In the learning from demonstration (LfD) paradigm, understanding and evaluating the demonstrated behaviors plays a critical role in extracting control policies for robots. Without this knowledge, a robot may infer incorrect reward functions…

Robotics · Computer Science 2022-12-20 Aniruddh G. Puranic, Jyotirmoy V. Deshmukh, Stefanos Nikolaidis

Boosting Reinforcement Learning and Planning with Demonstrations: A Survey

Although reinforcement learning has seen tremendous success recently, this kind of trial-and-error learning can be impractical or inefficient in complex environments. The use of demonstrations, on the other hand, enables agents to benefit…

Machine Learning · Computer Science 2023-03-29 Tongzhou Mu, Hao Su

Meta-Inverse Reinforcement Learning with Probabilistic Context Variables

Providing a suitable reward function to reinforcement learning can be difficult in many real world applications. While inverse reinforcement learning (IRL) holds promise for automatically learning reward functions from demonstrations,…

Machine Learning · Computer Science 2019-10-29 Lantao Yu, Tianhe Yu, Chelsea Finn, Stefano Ermon

Optimizing Demonstrated Robot Manipulation Skills for Temporal Logic Constraints

For performing robotic manipulation tasks, the core problem is determining suitable trajectories that fulfill the task requirements. Various approaches to compute such trajectories exist, with learning and optimization being the main driving…

Robotics · Computer Science 2022-09-08 Akshay Dhonthi, Philipp Schillinger, Leonel Rozo, Daniele Nardi

Learning Robot Skills with Temporal Variational Inference

In this paper, we address the discovery of robotic options from demonstrations in an unsupervised manner. Specifically, we present a framework to jointly learn low-level control policies and higher-level policies of how to use them from…

Machine Learning · Computer Science 2020-06-30 Tanmay Shankar, Abhinav Gupta

LTL-Based Non-Markovian Inverse Reinforcement Learning

The successes of reinforcement learning in recent years are underpinned by the characterization of suitable reward functions. However, in settings where such rewards are non-intuitive, difficult to define, or otherwise error-prone in their…

Formal Languages and Automata Theory · Computer Science 2023-03-02 Mohammad Afzal, Sankalp Gambhir, Ashutosh Gupta, Krishna S, Ashutosh Trivedi, Alvaro Velasquez