Related papers: Learning Shared Safety Constraints from Multi-task…

Learning Soft Constraints From Constrained Expert Demonstrations

Inverse reinforcement learning (IRL) methods assume that the expert data is generated by an agent optimizing some reward function. However, in many settings, the agent may optimize a reward function subject to some constraints, where the…

Machine Learning · Computer Science 2023-05-01 Ashish Gaurav , Kasra Rezaee , Guiliang Liu , Pascal Poupart

DIAL: Distribution-Informed Adaptive Learning of Multi-Task Constraints for Safety-Critical Systems

Safe reinforcement learning has traditionally relied on predefined constraint functions to ensure safety in complex real-world tasks, such as autonomous driving. However, defining these functions accurately for varied tasks is a persistent…

Machine Learning · Computer Science 2025-01-31 Se-Wook Yoo , Seung-Woo Seo

Confidence Aware Inverse Constrained Reinforcement Learning

In coming up with solutions to real-world problems, humans implicitly adhere to constraints that are too numerous and complex to be specified completely. However, reinforcement learning (RL) agents need these constraints to learn the…

Machine Learning · Computer Science 2024-06-25 Sriram Ganapathi Subramanian , Guiliang Liu , Mohammed Elmahgiubi , Kasra Rezaee , Pascal Poupart

Reinforcement Learning with Convex Constraints

In standard reinforcement learning (RL), a learning agent seeks to optimize the overall reward. However, many key aspects of a desired behavior are more naturally expressed as constraints. For instance, the designer may want to limit the…

Machine Learning · Computer Science 2021-01-29 Sobhan Miryoosefi , Kianté Brantley , Hal Daumé , Miroslav Dudik , Robert Schapire

Meta-Inverse Reinforcement Learning with Probabilistic Context Variables

Providing a suitable reward function to reinforcement learning can be difficult in many real world applications. While inverse reinforcement learning (IRL) holds promise for automatically learning reward functions from demonstrations,…

Machine Learning · Computer Science 2019-10-29 Lantao Yu , Tianhe Yu , Chelsea Finn , Stefano Ermon

Learning Constraints from Demonstrations

We extend the learning from demonstration paradigm by providing a method for learning unknown constraints shared across tasks, using demonstrations of the tasks, their cost functions, and knowledge of the system dynamics and control…

Robotics · Computer Science 2019-02-22 Glen Chou , Dmitry Berenson , Necmiye Ozay

Constrained Meta Reinforcement Learning with Provable Test-Time Safety

Meta reinforcement learning (RL) allows agents to leverage experience across a distribution of tasks on which the agent can train at will, enabling faster learning of optimal policies on new test tasks. Despite its success in improving…

Machine Learning · Computer Science 2026-05-27 Tingting Ni , Maryam Kamgarpour

Learning Variable Impedance Control via Inverse Reinforcement Learning for Force-Related Tasks

Many manipulation tasks require robots to interact with unknown environments. In such applications, the ability to adapt the impedance according to different task phases and environment constraints is crucial for safety and performance.…

Robotics · Computer Science 2021-02-16 Xiang Zhang , Liting Sun , Zhian Kuang , Masayoshi Tomizuka

Inverse Constrained Reinforcement Learning

In real world settings, numerous constraints are present which are hard to specify mathematically. However, for the real world deployment of reinforcement learning (RL), it is critical that RL agents are aware of these constraints, so that…

Machine Learning · Computer Science 2021-05-24 Usman Anwar , Shehryar Malik , Alireza Aghasi , Ali Ahmed

Constrained Exploration and Recovery from Experience Shaping

We consider the problem of reinforcement learning under safety requirements, in which an agent is trained to complete a given task, typically formalized as the maximization of a reward signal over time, while concurrently avoiding…

Machine Learning · Computer Science 2018-09-25 Tu-Hoa Pham , Giovanni De Magistris , Don Joven Agravante , Subhajit Chaudhury , Asim Munawar , Ryuki Tachibana

Benchmarking Constraint Inference in Inverse Reinforcement Learning

When deploying Reinforcement Learning (RL) agents into a physical system, we must ensure that these agents are well aware of the underlying constraints. In many real-world problems, however, the constraints are often hard to specify…

Machine Learning · Computer Science 2023-03-03 Guiliang Liu , Yudong Luo , Ashish Gaurav , Kasra Rezaee , Pascal Poupart

Constrained-Space Optimization and Reinforcement Learning for Complex Tasks

Learning from Demonstration is increasingly used for transferring operator manipulation skills to robots. In practice, it is important to cater for limited data and imperfect human demonstrations, as well as underlying safety constraints.…

Robotics · Computer Science 2020-04-03 Ya-Yen Tsai , Bo Xiao , Edward Johns , Guang-Zhong Yang

Learning a Prior over Intent via Meta-Inverse Reinforcement Learning

A significant challenge for the practical application of reinforcement learning in the real world is the need to specify an oracle reward function that correctly defines a task. Inverse reinforcement learning (IRL) seeks to avoid this…

Machine Learning · Computer Science 2019-10-16 Kelvin Xu , Ellis Ratner , Anca Dragan , Sergey Levine , Chelsea Finn

Safe Reinforcement Learning on the Constraint Manifold: Theory and Applications

Integrating learning-based techniques, especially reinforcement learning, into robotics is promising for solving complex problems in unstructured environments. However, most existing approaches are trained in well-tuned simulators and…

Robotics · Computer Science 2024-11-07 Puze Liu , Haitham Bou-Ammar , Jan Peters , Davide Tateo

Learning Safety Constraints from Demonstrations with Unknown Rewards

We propose Convex Constraint Learning for Reinforcement Learning (CoCoRL), a novel approach for inferring shared constraints in a Constrained Markov Decision Process (CMDP) from a set of safe demonstrations with possibly different reward…

Machine Learning · Computer Science 2024-03-05 David Lindner , Xin Chen , Sebastian Tschiatschek , Katja Hofmann , Andreas Krause

Online Learning of Human Constraints from Feedback in Shared Autonomy

Real-time collaboration with humans poses challenges due to the different behavior patterns of humans resulting from diverse physical constraints. Existing works typically focus on learning safety constraints for collaboration, or how to…

Robotics · Computer Science 2024-03-06 Shibei Zhu , Tran Nguyen Le , Samuel Kaski , Ville Kyrki

Making Human-Like Trade-offs in Constrained Environments by Learning from Demonstrations

Many real-life scenarios require humans to make difficult trade-offs: do we always follow all the traffic rules or do we violate the speed limit in an emergency? These scenarios force us to evaluate the trade-off between collective norms…

Artificial Intelligence · Computer Science 2021-09-24 Arie Glazier , Andrea Loreggia , Nicholas Mattei , Taher Rahgooy , Francesca Rossi , K. Brent Venable

Meta Inverse Reinforcement Learning via Maximum Reward Sharing for Human Motion Analysis

This work handles the inverse reinforcement learning (IRL) problem where only a small number of demonstrations are available from a demonstrator for each high-dimensional task, insufficient to estimate an accurate reward function. Observing…

Artificial Intelligence · Computer Science 2017-10-16 Kun Li , Joel W. Burdick

Active Learning for Risk-Sensitive Inverse Reinforcement Learning

One typical assumption in inverse reinforcement learning (IRL) is that human experts act to optimize the expected utility of a stochastic cost with a fixed distribution. This assumption deviates from actual human behaviors under ambiguity.…

Machine Learning · Computer Science 2019-09-25 Rui Chen , Wenshuo Wang , Zirui Zhao , Ding Zhao

Learning Behavioral Soft Constraints from Demonstrations

Many real-life scenarios require humans to make difficult trade-offs: do we always follow all the traffic rules or do we violate the speed limit in an emergency? These scenarios force us to evaluate the trade-off between collective rules…

Machine Learning · Computer Science 2022-02-22 Arie Glazier , Andrea Loreggia , Nicholas Mattei , Taher Rahgooy , Francesca Rossi , Brent Venable