Related papers: Interactive Policy Learning through Confidence-Bas…

Autonomous Self-Explanation of Behavior for Interactive Reinforcement Learning Agents

In cooperation, the workers must know how co-workers behave. However, an agent's policy, which is embedded in a statistical machine learning model, is hard to understand, and requires much time and knowledge to comprehend. Therefore, it is…

Artificial Intelligence · Computer Science 2018-10-23 Yosuke Fukuchi , Masahiko Osawa , Hiroshi Yamakawa , Michita Imai

Watch, Try, Learn: Meta-Learning from Demonstrations and Reward

Imitation learning allows agents to learn complex behaviors from demonstrations. However, learning a complex vision-based task may require an impractical number of demonstrations. Meta-imitation learning is a promising approach towards…

Machine Learning · Computer Science 2020-02-03 Allan Zhou , Eric Jang , Daniel Kappler , Alex Herzog , Mohi Khansari , Paul Wohlhart , Yunfei Bai , Mrinal Kalakrishnan , Sergey Levine , Chelsea Finn

Efficiently Combining Human Demonstrations and Interventions for Safe Training of Autonomous Systems in Real-Time

This paper investigates how to utilize different forms of human interaction to safely train autonomous systems in real-time by learning from both human demonstrations and interventions. We implement two components of the Cycle-of-Learning…

Artificial Intelligence · Computer Science 2018-11-30 Vinicius G. Goecks , Gregory M. Gremillion , Vernon J. Lawhern , John Valasek , Nicholas R. Waytowich

Active Probing and Influencing Human Behaviors Via Autonomous Agents

Autonomous agents (robots) face tremendous challenges while interacting with heterogeneous human agents in close proximity. One of these challenges is that the autonomous agent does not have an accurate model tailored to the specific human…

Robotics · Computer Science 2023-04-25 Shuangge Wang , Yiwei Lyu , John M. Dolan

Confidence-Aware Imitation Learning from Demonstrations with Varying Optimality

Most existing imitation learning approaches assume the demonstrations are drawn from experts who are optimal, but relaxing this assumption enables us to use a wider range of data. Standard imitation learning may learn a suboptimal policy…

Machine Learning · Computer Science 2022-01-27 Songyuan Zhang , Zhangjie Cao , Dorsa Sadigh , Yanan Sui

Learning from Imperfect Demonstrations via Adversarial Confidence Transfer

Existing learning from demonstration algorithms usually assume access to expert demonstrations. However, this assumption is limiting in many real-world applications since the collected demonstrations may be suboptimal or even consist of…

Robotics · Computer Science 2022-03-03 Zhangjie Cao , Zihan Wang , Dorsa Sadigh

Guiding Policies with Language via Meta-Learning

Behavioral skills or policies for autonomous agents are conventionally learned from reward functions, via reinforcement learning, or from demonstrations, via imitation learning. However, both modes of task specification have their…

Machine Learning · Computer Science 2019-01-30 John D. Co-Reyes , Abhishek Gupta , Suvansh Sanjeev , Nick Altieri , Jacob Andreas , John DeNero , Pieter Abbeel , Sergey Levine

Learning from Imperfect Demonstrations from Agents with Varying Dynamics

Imitation learning enables robots to learn from demonstrations. Previous imitation learning algorithms usually assume access to optimal expert demonstrations. However, in many real-world applications, this assumption is limiting. Most…

Machine Learning · Computer Science 2021-03-11 Zhangjie Cao , Dorsa Sadigh

Interactive Learning from Policy-Dependent Human Feedback

This paper investigates the problem of interactively learning behaviors communicated by a human teacher using positive and negative feedback. Much previous work on this problem has made the assumption that people provide feedback for…

Artificial Intelligence · Computer Science 2023-01-31 James MacGlashan , Mark K Ho , Robert Loftin , Bei Peng , Guan Wang , David Roberts , Matthew E. Taylor , Michael L. Littman

Online Continual Learning For Interactive Instruction Following Agents

In learning an embodied agent executing daily tasks via language directives, the literature largely assumes that the agent learns all training data at the beginning. We argue that such a learning scenario is less realistic since a robotic…

Artificial Intelligence · Computer Science 2024-03-14 Byeonghwi Kim , Minhyuk Seo , Jonghyun Choi

Feasibility-aware Imitation Learning from Observations through a Hand-mounted Demonstration Interface

Imitation learning through a demonstration interface is expected to learn policies for robot automation from intuitive human demonstrations. However, due to the differences in human and robot movement characteristics, a human expert might…

Robotics · Computer Science 2025-03-13 Kei Takahashi , Hikaru Sasaki , Takamitsu Matsubara

Confidence-Guided Human-AI Collaboration: Reinforcement Learning with Distributional Proxy Value Propagation for Autonomous Driving

Autonomous driving promises significant advancements in mobility, road safety and traffic efficiency, yet reinforcement learning and imitation learning face safe-exploration and distribution-shift challenges. Although human-AI collaboration…

Robotics · Computer Science 2025-06-06 Li Zeqiao , Wang Yijing , Wang Haoyu , Li Zheng , Li Peng , Zuo zhiqiang , Hu Chuan

ConBaT: Control Barrier Transformer for Safe Policy Learning

Large-scale self-supervised models have recently revolutionized our ability to perform a variety of tasks within the vision and language domains. However, using such models for autonomous systems is challenging because of safety…

Robotics · Computer Science 2023-03-09 Yue Meng , Sai Vemprala , Rogerio Bonatti , Chuchu Fan , Ashish Kapoor

CubeDAgger: Interactive Imitation Learning for Dynamic Systems with Efficient yet Low-risk Interaction

Interactive imitation learning makes an agent's control policy robust by stepwise supervisions from an expert. The recent algorithms mostly employ expert-agent switching systems to reduce the expert's burden by limitedly selecting the…

Robotics · Computer Science 2026-04-23 Taisuke Kobayashi

Learning When to Ask for Help: Efficient Interactive Navigation via Implicit Uncertainty Estimation

Robots operating alongside humans often encounter unfamiliar environments that make autonomous task completion challenging. Though improving models and increasing dataset size can enhance a robot's performance in unseen environments, data…

Robotics · Computer Science 2024-06-10 Ifueko Igbinedion , Sertac Karaman

Bayesian Disturbance Injection: Robust Imitation Learning of Flexible Policies for Robot Manipulation

Humans demonstrate a variety of interesting behavioral characteristics when performing tasks, such as selecting between seemingly equivalent optimal actions, performing recovery actions when deviating from the optimal trajectory, or…

Robotics · Computer Science 2022-11-08 Hanbit Oh , Hikaru Sasaki , Brendan Michael , Takamitsu Matsubara

Formal Policy Learning from Demonstrations for Reachability Properties

We consider the problem of learning structured, closed-loop policies (feedback laws) from demonstrations in order to control under-actuated robotic systems, so that formal behavioral specifications such as reaching a target set of states…

Systems and Control · Computer Science 2019-03-05 Hadi Ravanbakhsh , Sriram Sankaranarayanan , Sanjit A. Seshia

Towards Improving Learning from Demonstration Algorithms via MCMC Methods

Behavioral cloning, or more broadly, learning from demonstrations (LfD) is a priomising direction for robot policy learning in complex scenarios. Albeit being straightforward to implement and data-efficient, behavioral cloning has its own…

Robotics · Computer Science 2024-05-27 Carl Qi , Edward Sun , Harry Zhang

Learning to Seek Evidence: A Verifiable Reasoning Agent with Causal Faithfulness Analysis

Explanations for AI models in high-stakes domains like medicine often lack verifiability, which can hinder trust. To address this, we propose an interactive agent that produces explanations through an auditable sequence of actions. The…

Artificial Intelligence · Computer Science 2025-11-04 Yuhang Huang , Zekai Lin , Fan Zhong , Lei Liu

Behavior-Constrained Reinforcement Learning with Receding-Horizon Credit Assignment for High-Performance Control

Learning high-performance control policies that remain consistent with expert behavior is a fundamental challenge in robotics. Reinforcement learning can discover high-performing strategies but often departs from desirable human behavior,…

Robotics · Computer Science 2026-04-06 Siwei Ju , Jan Tauberschmidt , Oleg Arenz , Peter van Vliet , Jan Peters