Related papers: Implicit Behavioral Cloning

Towards Improving Learning from Demonstration Algorithms via MCMC Methods

Behavioral cloning, or more broadly, learning from demonstrations (LfD) is a priomising direction for robot policy learning in complex scenarios. Albeit being straightforward to implement and data-efficient, behavioral cloning has its own…

Robotics · Computer Science 2024-05-27 Carl Qi , Edward Sun , Harry Zhang

Revisiting Energy Based Models as Policies: Ranking Noise Contrastive Estimation and Interpolating Energy Models

A crucial design decision for any robot learning pipeline is the choice of policy representation: what type of model should be used to generate the next set of robot actions? Owing to the inherent multi-modal nature of many robotic tasks,…

Robotics · Computer Science 2023-09-13 Sumeet Singh , Stephen Tu , Vikas Sindhwani

Controlling Steering with Energy-Based Models

So-called implicit behavioral cloning with energy-based models has shown promising results in robotic manipulation tasks. We tested if the method's advantages carry on to controlling the steering of a real self-driving car with an…

Robotics · Computer Science 2023-06-27 Mikita Balesni , Ardi Tampuu , Tambet Matiisen

Conditional Energy-Based Models for Implicit Policies: The Gap between Theory and Practice

We present our findings in the gap between theory and practice of using conditional energy-based models (EBM) as an implicit representation for behavior-cloned policies. We also clarify several subtle, and potentially confusing, details in…

Robotics · Computer Science 2022-07-14 Duy-Nguyen Ta , Eric Cousineau , Huihua Zhao , Siyuan Feng

When Should We Prefer Offline Reinforcement Learning Over Behavioral Cloning?

Offline reinforcement learning (RL) algorithms can acquire effective policies by utilizing previously collected experience, without any online interaction. It is widely understood that offline RL is able to extract good policies even from…

Machine Learning · Computer Science 2022-04-13 Aviral Kumar , Joey Hong , Anikait Singh , Sergey Levine

Know Your Boundaries: The Necessity of Explicit Behavioral Cloning in Offline RL

We introduce an offline reinforcement learning (RL) algorithm that explicitly clones a behavior policy to constrain value learning. In offline RL, it is often important to prevent a policy from selecting unobserved actions, since the…

Machine Learning · Computer Science 2022-06-03 Wonjoon Goo , Scott Niekum

Improving Behavioural Cloning with Positive Unlabeled Learning

Learning control policies offline from pre-recorded datasets is a promising avenue for solving challenging real-world problems. However, available datasets are typically of mixed quality, with a limited number of the trajectories that we…

Machine Learning · Computer Science 2023-09-22 Qiang Wang , Robert McCarthy , David Cordova Bulens , Kevin McGuinness , Noel E. O'Connor , Nico Gürtler , Felix Widmaier , Francisco Roldan Sanchez , Stephen J. Redmond

Coherent Soft Imitation Learning

Imitation learning methods seek to learn from an expert either through behavioral cloning (BC) of the policy or inverse reinforcement learning (IRL) of the reward. Such methods enable agents to learn complex tasks from humans that are…

Machine Learning · Computer Science 2023-12-07 Joe Watson , Sandy H. Huang , Nicolas Heess

Implicit Offline Reinforcement Learning via Supervised Learning

Offline Reinforcement Learning (RL) via Supervised Learning is a simple and effective way to learn robotic skills from a dataset collected by policies of different expertise levels. It is as simple as supervised learning and Behavior…

Machine Learning · Statistics 2022-10-25 Alexandre Piche , Rafael Pardinas , David Vazquez , Igor Mordatch , Chris Pal

Keep Doing What Worked: Behavioral Modelling Priors for Offline Reinforcement Learning

Off-policy reinforcement learning algorithms promise to be applicable in settings where only a fixed data-set (batch) of environment interactions is available and no new experience can be acquired. This property makes these algorithms…

Machine Learning · Computer Science 2020-06-18 Noah Y. Siegel , Jost Tobias Springenberg , Felix Berkenkamp , Abbas Abdolmaleki , Michael Neunert , Thomas Lampe , Roland Hafner , Nicolas Heess , Martin Riedmiller

Exploring multimodal implicit behavior learning for vehicle navigation in simulated cities

Standard Behavior Cloning (BC) fails to learn multimodal driving decisions, where multiple valid actions exist for the same scenario. We explore Implicit Behavioral Cloning (IBC) with Energy-Based Models (EBMs) to better capture this…

Machine Learning · Computer Science 2025-09-22 Eric Aislan Antonelo , Gustavo Claudio Karl Couto , Christian Möller

Latent Policy Steering with Embodiment-Agnostic Pretrained World Models

The performance of learned robot visuomotor policies is heavily dependent on the size and quality of the training dataset. Although large-scale robot and human datasets are increasingly available, embodiment gaps and mismatched action…

Robotics · Computer Science 2026-03-24 Yiqi Wang , Mrinal Verghese , Jeff Schneider

Error Bounds of Imitating Policies and Environments

Imitation learning trains a policy by mimicking expert demonstrations. Various imitation methods were proposed and empirically evaluated, meanwhile, their theoretical understanding needs further studies. In this paper, we firstly analyze…

Machine Learning · Computer Science 2020-10-23 Tian Xu , Ziniu Li , Yang Yu

Model-based Behavioral Cloning with Future Image Similarity Learning

We present a visual imitation learning framework that enables learning of robot action policies solely based on expert samples without any robot trials. Robot exploration and on-policy trials in a real-world environment could often be…

Robotics · Computer Science 2019-10-09 Alan Wu , AJ Piergiovanni , Michael S. Ryoo

IMLE Policy: Fast and Sample Efficient Visuomotor Policy Learning via Implicit Maximum Likelihood Estimation

Recent advances in imitation learning, particularly using generative modelling techniques like diffusion, have enabled policies to capture complex multi-modal action distributions. However, these methods often require large datasets and…

Robotics · Computer Science 2025-03-12 Krishan Rana , Robert Lee , David Pershouse , Niko Suenderhauf

Learning Transparent Reward Models via Unsupervised Feature Selection

In complex real-world tasks such as robotic manipulation and autonomous driving, collecting expert demonstrations is often more straightforward than specifying precise learning objectives and task descriptions. Learning from expert data can…

Robotics · Computer Science 2025-05-05 Daulet Baimukashev , Gokhan Alcan , Kevin Sebastian Luck , Ville Kyrki

Model Predictive Control via On-Policy Imitation Learning

In this paper, we leverage the rapid advances in imitation learning, a topic of intense recent focus in the Reinforcement Learning (RL) literature, to develop new sample complexity results and performance guarantees for data-driven Model…

Optimization and Control · Mathematics 2022-10-18 Kwangjun Ahn , Zakaria Mhammedi , Horia Mania , Zhang-Wei Hong , Ali Jadbabaie

The Extrapolation Power of Implicit Models

In this paper, we investigate the extrapolation capabilities of implicit deep learning models in handling unobserved data, where traditional deep neural networks may falter. Implicit models, distinguished by their adaptability in layer…

Machine Learning · Computer Science 2024-07-22 Juliette Decugis , Alicia Y. Tsai , Max Emerling , Ashwin Ganesh , Laurent El Ghaoui

Efficient Preference-Based Reinforcement Learning Using Learned Dynamics Models

Preference-based reinforcement learning (PbRL) can enable robots to learn to perform tasks based on an individual's preferences without requiring a hand-crafted reward function. However, existing approaches either assume access to a…

Machine Learning · Computer Science 2024-02-13 Yi Liu , Gaurav Datta , Ellen Novoseller , Daniel S. Brown

Imitating Unknown Policies via Exploration

Behavioral cloning is an imitation learning technique that teaches an agent how to behave through expert demonstrations. Recent approaches use self-supervision of fully-observable unlabeled snapshots of the states to decode state-pairs into…

Machine Learning · Computer Science 2020-08-14 Nathan Gavenski , Juarez Monteiro , Roger Granada , Felipe Meneguzzi , Rodrigo C. Barros