Related papers: DiffClone: Enhanced Behaviour Cloning in Robotics …

Integrating Diffusion-based Multi-task Learning with Online Reinforcement Learning for Robust Quadruped Robot Control

Recent research has highlighted the powerful capabilities of imitation learning in robotics. Leveraging generative models, particularly diffusion models, these approaches offer notable advantages such as strong multi-task generalization,…

Robotics · Computer Science 2025-09-15 Xinyao Qin , Xiaoteng Ma , Yang Qi , Qihan Liu , Chuanyi Xue , Ning Gui , Qinyu Dong , Jun Yang , Bin Liang

DiffuseLoco: Real-Time Legged Locomotion Control with Diffusion from Offline Datasets

This work introduces DiffuseLoco, a framework for training multi-skill diffusion-based policies for dynamic legged locomotion from offline datasets, enabling real-time control of diverse skills on robots in the real world. Offline learning…

Robotics · Computer Science 2024-05-01 Xiaoyu Huang , Yufeng Chi , Ruofeng Wang , Zhongyu Li , Xue Bin Peng , Sophia Shao , Borivoje Nikolic , Koushil Sreenath

Adaptive Behavior Cloning Regularization for Stable Offline-to-Online Reinforcement Learning

Offline reinforcement learning, by learning from a fixed dataset, makes it possible to learn agent behaviors without interacting with the environment. However, depending on the quality of the offline dataset, such pre-trained agents may…

Machine Learning · Computer Science 2022-10-26 Yi Zhao , Rinu Boney , Alexander Ilin , Juho Kannala , Joni Pajarinen

DiffPoGAN: Diffusion Policies with Generative Adversarial Networks for Offline Reinforcement Learning

Offline reinforcement learning (RL) can learn optimal policies from pre-collected offline datasets without interacting with the environment, but the sampled actions of the agent cannot often cover the action distribution under a given…

Machine Learning · Computer Science 2024-06-14 Xuemin Hu , Shen Li , Yingfen Xu , Bo Tang , Long Chen

Dynamic Rank Adjustment in Diffusion Policies for Efficient and Flexible Training

Diffusion policies trained via offline behavioral cloning have recently gained traction in robotic motion generation. While effective, these policies typically require a large number of trainable parameters. This model size affords powerful…

Robotics · Computer Science 2025-04-29 Xiatao Sun , Shuo Yang , Yinxing Chen , Francis Fan , Yiyan Liang , Daniel Rakita

Train Offline, Test Online: A Real Robot Learning Benchmark

Three challenges limit the progress of robot learning research: robots are expensive (few labs can participate), everyone uses different robots (findings do not generalize across labs), and we lack internet-scale robotics data. We take on…

Robotics · Computer Science 2023-07-04 Gaoyue Zhou , Victoria Dean , Mohan Kumar Srirama , Aravind Rajeswaran , Jyothish Pari , Kyle Hatch , Aryan Jain , Tianhe Yu , Pieter Abbeel , Lerrel Pinto , Chelsea Finn , Abhinav Gupta

Improving Behavioural Cloning with Positive Unlabeled Learning

Learning control policies offline from pre-recorded datasets is a promising avenue for solving challenging real-world problems. However, available datasets are typically of mixed quality, with a limited number of the trajectories that we…

Machine Learning · Computer Science 2023-09-22 Qiang Wang , Robert McCarthy , David Cordova Bulens , Kevin McGuinness , Noel E. O'Connor , Nico Gürtler , Felix Widmaier , Francisco Roldan Sanchez , Stephen J. Redmond

Diffusion Policy Policy Optimization

We introduce Diffusion Policy Policy Optimization, DPPO, an algorithmic framework including best practices for fine-tuning diffusion-based policies (e.g. Diffusion Policy) in continuous control and robot learning tasks using the policy…

Robotics · Computer Science 2024-12-11 Allen Z. Ren , Justin Lidard , Lars L. Ankile , Anthony Simeonov , Pulkit Agrawal , Anirudha Majumdar , Benjamin Burchfiel , Hongkai Dai , Max Simchowitz

Enabling Stateful Behaviors for Diffusion-based Policy Learning

While imitation learning provides a simple and effective framework for policy learning, acquiring consistent actions during robot execution remains a challenging task. Existing approaches primarily focus on either modifying the action…

Robotics · Computer Science 2024-07-24 Xiao Liu , Fabian Weigend , Yifan Zhou , Heni Ben Amor

Know Your Boundaries: The Necessity of Explicit Behavioral Cloning in Offline RL

We introduce an offline reinforcement learning (RL) algorithm that explicitly clones a behavior policy to constrain value learning. In offline RL, it is often important to prevent a policy from selecting unobserved actions, since the…

Machine Learning · Computer Science 2022-06-03 Wonjoon Goo , Scott Niekum

A Closer Look at Advantage-Filtered Behavioral Cloning in High-Noise Datasets

Recent Offline Reinforcement Learning methods have succeeded in learning high-performance policies from fixed datasets of experience. A particularly effective approach learns to first identify and then mimic optimal decision-making…

Machine Learning · Computer Science 2023-12-12 Jake Grigsby , Yanjun Qi

RoLD: Robot Latent Diffusion for Multi-task Policy Modeling

Modeling generalized robot control policies poses ongoing challenges for language-guided robot manipulation tasks. Existing methods often struggle to efficiently utilize cross-dataset resources or rely on resource-intensive vision-language…

Robotics · Computer Science 2024-11-05 Wenhui Tan , Bei Liu , Junbo Zhang , Ruihua Song , Jianlong Fu

Steering Your Diffusion Policy with Latent Space Reinforcement Learning

Robotic control policies learned from human demonstrations have achieved impressive results in many real-world applications. However, in scenarios where initial performance is not satisfactory, as is often the case in novel open-world…

Robotics · Computer Science 2025-06-27 Andrew Wagenmaker , Mitsuhiko Nakamoto , Yunchu Zhang , Seohong Park , Waleed Yagoub , Anusha Nagabandi , Abhishek Gupta , Sergey Levine

Identifying Expert Behavior in Offline Training Datasets Improves Behavioral Cloning of Robotic Manipulation Policies

This paper presents our solution for the Real Robot Challenge (RRC) III, a competition featured in the NeurIPS 2022 Competition Track, aimed at addressing dexterous robotic manipulation tasks through learning from pre-collected offline…

Robotics · Computer Science 2023-09-22 Qiang Wang , Robert McCarthy , David Cordova Bulens , Francisco Roldan Sanchez , Kevin McGuinness , Noel E. O'Connor , Stephen J. Redmond

Evaluation-Time Policy Switching for Offline Reinforcement Learning

Offline reinforcement learning (RL) looks at learning how to optimally solve tasks using a fixed dataset of interactions from the environment. Many off-policy algorithms developed for online learning struggle in the offline setting as they…

Machine Learning · Computer Science 2025-03-18 Natinael Solomon Neggatu , Jeremie Houssineau , Giovanni Montana

Adaptive Diffusion Policy Optimization for Robotic Manipulation

Recent studies have shown the great potential of diffusion models in improving reinforcement learning (RL) by modeling complex policies, expressing a high degree of multi-modality, and efficiently handling high-dimensional continuous…

Robotics · Computer Science 2025-05-14 Huiyun Jiang , Zhuang Yang

Improving TD3-BC: Relaxed Policy Constraint for Offline Learning and Stable Online Fine-Tuning

The ability to discover optimal behaviour from fixed data sets has the potential to transfer the successes of reinforcement learning (RL) to domains where data collection is acutely problematic. In this offline setting, a key challenge is…

Machine Learning · Computer Science 2022-11-23 Alex Beeson , Giovanni Montana

DiffStitch: Boosting Offline Reinforcement Learning with Diffusion-based Trajectory Stitching

In offline reinforcement learning (RL), the performance of the learned policy highly depends on the quality of offline datasets. However, in many cases, the offline dataset contains very limited optimal trajectories, which poses a challenge…

Machine Learning · Computer Science 2024-02-23 Guanghe Li , Yixiang Shan , Zhengbang Zhu , Ting Long , Weinan Zhang

Beyond Imitation: Reinforcement Learning Fine-Tuning for Adaptive Diffusion Navigation Policies

Diffusion-based robot navigation policies trained on large-scale imitation learning datasets, can generate multi-modal trajectories directly from the robot's visual observations, bypassing the traditional localization-mapping-planning…

Robotics · Computer Science 2026-03-16 Junhe Sheng , Ruofei Bai , Kuan Xu , Ruimeng Liu , Jie Chen , Shenghai Yuan , Wei-Yun Yau , Lihua Xie

DiWA: Diffusion Policy Adaptation with World Models

Fine-tuning diffusion policies with reinforcement learning (RL) presents significant challenges. The long denoising sequence for each action prediction impedes effective reward propagation. Moreover, standard RL methods require millions of…

Robotics · Computer Science 2025-08-06 Akshay L Chandra , Iman Nematollahi , Chenguang Huang , Tim Welschehold , Wolfram Burgard , Abhinav Valada