Related papers: Diffusion-based Virtual Fixtures

DemoDiffusion: One-Shot Human Imitation using pre-trained Diffusion Policy

We propose DemoDiffusion, a simple method for enabling robots to perform manipulation tasks by imitating a single human demonstration, without requiring task-specific training or paired human-robot data. Our approach is based on two…

Robotics · Computer Science 2026-03-10 Sungjae Park , Homanga Bharadhwaj , Shubham Tulsiani

A Unified Framework for Probabilistic Dynamic-, Trajectory- and Vision-based Virtual Fixtures

Probabilistic Virtual Fixtures (VFs) enable the adaptive selection of the most suitable haptic feedback for each phase of a task, based on learned or perceived uncertainty. While keeping the human in the loop remains essential, for…

Robotics · Computer Science 2025-12-02 Maximilian Mühlbauer , Bernhard Weber , Sylvain Calinon , Freek Stulp , Alin Albu-Schäffer , João Silvério

Generating Stable Placements via Physics-guided Diffusion Models

Stably placing an object in a multi-object scene is a fundamental challenge in robotic manipulation, as placements must be penetration-free, establish precise surface contact, and result in a force equilibrium. To assess stability, existing…

Robotics · Computer Science 2025-09-29 Philippe Nadeau , Miguel Rogel , Ivan Bilić , Ivan Petrović , Jonathan Kelly

SimDiff: Simulator-constrained Diffusion Model for Physically Plausible Motion Generation

Generating physically plausible human motion is crucial for applications such as character animation and virtual reality. Existing approaches often incorporate a simulator-based motion projection layer to the diffusion process to enforce…

Computer Vision and Pattern Recognition · Computer Science 2025-09-26 Akihisa Watanabe , Jiawei Ren , Li Siyao , Yichen Peng , Erwin Wu , Edgar Simo-Serra

Multimodal Diffusion Forcing for Forceful Manipulation

Given a dataset of expert trajectories, standard imitation learning approaches typically learn a direct mapping from observations (e.g., RGB images) to actions. However, such methods often overlook the rich interplay between different…

Robotics · Computer Science 2026-04-14 Zixuan Huang , Huaidian Hou , Dmitry Berenson

Robot Shape and Location Retention in Video Generation Using Diffusion Models

Diffusion models have marked a significant milestone in the enhancement of image and video generation technologies. However, generating videos that precisely retain the shape and location of moving objects such as robots remains a…

Robotics · Computer Science 2024-07-04 Peng Wang , Zhihao Guo , Abdul Latheef Sait , Minh Huy Pham

X-Diffusion: Training Diffusion Policies on Cross-Embodiment Human Demonstrations

Human videos are a scalable source of training data for robot learning. However, humans and robots significantly differ in embodiment, making many human actions infeasible for direct execution on a robot. Still, these demonstrations convey…

Robotics · Computer Science 2026-04-16 Maximus A. Pace , Prithwish Dan , Chuanruo Ning , Atiksh Bhardwaj , Audrey Du , Edward W. Duan , Wei-Chiu Ma , Kushal Kedia

VLM-TDP: VLM-guided Trajectory-conditioned Diffusion Policy for Robust Long-Horizon Manipulation

Diffusion policy has demonstrated promising performance in the field of robotic manipulation. However, its effectiveness has been primarily limited in short-horizon tasks, and its performance significantly degrades in the presence of image…

Robotics · Computer Science 2025-07-08 Kefeng Huang , Tingguang Li , Yuzhen Liu , Zhe Zhang , Jiankun Wang , Lei Han

Dreamix: Video Diffusion Models are General Video Editors

Text-driven image and video diffusion models have recently achieved unprecedented generation realism. While diffusion models have been successfully applied for image editing, very few works have done so for video editing. We present the…

Computer Vision and Pattern Recognition · Computer Science 2023-02-03 Eyal Molad , Eliahu Horwitz , Dani Valevski , Alex Rav Acha , Yossi Matias , Yael Pritch , Yaniv Leviathan , Yedid Hoshen

Envision: Embodied Visual Planning via Goal-Imagery Video Diffusion

Embodied visual planning aims to enable manipulation tasks by imagining how a scene evolves toward a desired goal and using the imagined trajectories to guide actions. Video diffusion models, through their image-to-video generation…

Computer Vision and Pattern Recognition · Computer Science 2025-12-30 Yuming Gu , Yizhi Wang , Yining Hong , Yipeng Gao , Hao Jiang , Angtian Wang , Bo Liu , Nathaniel S. Dennler , Zhengfei Kuang , Hao Li , Gordon Wetzstein , Chongyang Ma

Diffusion Models for Robotic Manipulation: A Survey

Diffusion generative models have demonstrated remarkable success in visual domains such as image and video generation. They have also recently emerged as a promising approach in robotics, especially in robot manipulations. Diffusion models…

Robotics · Computer Science 2025-07-15 Rosa Wolf , Yitian Shi , Sheng Liu , Rania Rayyes

Taming Diffusion Models for Audio-Driven Co-Speech Gesture Generation

Animating virtual avatars to make co-speech gestures facilitates various applications in human-machine interaction. The existing methods mainly rely on generative adversarial networks (GANs), which typically suffer from notorious mode…

Computer Vision and Pattern Recognition · Computer Science 2023-03-21 Lingting Zhu , Xian Liu , Xuanyu Liu , Rui Qian , Ziwei Liu , Lequan Yu

StableVideo: Text-driven Consistency-aware Diffusion Video Editing

Diffusion-based methods can generate realistic images and videos, but they struggle to edit existing objects in a video while preserving their appearance over time. This prevents diffusion models from being applied to natural video editing…

Computer Vision and Pattern Recognition · Computer Science 2023-08-21 Wenhao Chai , Xun Guo , Gaoang Wang , Yan Lu

ReorientDiff: Diffusion Model based Reorientation for Object Manipulation

The ability to manipulate objects in a desired configurations is a fundamental requirement for robots to complete various practical applications. While certain goals can be achieved by picking and placing the objects of interest directly,…

Robotics · Computer Science 2023-09-18 Utkarsh A. Mishra , Yongxin Chen

Diffusion Policy: Visuomotor Policy Learning via Action Diffusion

This paper introduces Diffusion Policy, a new way of generating robot behavior by representing a robot's visuomotor policy as a conditional denoising diffusion process. We benchmark Diffusion Policy across 12 different tasks from 4…

Robotics · Computer Science 2024-03-15 Cheng Chi , Zhenjia Xu , Siyuan Feng , Eric Cousineau , Yilun Du , Benjamin Burchfiel , Russ Tedrake , Shuran Song

Learning Diffusion Policies from Demonstrations For Compliant Contact-rich Manipulation

Robots hold great promise for performing repetitive or hazardous tasks, but achieving human-like dexterity, especially in contact-rich and dynamic environments, remains challenging. Rigid robots, which rely on position or velocity control,…

Robotics · Computer Science 2024-10-28 Malek Aburub , Cristian C. Beltran-Hernandez , Tatsuya Kamijo , Masashi Hamaya

Multi-Robot Control Using Time-Varying Density Functions

This paper presents an approach to externally influencing a team of robots by means of time-varying density functions. These density functions represent rough references for where the robots should be located. To this end, a continuous-time…

Optimization and Control · Mathematics 2014-04-02 Sung G. Lee , Magnus Egerstedt

EL3DD: Extended Latent 3D Diffusion for Language Conditioned Multitask Manipulation

Acting in human environments is a crucial capability for general-purpose robots, necessitating a robust understanding of natural language and its application to physical tasks. This paper seeks to harness the capabilities of diffusion…

Robotics · Computer Science 2026-04-28 Jonas Bode , Raphael Memmesheimer , Sven Behnke

3D Flow Diffusion Policy: Visuomotor Policy Learning via Generating Flow in 3D Space

Learning robust visuomotor policies that generalize across diverse objects and interaction dynamics remains a central challenge in robotic manipulation. Most existing approaches rely on direct observation-to-action mappings or compress…

Robotics · Computer Science 2025-09-24 Sangjun Noh , Dongwoo Nam , Kangmin Kim , Geonhyup Lee , Yeonguk Yu , Raeyoung Kang , Kyoobin Lee

AffordDP: Generalizable Diffusion Policy with Transferable Affordance

Diffusion-based policies have shown impressive performance in robotic manipulation tasks while struggling with out-of-domain distributions. Recent efforts attempted to enhance generalization by improving the visual feature encoding for…

Robotics · Computer Science 2025-03-21 Shijie Wu , Yihang Zhu , Yunao Huang , Kaizhen Zhu , Jiayuan Gu , Jingyi Yu , Ye Shi , Jingya Wang