Related papers: ReorientDiff: Diffusion Model based Reorientation …

ReorientBot: Learning Object Reorientation for Specific-Posed Placement

Robots need the capability of placing objects in arbitrary, specific poses to rearrange the world and achieve various valuable tasks. Object reorientation plays a crucial role in this as objects may not initially be oriented such that the…

Robotics · Computer Science 2022-02-23 Kentaro Wada, Stephen James, Andrew J. Davison

StructDiffusion: Language-Guided Creation of Physically-Valid Structures using Unseen Objects

Robots operating in human environments must be able to rearrange objects into semantically-meaningful configurations, even if these objects are previously unseen. In this work, we focus on the problem of building physically-valid structures…

Robotics · Computer Science 2023-04-26 Weiyu Liu, Yilun Du, Tucker Hermans, Sonia Chernova, Chris Paxton

Diff9D: Diffusion-Based Domain-Generalized Category-Level 9-DoF Object Pose Estimation

Nine-degrees-of-freedom (9-DoF) object pose and size estimation is crucial for enabling augmented reality and robotic manipulation. Category-level methods have received extensive research attention due to their potential for generalization…

Computer Vision and Pattern Recognition · Computer Science 2025-03-18 Jian Liu, Wei Sun, Hui Yang, Pengchao Deng, Chongpei Liu, Nicu Sebe, Hossein Rahmani, Ajmal Mian

LVDiffusor: Distilling Functional Rearrangement Priors from Large Models into Diffusor

Object rearrangement, a fundamental challenge in robotics, demands versatile strategies to handle diverse objects, configurations, and functional needs. To achieve this, the AI robot needs to learn functional rearrangement priors in order…

Robotics · Computer Science 2024-03-11 Yiming Zeng, Mingdong Wu, Long Yang, Jiyao Zhang, Hao Ding, Hui Cheng, Hao Dong

DefFusionNet: Learning Multimodal Goal Shapes for Deformable Object Manipulation via a Diffusion-based Probabilistic Model

Deformable object manipulation is critical to many real-world robotic applications, ranging from surgical robotics and soft material handling in manufacturing to household tasks like laundry folding. At the core of this important robotic…

Robotics · Computer Science 2025-06-24 Bao Thach, Siyeon Kim, Britton Jordan, Mohanraj Shanthi, Tanner Watts, Shing-Hei Ho, James M. Ferguson, Tucker Hermans, Alan Kuntz

DexHandDiff: Interaction-aware Diffusion Planning for Adaptive Dexterous Manipulation

Dexterous manipulation with contact-rich interactions is crucial for advanced robotics. While recent diffusion-based planning approaches show promise for simple manipulation tasks, they often produce unrealistic ghost states (e.g., the…

Robotics · Computer Science 2025-06-18 Zhixuan Liang, Yao Mu, Yixiao Wang, Tianxing Chen, Wenqi Shao, Wei Zhan, Masayoshi Tomizuka, Ping Luo, Mingyu Ding

A Diffusion-Based Framework for Occluded Object Movement

Seamlessly moving objects within a scene is a common requirement for image editing, but it is still a challenge for existing editing methods. Especially for real-world images, the occlusion situation further increases the difficulty. The…

Computer Vision and Pattern Recognition · Computer Science 2025-04-03 Zheng-Peng Duan, Jiawei Zhang, Siyu Liu, Zheng Lin, Chun-Le Guo, Dongqing Zou, Jimmy Ren, Chongyi Li

ImitDiff: Transferring Foundation-Model Priors for Distraction Robust Visuomotor Policy

Visuomotor imitation learning policies enable robots to efficiently acquire manipulation skills from visual demonstrations. However, as scene complexity and visual distractions increase, policies that perform well in simple settings often…

Artificial Intelligence · Computer Science 2025-11-11 Yuhang Dong, Haizhou Ge, Yupei Zeng, Jiangning Zhang, Beiwen Tian, Hongrui Zhu, Yufei Jia, Ruixiang Wang, Zhucun Xue, Guyue Zhou, Longhua Ma, Guanzhong Tian

Diffusion Models for Robotic Manipulation: A Survey

Diffusion generative models have demonstrated remarkable success in visual domains such as image and video generation. They have also recently emerged as a promising approach in robotics, especially in robot manipulations. Diffusion models…

Robotics · Computer Science 2025-07-15 Rosa Wolf, Yitian Shi, Sheng Liu, Rania Rayyes

PoseDiff: A Unified Diffusion Model Bridging Robot Pose Estimation and Video-to-Action Control

We present PoseDiff, a conditional diffusion model that unifies robot state estimation and control within a single framework. At its core, PoseDiff maps raw visual observations into structured robot states, such as 3D keypoints or joint…

Robotics · Computer Science 2025-11-03 Haozhuo Zhang, Michele Caprio, Jing Shao, Qiang Zhang, Jian Tang, Shanghang Zhang, Wei Pan

RobotDiffuse: Diffusion-Based Motion Planning for Redundant Manipulators with the ROP Obstacle Avoidance Dataset

Redundant manipulators, with their higher Degrees of Freedom (DoFs), offer enhanced kinematic performance and versatility, making them suitable for applications like manufacturing, surgical robotics, and human-robot collaboration. However,…

Robotics · Computer Science 2026-01-07 Xudong Mou, Xiaohan Zhang, Tiejun Wang, Tianyu Wo, Cangbai Xu, Ningbo Gu, Rui Wang, Xudong Liu

Consistent Image Layout Editing with Diffusion Models

Despite the great success of large-scale text-to-image diffusion models in image generation and image editing, existing methods still struggle to edit the layout of real images. Although a few works have been proposed to tackle this…

Computer Vision and Pattern Recognition · Computer Science 2025-03-11 Tao Xia, Yudi Zhang, Ting Liu, Lei Zhang

DiffRetouch: Using Diffusion to Retouch on the Shoulder of Experts

Image retouching aims to enhance the visual quality of photos. Considering the different aesthetic preferences of users, the target of retouching is subjective. However, current retouching methods mostly adopt deterministic models, which…

Computer Vision and Pattern Recognition · Computer Science 2024-07-08 Zheng-Peng Duan, Jiawei Zhang, Zheng Lin, Xin Jin, Dongqing Zou, Chunle Guo, Chongyi Li

Efficient Object Rearrangement via Multi-view Fusion

The prospect of assistive robots aiding in object organization has always been compelling. In an image-goal setting, the robot rearranges the current scene to match the single image captured from the goal scene. The key to an image-goal…

Robotics · Computer Science 2023-09-19 Dehao Huang, Chao Tang, Hong Zhang

Learning Coordinated Bimanual Manipulation Policies using State Diffusion and Inverse Dynamics Models

When performing tasks like laundry, humans naturally coordinate both hands to manipulate objects and anticipate how their actions will change the state of the clothes. However, achieving such coordination in robotics remains challenging due…

Robotics · Computer Science 2025-04-01 Haonan Chen, Jiaming Xu, Lily Sheng, Tianchen Ji, Shuijing Liu, Yunzhu Li, Katherine Driggs-Campbell

Diffusion-Based Imaginative Coordination for Bimanual Manipulation

Bimanual manipulation is crucial in robotics, enabling complex tasks in industrial automation and household services. However, it poses significant challenges due to the high-dimensional action space and intricate coordination requirements.…

Robotics · Computer Science 2025-07-16 Huilin Xu, Jian Ding, Jiakun Xu, Ruixiang Wang, Jun Chen, Jinjie Mai, Yanwei Fu, Bernard Ghanem, Feng Xu, Mohamed Elhoseiny

Non-differentiable Reward Optimization for Diffusion-based Autonomous Motion Planning

Safe and effective motion planning is crucial for autonomous robots. Diffusion models excel at capturing complex agent interactions, a fundamental aspect of decision-making in dynamic environments. Recent studies have successfully applied…

Robotics · Computer Science 2025-07-18 Giwon Lee, Daehee Park, Jaewoo Jeong, Kuk-Jin Yoon

AlignDiff: Learning Physically-Grounded Camera Alignment via Diffusion

Accurate camera calibration is a fundamental task for 3D perception, especially when dealing with real-world, in-the-wild environments where complex optical distortions are common. Existing methods often rely on pre-rectified images or…

Computer Vision and Pattern Recognition · Computer Science 2025-03-28 Liuyue Xie, Jiancong Guo, Ozan Cakmakci, Andre Araujo, Laszlo A. Jeni, Zhiheng Jia

Motion Planning Diffusion: Learning and Planning of Robot Motions with Diffusion Models

Learning priors on trajectory distributions can help accelerate robot motion planning optimization. Given previously successful plans, learning trajectory generative models as priors for a new planning problem is highly desirable. Prior…

Robotics · Computer Science 2024-03-27 Joao Carvalho, An T. Le, Mark Baierl, Dorothea Koert, Jan Peters

ReVersion: Diffusion-Based Relation Inversion from Images

Diffusion models gain increasing popularity for their generative capabilities. Recently, there have been surging needs to generate customized images by inverting diffusion models from exemplar images, and existing inversion methods mainly…

Computer Vision and Pattern Recognition · Computer Science 2024-12-03 Ziqi Huang, Tianxing Wu, Yuming Jiang, Kelvin C. K. Chan, Ziwei Liu