Related papers: HandDiffuse: Generative Controllers for Two-Hand I…

InterHandGen: Two-Hand Interaction Generation via Cascaded Reverse Diffusion

We present InterHandGen, a novel framework that learns the generative prior of two-hand interaction. Sampling from our model yields plausible and diverse two-hand shapes in close interaction with or without an object. Our prior can be…

Computer Vision and Pattern Recognition · Computer Science 2024-03-27 Jihyun Lee , Shunsuke Saito , Giljoo Nam , Minhyuk Sung , Tae-Kyun Kim

InterGen: Diffusion-based Multi-human Motion Generation under Complex Interactions

We have recently seen tremendous progress in diffusion advances for generating realistic human motions. Yet, they largely disregard the multi-human interactions. In this paper, we present InterGen, an effective diffusion-based approach that…

Computer Vision and Pattern Recognition · Computer Science 2024-03-29 Han Liang , Wenqian Zhang , Wenxuan Li , Jingyi Yu , Lan Xu

DiffH2O: Diffusion-Based Synthesis of Hand-Object Interactions from Textual Descriptions

Generating natural hand-object interactions in 3D is challenging as the resulting hand and object motions are expected to be physically plausible and semantically meaningful. Furthermore, generalization to unseen objects is hindered by the…

Computer Vision and Pattern Recognition · Computer Science 2024-12-24 Sammy Christen , Shreyas Hampali , Fadime Sener , Edoardo Remelli , Tomas Hodan , Eric Sauser , Shugao Ma , Bugra Tekin

HanDiffuser: Text-to-Image Generation With Realistic Hand Appearances

Text-to-image generative models can generate high-quality humans, but realism is lost when generating hands. Common artifacts include irregular hand poses, shapes, incorrect numbers of fingers, and physically implausible finger…

Computer Vision and Pattern Recognition · Computer Science 2024-11-26 Supreeth Narasimhaswamy , Uttaran Bhattacharya , Xiang Chen , Ishita Dasgupta , Saayan Mitra , Minh Hoai

HOIDiffusion: Generating Realistic 3D Hand-Object Interaction Data

3D hand-object interaction data is scarce due to the hardware constraints in scaling up the data collection process. In this paper, we propose HOIDiffusion for generating realistic and diverse 3D hand-object interaction data. Our model is a…

Computer Vision and Pattern Recognition · Computer Science 2024-03-19 Mengqi Zhang , Yang Fu , Zheng Ding , Sifei Liu , Zhuowen Tu , Xiaolong Wang

MotionDiffuse: Text-Driven Human Motion Generation with Diffusion Model

Human motion modeling is important for many modern graphics applications, which typically require professional skills. In order to remove the skill barriers for laymen, recent motion generation methods can directly generate human motions…

Computer Vision and Pattern Recognition · Computer Science 2022-09-01 Mingyuan Zhang , Zhongang Cai , Liang Pan , Fangzhou Hong , Xinying Guo , Lei Yang , Ziwei Liu

Giving a Hand to Diffusion Models: a Two-Stage Approach to Improving Conditional Human Image Generation

Recent years have seen significant progress in human image generation, particularly with the advancements in diffusion models. However, existing diffusion methods encounter challenges when producing consistent hand anatomy and the generated…

Computer Vision and Pattern Recognition · Computer Science 2024-05-01 Anton Pelykh , Ozge Mercanoglu Sincan , Richard Bowden

DIDiffGes: Decoupled Semi-Implicit Diffusion Models for Real-time Gesture Generation from Speech

Diffusion models have demonstrated remarkable synthesis quality and diversity in generating co-speech gestures. However, the computationally intensive sampling steps associated with diffusion models hinder their practicality in real-world…

Graphics · Computer Science 2025-03-24 Yongkang Cheng , Shaoli Huang , Xuelin Chen , Jifeng Ning , Mingming Gong

Interact2Ar: Full-Body Human-Human Interaction Generation via Autoregressive Diffusion Models

Generating realistic human-human interactions is a challenging task that requires not only high-quality individual body and hand motions, but also coherent coordination among all interactants. Due to limitations in available data and…

Computer Vision and Pattern Recognition · Computer Science 2026-03-30 Pablo Ruiz-Ponce , Sergio Escalera , José García-Rodríguez , Jiankang Deng , Rolandos Alexandros Potamias

M2Diffuser: Diffusion-based Trajectory Optimization for Mobile Manipulation in 3D Scenes

Recent advances in diffusion models have opened new avenues for research into embodied AI agents and robotics. Despite significant achievements in complex robotic locomotion and skills, mobile manipulation-a capability that requires the…

Robotics · Computer Science 2025-04-03 Sixu Yan , Zeyu Zhang , Muzhi Han , Zaijin Wang , Qi Xie , Zhitian Li , Zhehan Li , Hangxin Liu , Xinggang Wang , Song-Chun Zhu

Affordance Diffusion: Synthesizing Hand-Object Interactions

Recent successes in image synthesis are powered by large-scale diffusion models. However, most methods are currently limited to either text- or image-conditioned generation for synthesizing an entire image, texture transfer or inserting…

Computer Vision and Pattern Recognition · Computer Science 2023-05-23 Yufei Ye , Xueting Li , Abhinav Gupta , Shalini De Mello , Stan Birchfield , Jiaming Song , Shubham Tulsiani , Sifei Liu

Diff-IP2D: Diffusion-Based Hand-Object Interaction Prediction on Egocentric Videos

Understanding how humans would behave during hand-object interaction is vital for applications in service robot manipulation and extended reality. To achieve this, some recent works have been proposed to simultaneously forecast hand…

Computer Vision and Pattern Recognition · Computer Science 2025-11-17 Junyi Ma , Jingyi Xu , Xieyuanli Chen , Hesheng Wang

DexHandDiff: Interaction-aware Diffusion Planning for Adaptive Dexterous Manipulation

Dexterous manipulation with contact-rich interactions is crucial for advanced robotics. While recent diffusion-based planning approaches show promise for simple manipulation tasks, they often produce unrealistic ghost states (e.g., the…

Robotics · Computer Science 2025-06-18 Zhixuan Liang , Yao Mu , Yixiao Wang , Tianxing Chen , Wenqi Shao , Wei Zhan , Masayoshi Tomizuka , Ping Luo , Mingyu Ding

GraspDiffusion: Synthesizing Realistic Whole-body Hand-Object Interaction

Recent generative models can synthesize high-quality images, but they often fail to generate humans interacting with objects using their hands. This arises mostly from the model's misunderstanding of such interactions and the hardships of…

Computer Vision and Pattern Recognition · Computer Science 2025-12-01 Patrick Kwon , Chen Chen , Hanbyul Joo

Diffgrasp: Whole-Body Grasping Synthesis Guided by Object Motion Using a Diffusion Model

Generating high-quality whole-body human object interaction motion sequences is becoming increasingly important in various fields such as animation, VR/AR, and robotics. The main challenge of this task lies in determining the level of…

Computer Vision and Pattern Recognition · Computer Science 2024-12-31 Yonghao Zhang , Qiang He , Yanguang Wan , Yinda Zhang , Xiaoming Deng , Cuixia Ma , Hongan Wang

FastGrasp: Efficient Grasp Synthesis with Diffusion

Effectively modeling the interaction between human hands and objects is challenging due to the complex physical constraints and the requirement for high generation efficiency in applications. Prior approaches often employ computationally…

Robotics · Computer Science 2024-11-25 Xiaofei Wu , Tao Liu , Caoji Li , Yuexin Ma , Yujiao Shi , Xuming He

FUSION: Full-Body Unified Motion Prior for Body and Hands via Diffusion

Hands are central to interacting with our surroundings and conveying gestures, making their inclusion essential for full-body motion synthesis. Despite this, existing human motion synthesis methods fall short: some ignore hand motions…

Computer Vision and Pattern Recognition · Computer Science 2026-01-08 Enes Duran , Nikos Athanasiou , Muhammed Kocabas , Michael J. Black , Omid Taheri

HandX: Scaling Bimanual Motion and Interaction Generation

Synthesizing human motion has advanced rapidly, yet realistic hand motion and bimanual interaction remain underexplored. Whole-body models often miss the fine-grained cues that drive dexterous behavior, finger articulation, contact timing,…

Computer Vision and Pattern Recognition · Computer Science 2026-03-31 Zimu Zhang , Yucheng Zhang , Xiyan Xu , Ziyin Wang , Sirui Xu , Kai Zhou , Bing Zhou , Chuan Guo , Jian Wang , Yu-Xiong Wang , Liang-Yan Gui

M2D2M: Multi-Motion Generation from Text with Discrete Diffusion Models

We introduce the Multi-Motion Discrete Diffusion Models (M2D2M), a novel approach for human motion generation from textual descriptions of multiple actions, utilizing the strengths of discrete diffusion models. This approach adeptly…

Computer Vision and Pattern Recognition · Computer Science 2024-07-22 Seunggeun Chi , Hyung-gun Chi , Hengbo Ma , Nakul Agarwal , Faizan Siddiqui , Karthik Ramani , Kwonjoon Lee

Diffusion Handles: Enabling 3D Edits for Diffusion Models by Lifting Activations to 3D

Diffusion Handles is a novel approach to enabling 3D object edits on diffusion images. We accomplish these edits using existing pre-trained diffusion models, and 2D image depth estimation, without any fine-tuning or 3D object retrieval. The…

Computer Vision and Pattern Recognition · Computer Science 2023-12-08 Karran Pandey , Paul Guerrero , Matheus Gadelha , Yannick Hold-Geoffroy , Karan Singh , Niloy Mitra