Related papers: Less is More: Improving Motion Diffusion Models wi…

Enhanced Fine-grained Motion Diffusion for Text-driven Human Motion Synthesis

The emergence of text-driven motion synthesis technique provides animators with great potential to create efficiently. However, in most cases, textual expressions only contain general and qualitative motion descriptions, while lack fine…

Computer Vision and Pattern Recognition · Computer Science 2023-12-27 Dong Wei , Xiaoning Sun , Huaijiang Sun , Bin Li , Shengxiang Hu , Weiqing Li , Jianfeng Lu

Flexible Motion In-betweening with Diffusion Models

Motion in-betweening, a fundamental task in character animation, consists of generating motion sequences that plausibly interpolate user-provided keyframe constraints. It has long been recognized as a labor-intensive and challenging…

Computer Vision and Pattern Recognition · Computer Science 2024-05-27 Setareh Cohan , Guy Tevet , Daniele Reda , Xue Bin Peng , Michiel van de Panne

Guided Motion Diffusion for Controllable Human Motion Synthesis

Denoising diffusion models have shown great promise in human motion synthesis conditioned on natural language descriptions. However, integrating spatial constraints, such as pre-defined motion trajectories and obstacles, remains a challenge…

Computer Vision and Pattern Recognition · Computer Science 2023-10-31 Korrawe Karunratanakul , Konpat Preechakul , Supasorn Suwajanakorn , Siyu Tang

TalkingPose: Efficient Face and Gesture Animation with Feedback-guided Diffusion Model

Recent advancements in diffusion models have significantly improved the realism and generalizability of character-driven animation, enabling the synthesis of high-quality motion from just a single RGB image and a set of driving poses.…

Computer Vision and Pattern Recognition · Computer Science 2025-12-02 Alireza Javanmardi , Pragati Jaiswal , Tewodros Amberbir Habtegebrial , Christen Millerdurai , Shaoxiang Wang , Alain Pagani , Didier Stricker

Diffusion Prism: Enhancing Diversity and Morphology Consistency in Mask-to-Image Diffusion

The emergence of generative AI and controllable diffusion has made image-to-image synthesis increasingly practical and efficient. However, when input images exhibit low entropy and sparse, the inherent characteristics of diffusion models…

Computer Vision and Pattern Recognition · Computer Science 2025-01-14 Hao Wang , Xiwen Chen , Ashish Bastola , Jiayou Qin , Abolfazl Razi

Generative Inbetweening: Adapting Image-to-Video Models for Keyframe Interpolation

We present a method for generating video sequences with coherent motion between a pair of input key frames. We adapt a pretrained large-scale image-to-video diffusion model (originally trained to generate videos moving forward in time from…

Computer Vision and Pattern Recognition · Computer Science 2025-02-13 Xiaojuan Wang , Boyang Zhou , Brian Curless , Ira Kemelmacher-Shlizerman , Aleksander Holynski , Steven M. Seitz

When Less Is More: A Sparse Facial Motion Structure For Listening Motion Learning

Effective human behavior modeling is critical for successful human-robot interaction. Current state-of-the-art approaches for predicting listening head behavior during dyadic conversations employ continuous-to-discrete representations,…

Computer Vision and Pattern Recognition · Computer Science 2025-04-09 Tri Tung Nguyen Nguyen , Quang Tien Dam , Dinh Tuan Tran , Joo-Ho Lee

Generative Motion Infilling From Imprecisely Timed Keyframes

Keyframes are a standard representation for kinematic motion specification. Recent learned motion-inbetweening methods use keyframes as a way to control generative motion models, and are trained to generate life-like motion that matches the…

Graphics · Computer Science 2025-03-04 Purvi Goel , Haotian Zhang , C. Karen Liu , Kayvon Fatahalian

Learning Sparse Masks for Diffusion-based Image Inpainting

Diffusion-based inpainting is a powerful tool for the reconstruction of images from sparse data. Its quality strongly depends on the choice of known data. Optimising their spatial location -- the inpainting mask -- is challenging. A…

Image and Video Processing · Electrical Eng. & Systems 2022-05-17 Tobias Alt , Pascal Peter , Joachim Weickert

Motion Synthesis with Sparse and Flexible Keyjoint Control

Creating expressive character animations is labor-intensive, requiring intricate manual adjustment of animators across space and time. Previous works on controllable motion generation often rely on a predefined set of dense spatio-temporal…

Graphics · Computer Science 2025-07-28 Inwoo Hwang , Jinseok Bae , Donggeun Lim , Young Min Kim

Generative Motion In-betweening by Diffusion over Continuous Implicit Representations

Recent advances in generative models have yielded impressive progress on motion in-betweening, allowing for more complex, varied, and realistic motion transitions. However, recent methods still exhibit noticeable limitations in preserving…

Graphics · Computer Science 2026-05-14 Shiyu Fan , Paul Henderson , Edmond S. L. Ho

A Cheaper and Better Diffusion Language Model with Soft-Masked Noise

Diffusion models that are based on iterative denoising have been recently proposed and leveraged in various generation tasks like image generation. Whereas, as a way inherently built for continuous data, existing diffusion models still have…

Computation and Language · Computer Science 2023-04-11 Jiaao Chen , Aston Zhang , Mu Li , Alex Smola , Diyi Yang

MotionMix: Weakly-Supervised Diffusion for Controllable Motion Generation

Controllable generation of 3D human motions becomes an important topic as the world embraces digital transformation. Existing works, though making promising progress with the advent of diffusion models, heavily rely on meticulously captured…

Computer Vision and Pattern Recognition · Computer Science 2024-01-25 Nhat M. Hoang , Kehong Gong , Chuan Guo , Michael Bi Mi

DiMo: Discrete Diffusion Modeling for Motion Generation and Understanding

Prior masked modeling motion generation methods predominantly study text-to-motion. We present DiMo, a discrete diffusion-style framework, which extends masked modeling to bidirectional text--motion understanding and generation. Unlike…

Computer Vision and Pattern Recognition · Computer Science 2026-02-09 Ning Zhang , Zhengyu Li , Kwong Weng Loh , Mingxi Xu , Qi Wang , Zhengyu Wen , Xiaoyu He , Wei Zhao , Kehong Gong , Mingyuan Zhang

Stable Video-Driven Portraits

Portrait animation aims to generate photo-realistic videos from a single source image by reenacting the expression and pose from a driving video. While early methods relied on 3D morphable models or feature warping techniques, they often…

Computer Vision and Pattern Recognition · Computer Science 2025-09-23 Mallikarjun B. R. , Fei Yin , Vikram Voleti , Nikita Drobyshev , Maksim Lapin , Aaryaman Vasishta , Varun Jampani

Denoising Reuse: Exploiting Inter-frame Motion Consistency for Efficient Video Latent Generation

Video generation using diffusion-based models is constrained by high computational costs due to the frame-wise iterative diffusion process. This work presents a Diffusion Reuse MOtion (Dr. Mo) network to accelerate latent video generation.…

Computer Vision and Pattern Recognition · Computer Science 2024-09-20 Chenyu Wang , Shuo Yan , Yixuan Chen , Yujiang Wang , Mingzhi Dong , Xiaochen Yang , Dongsheng Li , Robert P. Dick , Qin Lv , Fan Yang , Tun Lu , Ning Gu , Li Shang

Accelerating Video Diffusion Models via Distribution Matching

Generative models, particularly diffusion models, have made significant success in data synthesis across various modalities, including images, videos, and 3D assets. However, current diffusion models are computationally intensive, often…

Computer Vision and Pattern Recognition · Computer Science 2024-12-10 Yuanzhi Zhu , Hanshu Yan , Huan Yang , Kai Zhang , Junnan Li

Sparse2Dense: A Keypoint-driven Generative Framework for Human Video Compression and Vertex Prediction

For bandwidth-constrained multimedia applications, simultaneously achieving ultra-low bitrate human video compression and accurate vertex prediction remains a critical challenge, as it demands the harmonization of dynamic motion modeling,…

Computer Vision and Pattern Recognition · Computer Science 2025-09-30 Bolin Chen , Ru-Ling Liao , Yan Ye , Jie Chen , Shanzhi Yin , Xinrui Ju , Shiqi Wang , Yibo Fan

Sparsely Supervised Diffusion

Diffusion models have shown remarkable success across a wide range of generative tasks. However, they often suffer from spatially inconsistent generation, arguably due to the inherent locality of their denoising mechanisms. This can yield…

Machine Learning · Computer Science 2026-02-04 Wenshuai Zhao , Zhiyuan Li , Yi Zhao , Mohammad Hassan Vali , Martin Trapp , Joni Pajarinen , Juho Kannala , Arno Solin

AAMDM: Accelerated Auto-regressive Motion Diffusion Model

Interactive motion synthesis is essential in creating immersive experiences in entertainment applications, such as video games and virtual reality. However, generating animations that are both high-quality and contextually responsive…

Computer Vision and Pattern Recognition · Computer Science 2024-01-15 Tianyu Li , Calvin Qiao , Guanqiao Ren , KangKang Yin , Sehoon Ha