Related papers: Less is More: Improving Motion Diffusion Models wi…

200 papers

The emergence of text-driven motion synthesis technique provides animators with great potential to create efficiently. However, in most cases, textual expressions only contain general and qualitative motion descriptions, while lack fine…

Computer Vision and Pattern Recognition · Computer Science 2023-12-27 Dong Wei, Xiaoning Sun, Huaijiang Sun, Bin Li, Shengxiang Hu, Weiqing Li, Jianfeng Lu

Motion in-betweening, a fundamental task in character animation, consists of generating motion sequences that plausibly interpolate user-provided keyframe constraints. It has long been recognized as a labor-intensive and challenging…

Computer Vision and Pattern Recognition · Computer Science 2024-05-27 Setareh Cohan, Guy Tevet, Daniele Reda, Xue Bin Peng, Michiel van de Panne

Denoising diffusion models have shown great promise in human motion synthesis conditioned on natural language descriptions. However, integrating spatial constraints, such as pre-defined motion trajectories and obstacles, remains a challenge…

Computer Vision and Pattern Recognition · Computer Science 2023-10-31 Korrawe Karunratanakul, Konpat Preechakul, Supasorn Suwajanakorn, Siyu Tang

Recent advancements in diffusion models have significantly improved the realism and generalizability of character-driven animation, enabling the synthesis of high-quality motion from just a single RGB image and a set of driving poses.…

Computer Vision and Pattern Recognition · Computer Science 2025-12-02 Alireza Javanmardi, Pragati Jaiswal, Tewodros Amberbir Habtegebrial, Christen Millerdurai, Shaoxiang Wang, Alain Pagani, Didier Stricker

The emergence of generative AI and controllable diffusion has made image-to-image synthesis increasingly practical and efficient. However, when input images are sparse and exhibit low entropy, the inherent characteristics of diffusion models…

Computer Vision and Pattern Recognition · Computer Science 2025-01-14 Hao Wang, Xiwen Chen, Ashish Bastola, Jiayou Qin, Abolfazl Razi

We present a method for generating video sequences with coherent motion between a pair of input key frames. We adapt a pretrained large-scale image-to-video diffusion model (originally trained to generate videos moving forward in time from…

Computer Vision and Pattern Recognition · Computer Science 2025-02-13 Xiaojuan Wang, Boyang Zhou, Brian Curless, Ira Kemelmacher-Shlizerman, Aleksander Holynski, Steven M. Seitz

Effective human behavior modeling is critical for successful human-robot interaction. Current state-of-the-art approaches for predicting listening head behavior during dyadic conversations employ continuous-to-discrete representations,…

Computer Vision and Pattern Recognition · Computer Science 2025-04-09 Tri Tung Nguyen Nguyen, Quang Tien Dam, Dinh Tuan Tran, Joo-Ho Lee

Keyframes are a standard representation for kinematic motion specification. Recent learned motion-inbetweening methods use keyframes as a way to control generative motion models, and are trained to generate life-like motion that matches the…

Graphics · Computer Science 2025-03-04 Purvi Goel, Haotian Zhang, C. Karen Liu, Kayvon Fatahalian

Diffusion-based inpainting is a powerful tool for the reconstruction of images from sparse data. Its quality strongly depends on the choice of known data. Optimising their spatial location -- the inpainting mask -- is challenging. A…

Image and Video Processing · Electrical Eng. & Systems 2022-05-17 Tobias Alt, Pascal Peter, Joachim Weickert

Creating expressive character animations is labor-intensive, requiring intricate manual adjustment of animators across space and time. Previous works on controllable motion generation often rely on a predefined set of dense spatio-temporal…

Graphics · Computer Science 2025-07-28 Inwoo Hwang, Jinseok Bae, Donggeun Lim, Young Min Kim

Recent advances in generative models have yielded impressive progress on motion in-betweening, allowing for more complex, varied, and realistic motion transitions. However, recent methods still exhibit noticeable limitations in preserving…

Graphics · Computer Science 2026-05-14 Shiyu Fan, Paul Henderson, Edmond S. L. Ho

Diffusion models that are based on iterative denoising have been recently proposed and leveraged in various generation tasks like image generation. Whereas, as a way inherently built for continuous data, existing diffusion models still have…

Computation and Language · Computer Science 2023-04-11 Jiaao Chen, Aston Zhang, Mu Li, Alex Smola, Diyi Yang

Controllable generation of 3D human motions becomes an important topic as the world embraces digital transformation. Existing works, though making promising progress with the advent of diffusion models, heavily rely on meticulously captured…

Computer Vision and Pattern Recognition · Computer Science 2024-01-25 Nhat M. Hoang, Kehong Gong, Chuan Guo, Michael Bi Mi

Prior masked modeling motion generation methods predominantly study text-to-motion. We present DiMo, a discrete diffusion-style framework, which extends masked modeling to bidirectional text-motion understanding and generation. Unlike…

Computer Vision and Pattern Recognition · Computer Science 2026-02-09 Ning Zhang, Zhengyu Li, Kwong Weng Loh, Mingxi Xu, Qi Wang, Zhengyu Wen, Xiaoyu He, Wei Zhao, Kehong Gong, Mingyuan Zhang

Portrait animation aims to generate photo-realistic videos from a single source image by reenacting the expression and pose from a driving video. While early methods relied on 3D morphable models or feature warping techniques, they often…

Computer Vision and Pattern Recognition · Computer Science 2025-09-23 Mallikarjun B. R., Fei Yin, Vikram Voleti, Nikita Drobyshev, Maksim Lapin, Aaryaman Vasishta, Varun Jampani

Video generation using diffusion-based models is constrained by high computational costs due to the frame-wise iterative diffusion process. This work presents a Diffusion Reuse MOtion (Dr. Mo) network to accelerate latent video generation.…

Computer Vision and Pattern Recognition · Computer Science 2024-09-20 Chenyu Wang, Shuo Yan, Yixuan Chen, Yujiang Wang, Mingzhi Dong, Xiaochen Yang, Dongsheng Li, Robert P. Dick, Qin Lv, Fan Yang, Tun Lu, Ning Gu, Li Shang

Generative models, particularly diffusion models, have made significant success in data synthesis across various modalities, including images, videos, and 3D assets. However, current diffusion models are computationally intensive, often…

Computer Vision and Pattern Recognition · Computer Science 2024-12-10 Yuanzhi Zhu, Hanshu Yan, Huan Yang, Kai Zhang, Junnan Li

For bandwidth-constrained multimedia applications, simultaneously achieving ultra-low bitrate human video compression and accurate vertex prediction remains a critical challenge, as it demands the harmonization of dynamic motion modeling,…

Computer Vision and Pattern Recognition · Computer Science 2025-09-30 Bolin Chen, Ru-Ling Liao, Yan Ye, Jie Chen, Shanzhi Yin, Xinrui Ju, Shiqi Wang, Yibo Fan

Diffusion models have shown remarkable success across a wide range of generative tasks. However, they often suffer from spatially inconsistent generation, arguably due to the inherent locality of their denoising mechanisms. This can yield…

Machine Learning · Computer Science 2026-02-04 Wenshuai Zhao, Zhiyuan Li, Yi Zhao, Mohammad Hassan Vali, Martin Trapp, Joni Pajarinen, Juho Kannala, Arno Solin

Interactive motion synthesis is essential in creating immersive experiences in entertainment applications, such as video games and virtual reality. However, generating animations that are both high-quality and contextually responsive…

Computer Vision and Pattern Recognition · Computer Science 2024-01-15 Tianyu Li, Calvin Qiao, Guanqiao Ren, KangKang Yin, Sehoon Ha