Related papers: Human Motion Diffusion Model

Back to Basics: Motion Representation Matters for Human Motion Generation Using Diffusion Model

Diffusion models have emerged as a widely utilized and successful methodology in human motion synthesis. Task-oriented diffusion models have significantly advanced action-to-motion, text-to-motion, and audio-to-motion applications. In this…

Computer Vision and Pattern Recognition · Computer Science 2025-12-05 Yuduo Jin , Brandon Haworth

Text-driven Human Motion Generation with Motion Masked Diffusion Model

Text-driven human motion generation is a multimodal task that synthesizes human motion sequences conditioned on natural language. It requires the model to satisfy textual descriptions under varying conditional inputs, while generating…

Computer Vision and Pattern Recognition · Computer Science 2024-10-01 Xingyu Chen

MotionDiffuse: Text-Driven Human Motion Generation with Diffusion Model

Human motion modeling is important for many modern graphics applications, which typically require professional skills. In order to remove the skill barriers for laymen, recent motion generation methods can directly generate human motions…

Computer Vision and Pattern Recognition · Computer Science 2022-09-01 Mingyuan Zhang , Zhongang Cai , Liang Pan , Fangzhou Hong , Xinying Guo , Lei Yang , Ziwei Liu

EMDM: Efficient Motion Diffusion Model for Fast and High-Quality Motion Generation

We introduce Efficient Motion Diffusion Model (EMDM) for fast and high-quality human motion generation. Current state-of-the-art generative diffusion models have produced impressive results but struggle to achieve fast generation without…

Computer Vision and Pattern Recognition · Computer Science 2024-11-26 Wenyang Zhou , Zhiyang Dou , Zeyu Cao , Zhouyingcheng Liao , Jingbo Wang , Wenjia Wang , Yuan Liu , Taku Komura , Wenping Wang , Lingjie Liu

M2D2M: Multi-Motion Generation from Text with Discrete Diffusion Models

We introduce the Multi-Motion Discrete Diffusion Models (M2D2M), a novel approach for human motion generation from textual descriptions of multiple actions, utilizing the strengths of discrete diffusion models. This approach adeptly…

Computer Vision and Pattern Recognition · Computer Science 2024-07-22 Seunggeun Chi , Hyung-gun Chi , Hengbo Ma , Nakul Agarwal , Faizan Siddiqui , Karthik Ramani , Kwonjoon Lee

Executing your Commands via Motion Diffusion in Latent Space

We study a challenging task, conditional human motion generation, which produces plausible human motion sequences according to various conditional inputs, such as action classes or textual descriptors. Since human motions are highly diverse…

Computer Vision and Pattern Recognition · Computer Science 2023-05-22 Xin Chen , Biao Jiang , Wen Liu , Zilong Huang , Bin Fu , Tao Chen , Jingyi Yu , Gang Yu

Strong and Controllable 3D Motion Generation

Human motion generation is a significant pursuit in generative computer vision with widespread applications in film-making, video games, AR/VR, and human-robot interaction. Current methods mainly utilize either diffusion-based generative…

Computer Vision and Pattern Recognition · Computer Science 2025-02-03 Canxuan Gang

Realistic Human Motion Generation with Cross-Diffusion Models

We introduce the Cross Human Motion Diffusion Model (CrossDiff), a novel approach for generating high-quality human motion based on textual descriptions. Our method integrates 3D and 2D information using a shared transformer network within…

Computer Vision and Pattern Recognition · Computer Science 2024-08-06 Zeping Ren , Shaoli Huang , Xiu Li

Priority-Centric Human Motion Generation in Discrete Latent Space

Text-to-motion generation is a formidable task, aiming to produce human motions that align with the input text while also adhering to human capabilities and physical laws. While there have been advancements in diffusion models, their…

Computer Vision and Pattern Recognition · Computer Science 2023-08-31 Hanyang Kong , Kehong Gong , Dongze Lian , Michael Bi Mi , Xinchao Wang

Diffusion Motion: Generate Text-Guided 3D Human Motion by Diffusion Model

We propose a simple and novel method for generating 3D human motion from complex natural language sentences, which describe different velocity, direction and composition of all kinds of actions. Different from existing methods that use…

Computer Vision and Pattern Recognition · Computer Science 2023-04-17 Zhiyuan Ren , Zhihong Pan , Xin Zhou , Le Kang

FG-MDM: Towards Zero-Shot Human Motion Generation via ChatGPT-Refined Descriptions

Recently, significant progress has been made in text-based motion generation, enabling the generation of diverse and high-quality human motions that conform to textual descriptions. However, generating motions beyond the distribution of…

Computer Vision and Pattern Recognition · Computer Science 2024-12-06 Xu Shi , Wei Yao , Chuanchen Luo , Junran Peng , Hongwen Zhang , Yunlian Sun

RDM: Recurrent Diffusion Model for Human Motion Generation

Human motion generation is a challenging task due to its high dimensionality and the difficulty of generating fine-grained motions. Diffusion methods have been proposed due to their high sample quality and expressiveness. Early approaches…

Computer Vision and Pattern Recognition · Computer Science 2026-03-10 Mirgahney Mohamed , Harry Jake Cunningham , Marc P. Deisenroth , Lourdes Agapito

MMM: Generative Masked Motion Model

Recent advances in text-to-motion generation using diffusion and autoregressive models have shown promising results. However, these models often suffer from a trade-off between real-time performance, high fidelity, and motion editability.…

Computer Vision and Pattern Recognition · Computer Science 2024-03-29 Ekkasit Pinyoanuntapong , Pu Wang , Minwoo Lee , Chen Chen

Guided Motion Diffusion for Controllable Human Motion Synthesis

Denoising diffusion models have shown great promise in human motion synthesis conditioned on natural language descriptions. However, integrating spatial constraints, such as pre-defined motion trajectories and obstacles, remains a challenge…

Computer Vision and Pattern Recognition · Computer Science 2023-10-31 Korrawe Karunratanakul , Konpat Preechakul , Supasorn Suwajanakorn , Siyu Tang

Fg-T2M: Fine-Grained Text-Driven Human Motion Generation via Diffusion Model

Text-driven human motion generation in computer vision is both significant and challenging. However, current methods are limited to producing either deterministic or imprecise motion sequences, failing to effectively control the temporal…

Computer Vision and Pattern Recognition · Computer Science 2023-09-13 Yin Wang , Zhiying Leng , Frederick W. B. Li , Shun-Cheng Wu , Xiaohui Liang

MixerMDM: Learnable Composition of Human Motion Diffusion Models

Generating human motion guided by conditions such as textual descriptions is challenging due to the need for datasets with pairs of high-quality motion and their corresponding conditions. The difficulty increases when aiming for finer…

Computer Vision and Pattern Recognition · Computer Science 2025-04-02 Pablo Ruiz-Ponce , German Barquero , Cristina Palmero , Sergio Escalera , José García-Rodríguez

Bidirectional Autoregressive Diffusion Model for Dance Generation

Dance serves as a powerful medium for expressing human emotions, but the lifelike generation of dance is still a considerable challenge. Recently, diffusion models have showcased remarkable generative abilities across various domains. They…

Sound · Computer Science 2024-06-25 Canyu Zhang , Youbao Tang , Ning Zhang , Ruei-Sung Lin , Mei Han , Jing Xiao , Song Wang

Causal Motion Diffusion Models for Autoregressive Motion Generation

Recent advances in motion diffusion models have substantially improved the realism of human motion synthesis. However, existing approaches either rely on full-sequence diffusion models with bidirectional generation, which limits temporal…

Computer Vision and Pattern Recognition · Computer Science 2026-02-27 Qing Yu , Akihisa Watanabe , Kent Fujiwara

Shape Conditioned Human Motion Generation with Diffusion Model

Human motion synthesis is an important task in computer graphics and computer vision. While focusing on various conditioning signals such as text, action class, or audio to guide the generation process, most existing methods utilize…

Computer Vision and Pattern Recognition · Computer Science 2024-05-14 Kebing Xue , Hyewon Seo

FLAME: Free-form Language-based Motion Synthesis & Editing

Text-based motion generation models are drawing a surge of interest for their potential for automating the motion-making process in the game, animation, or robot industries. In this paper, we propose a diffusion-based motion synthesis and…

Computer Vision and Pattern Recognition · Computer Science 2023-01-03 Jihoon Kim , Jiseob Kim , Sungjoon Choi