Related papers: DiffusionPhase: Motion Diffusion in Frequency Doma…

Towards Open Domain Text-Driven Synthesis of Multi-Person Motions

This work aims to generate natural and diverse group motions of multiple humans from textual descriptions. While single-person text-to-motion generation is extensively studied, it remains challenging to synthesize motions for more than one…

Computer Vision and Pattern Recognition · Computer Science · 2024-07-16 · Mengyi Shan, Lu Dong, Yutao Han, Yuan Yao, Tao Liu, Ifeoma Nwogu, Guo-Jun Qi, Mitch Hill

Executing your Commands via Motion Diffusion in Latent Space

We study a challenging task, conditional human motion generation, which produces plausible human motion sequences according to various conditional inputs, such as action classes or textual descriptors. Since human motions are highly diverse…

Computer Vision and Pattern Recognition · Computer Science · 2023-05-22 · Xin Chen, Biao Jiang, Wen Liu, Zilong Huang, Bin Fu, Tao Chen, Jingyi Yu, Gang Yu

MotionDiffuse: Text-Driven Human Motion Generation with Diffusion Model

Human motion modeling is important for many modern graphics applications, which typically require professional skills. In order to remove the skill barriers for laymen, recent motion generation methods can directly generate human motions…

Computer Vision and Pattern Recognition · Computer Science · 2022-09-01 · Mingyuan Zhang, Zhongang Cai, Liang Pan, Fangzhou Hong, Xinying Guo, Lei Yang, Ziwei Liu

Fg-T2M: Fine-Grained Text-Driven Human Motion Generation via Diffusion Model

Text-driven human motion generation in computer vision is both significant and challenging. However, current methods are limited to producing either deterministic or imprecise motion sequences, failing to effectively control the temporal…

Computer Vision and Pattern Recognition · Computer Science · 2023-09-13 · Yin Wang, Zhiying Leng, Frederick W. B. Li, Shun-Cheng Wu, Xiaohui Liang

Text-driven Human Motion Generation with Motion Masked Diffusion Model

Text-driven human motion generation is a multimodal task that synthesizes human motion sequences conditioned on natural language. It requires the model to satisfy textual descriptions under varying conditional inputs, while generating…

Computer Vision and Pattern Recognition · Computer Science · 2024-10-01 · Xingyu Chen

Move-in-2D: 2D-Conditioned Human Motion Generation

Generating realistic human videos remains a challenging task, with the most effective methods currently relying on a human motion sequence as a control signal. Existing approaches often use existing motion extracted from other videos, which…

Computer Vision and Pattern Recognition · Computer Science · 2024-12-18 · Hsin-Ping Huang, Yang Zhou, Jui-Hsien Wang, Difan Liu, Feng Liu, Ming-Hsuan Yang, Zhan Xu

Synthesizing Long-Term Human Motions with Diffusion Models via Coherent Sampling

Text-to-motion generation has gained increasing attention, but most existing methods are limited to generating short-term motions that correspond to a single sentence describing a single action. However, when a text stream describes a…

Computer Vision and Pattern Recognition · Computer Science · 2023-08-04 · Zhao Yang, Bing Su, Ji-Rong Wen

Human Motion Generation: A Survey

Human motion generation aims to generate natural human pose sequences and shows immense potential for real-world applications. Substantial progress has been made recently in motion data collection technologies and generation methods, laying…

Computer Vision and Pattern Recognition · Computer Science · 2023-11-16 · Wentao Zhu, Xiaoxuan Ma, Dongwoo Ro, Hai Ci, Jinlu Zhang, Jiaxin Shi, Feng Gao, Qi Tian, Yizhou Wang

Human Motion Diffusion Model

Natural and expressive human motion generation is the holy grail of computer animation. It is a challenging task, due to the diversity of possible motion, human perceptual sensitivity to it, and the difficulty of accurately describing it.…

Computer Vision and Pattern Recognition · Computer Science · 2022-10-04 · Guy Tevet, Sigal Raab, Brian Gordon, Yonatan Shafir, Daniel Cohen-Or, Amit H. Bermano

Diffusion Path Alignment for Long-Range Motion Generation and Domain Transitions

Long-range human movement generation remains a central challenge in computer vision and graphics. Generating coherent transitions across semantically distinct motion domains remains largely unexplored. This capability is particularly…

Computer Vision and Pattern Recognition · Computer Science · 2026-04-07 · Haichao Wang, Alexander Okupnik, Yuxing Han, Gene Wen, Johannes Schneider, Kyriakos Flouris

AMD: Autoregressive Motion Diffusion

Human motion generation aims to produce plausible human motion sequences according to various conditional inputs, such as text or audio. Despite the feasibility of existing methods in generating motion based on short prompts and simple…

Multimedia · Computer Science · 2024-11-12 · Bo Han, Hao Peng, Minjing Dong, Yi Ren, Yixuan Shen, Chang Xu

Enhanced Fine-grained Motion Diffusion for Text-driven Human Motion Synthesis

The emergence of text-driven motion synthesis technique provides animators with great potential to create efficiently. However, in most cases, textual expressions only contain general and qualitative motion descriptions, while lack fine…

Computer Vision and Pattern Recognition · Computer Science · 2023-12-27 · Dong Wei, Xiaoning Sun, Huaijiang Sun, Bin Li, Shengxiang Hu, Weiqing Li, Jianfeng Lu

Make-An-Animation: Large-Scale Text-conditional 3D Human Motion Generation

Text-guided human motion generation has drawn significant interest because of its impactful applications spanning animation and robotics. Recently, application of diffusion models for motion generation has enabled improvements in the…

Computer Vision and Pattern Recognition · Computer Science · 2023-05-17 · Samaneh Azadi, Akbar Shah, Thomas Hayes, Devi Parikh, Sonal Gupta

Shape Conditioned Human Motion Generation with Diffusion Model

Human motion synthesis is an important task in computer graphics and computer vision. While focusing on various conditioning signals such as text, action class, or audio to guide the generation process, most existing methods utilize…

Computer Vision and Pattern Recognition · Computer Science · 2024-05-14 · Kebing Xue, Hyewon Seo

MotionStreamer: Streaming Motion Generation via Diffusion-based Autoregressive Model in Causal Latent Space

This paper addresses the challenge of text-conditioned streaming motion generation, which requires us to predict the next-step human pose based on variable-length historical motions and incoming texts. Existing methods struggle to achieve…

Computer Vision and Pattern Recognition · Computer Science · 2025-08-08 · Lixing Xiao, Shunlin Lu, Huaijin Pi, Ke Fan, Liang Pan, Yueer Zhou, Ziyong Feng, Xiaowei Zhou, Sida Peng, Jingbo Wang

Human Motion Diffusion as a Generative Prior

Recent work has demonstrated the significant potential of denoising diffusion models for generating human motion, including text-to-motion capabilities. However, these methods are restricted by the paucity of annotated motion data, a focus…

Computer Vision and Pattern Recognition · Computer Science · 2023-08-31 · Yonatan Shafir, Guy Tevet, Roy Kapon, Amit H. Bermano

FrankenMotion: Part-level Human Motion Generation and Composition

Human motion generation from text prompts has made remarkable progress in recent years. However, existing methods primarily rely on either sequence-level or action-level descriptions due to the absence of fine-grained, part-level motion…

Computer Vision and Pattern Recognition · Computer Science · 2026-01-19 · Chuqiao Li, Xianghui Xie, Yong Cao, Andreas Geiger, Gerard Pons-Moll

Seamless Human Motion Composition with Blended Positional Encodings

Conditional human motion generation is an important topic with many applications in virtual reality, gaming, and robotics. While prior works have focused on generating motion guided by text, music, or scenes, these typically result in…

Computer Vision and Pattern Recognition · Computer Science · 2024-02-26 · German Barquero, Sergio Escalera, Cristina Palmero

Flexible Motion In-betweening with Diffusion Models

Motion in-betweening, a fundamental task in character animation, consists of generating motion sequences that plausibly interpolate user-provided keyframe constraints. It has long been recognized as a labor-intensive and challenging…

Computer Vision and Pattern Recognition · Computer Science · 2024-05-27 · Setareh Cohan, Guy Tevet, Daniele Reda, Xue Bin Peng, Michiel van de Panne

FTMoMamba: Motion Generation with Frequency and Text State Space Models

Diffusion models achieve impressive performance in human motion generation. However, current approaches typically ignore the significance of frequency-domain information in capturing fine-grained motions within the latent space (e.g., low…

Computer Vision and Pattern Recognition · Computer Science · 2024-11-27 · Chengjian Li, Xiangbo Shu, Qiongjie Cui, Yazhou Yao, Jinhui Tang