Related papers: Bidirectional Autoregressive Diffusion Model for D…

DiffDance: Cascaded Human Motion Diffusion Model for Dance Generation

When hearing music, it is natural for people to dance to its rhythm. Automatic dance generation, however, is a challenging task due to the physical constraints of human motion and rhythmic alignment with target music. Conventional…

Graphics · Computer Science 2023-08-08 Qiaosong Qi , Le Zhuo , Aixi Zhang , Yue Liao , Fei Fang , Si Liu , Shuicheng Yan

BAMM: Bidirectional Autoregressive Motion Model

Generating human motion from text has been dominated by denoising motion models either through diffusion or generative masking process. However, these models face great limitations in usability by requiring prior knowledge of the motion…

Computer Vision and Pattern Recognition · Computer Science 2024-04-02 Ekkasit Pinyoanuntapong , Muhammad Usama Saleem , Pu Wang , Minwoo Lee , Srijan Das , Chen Chen

Human Motion Diffusion Model

Natural and expressive human motion generation is the holy grail of computer animation. It is a challenging task, due to the diversity of possible motion, human perceptual sensitivity to it, and the difficulty of accurately describing it.…

Computer Vision and Pattern Recognition · Computer Science 2022-10-04 Guy Tevet , Sigal Raab , Brian Gordon , Yonatan Shafir , Daniel Cohen-Or , Amit H. Bermano

Taming Diffusion Models for Music-driven Conducting Motion Generation

Generating the motion of orchestral conductors from a given piece of symphony music is a challenging task since it requires a model to learn semantic music features and capture the underlying distribution of real conducting motion. Prior…

Audio and Speech Processing · Electrical Eng. & Systems 2023-11-14 Zhuoran Zhao , Jinbin Bai , Delong Chen , Debang Wang , Yubo Pan

LongDanceDiff: Long-term Dance Generation with Conditional Diffusion Model

Dancing with music is always an essential human art form to express emotion. Due to the high temporal-spacial complexity, long-term 3D realist dance generation synchronized with music is challenging. Existing methods suffer from the…

Computer Vision and Pattern Recognition · Computer Science 2023-08-24 Siqi Yang , Zejun Yang , Zhisheng Wang

Coordinate-Based Dual-Constrained Autoregressive Motion Generation

Text-to-motion generation has attracted increasing attention in the research community recently, with potential applications in animation, virtual reality, robotics, and human-computer interaction. Diffusion and autoregressive models are…

Computer Vision and Pattern Recognition · Computer Science 2026-04-10 Kang Ding , Hongsong Wang , Jie Gui , Liang Wang

Causal Motion Diffusion Models for Autoregressive Motion Generation

Recent advances in motion diffusion models have substantially improved the realism of human motion synthesis. However, existing approaches either rely on full-sequence diffusion models with bidirectional generation, which limits temporal…

Computer Vision and Pattern Recognition · Computer Science 2026-02-27 Qing Yu , Akihisa Watanabe , Kent Fujiwara

DGFM: Full Body Dance Generation Driven by Music Foundation Models

In music-driven dance motion generation, most existing methods use hand-crafted features and neglect that music foundation models have profoundly impacted cross-modal content generation. To bridge this gap, we propose a diffusion-based…

Sound · Computer Science 2025-02-28 Xinran Liu , Zhenhua Feng , Diptesh Kanojia , Wenwu Wang

Bidirectional Temporal Diffusion Model for Temporally Consistent Human Animation

We introduce a method to generate temporally coherent human animation from a single image, a video, or a random noise. This problem has been formulated as modeling of an auto-regressive generation, i.e., to regress past frames to decode…

Computer Vision and Pattern Recognition · Computer Science 2024-03-25 Tserendorj Adiya , Jae Shin Yoon , Jungeun Lee , Sanghun Kim , Hwasup Lim

Transflower: probabilistic autoregressive dance generation with multimodal attention

Dance requires skillful composition of complex movements that follow rhythmic, tonal and timbral features of music. Formally, generating dance conditioned on a piece of music can be expressed as a problem of modelling a high-dimensional…

Sound · Computer Science 2022-06-14 Guillermo Valle-Pérez , Gustav Eje Henter , Jonas Beskow , André Holzapfel , Pierre-Yves Oudeyer , Simon Alexanderson

DANCER: Dance ANimation via Condition Enhancement and Rendering with diffusion model

Recently, diffusion models have shown their impressive ability in visual generation tasks. Besides static images, more and more research attentions have been drawn to the generation of realistic videos. The video generation not only has a…

Computer Vision and Pattern Recognition · Computer Science 2025-11-03 Yucheng Xing , Jinxing Yin , Xiaodong Liu

Text-driven Human Motion Generation with Motion Masked Diffusion Model

Text-driven human motion generation is a multimodal task that synthesizes human motion sequences conditioned on natural language. It requires the model to satisfy textual descriptions under varying conditional inputs, while generating…

Computer Vision and Pattern Recognition · Computer Science 2024-10-01 Xingyu Chen

Controllable Motion Synthesis and Reconstruction with Autoregressive Diffusion Models

Data-driven and controllable human motion synthesis and prediction are active research areas with various applications in interactive media and social robotics. Challenges remain in these fields for generating diverse motions given past…

Computer Vision and Pattern Recognition · Computer Science 2023-04-11 Wenjie Yin , Ruibo Tu , Hang Yin , Danica Kragic , Hedvig Kjellström , Mårten Björkman

Controllable Dance Generation with Style-Guided Motion Diffusion

Dance plays an important role as an artistic form and expression in human culture, yet automatically generating dance sequences is a significant yet challenging endeavor. Existing approaches often neglect the critical aspect of…

Computer Vision and Pattern Recognition · Computer Science 2026-03-11 Hongsong Wang , Ying Zhu , Xin Geng , Liang Wang

Fast Diffusion GAN Model for Symbolic Music Generation Controlled by Emotions

Diffusion models have shown promising results for a wide range of generative tasks with continuous data, such as image and audio synthesis. However, little progress has been made on using diffusion models to generate discrete symbolic music…

Sound · Computer Science 2023-10-24 Jincheng Zhang , György Fazekas , Charalampos Saitis

Listen to Rhythm, Choose Movements: Autoregressive Multimodal Dance Generation via Diffusion and Mamba with Decoupled Dance Dataset

Advances in generative models and sequence learning have greatly promoted research in dance motion generation, yet current methods still suffer from coarse semantic control and poor coherence in long sequences. In this work, we present…

Graphics · Computer Science 2026-04-08 Oran Duan , Yinghua Shen , Yingzhu Lv , Luyang Jie , Yaxin Liu , Qiong Wu

GACA-DiT: Diffusion-based Dance-to-Music Generation with Genre-Adaptive Rhythm and Context-Aware Alignment

Dance-to-music (D2M) generation aims to automatically compose music that is rhythmically and temporally aligned with dance movements. Existing methods typically rely on coarse rhythm embeddings, such as global motion features or binarized…

Sound · Computer Science 2026-03-03 Jinting Wang , Chenxing Li , Li Liu

Music2Dance: DanceNet for Music-driven Dance Generation

Synthesize human motions from music, i.e., music to dance, is appealing and attracts lots of research interests in recent years. It is challenging due to not only the requirement of realistic and complex human motions for dance, but more…

Computer Vision and Pattern Recognition · Computer Science 2020-03-12 Wenlin Zhuang , Congyi Wang , Siyu Xia , Jinxiang Chai , Yangang Wang

EMDM: Efficient Motion Diffusion Model for Fast and High-Quality Motion Generation

We introduce Efficient Motion Diffusion Model (EMDM) for fast and high-quality human motion generation. Current state-of-the-art generative diffusion models have produced impressive results but struggle to achieve fast generation without…

Computer Vision and Pattern Recognition · Computer Science 2024-11-26 Wenyang Zhou , Zhiyang Dou , Zeyu Cao , Zhouyingcheng Liao , Jingbo Wang , Wenjia Wang , Yuan Liu , Taku Komura , Wenping Wang , Lingjie Liu

Listen to Dance: Music-driven choreography generation using Autoregressive Encoder-Decoder Network

Automatic choreography generation is a challenging task because it often requires an understanding of two abstract concepts - music and dance - which are realized in the two different modalities, namely audio and video, respectively. In…

Multimedia · Computer Science 2018-11-05 Juheon Lee , Seohyun Kim , Kyogu Lee