English
Related papers

Related papers: Bidirectional Autoregressive Diffusion Model for D…

200 papers

When hearing music, it is natural for people to dance to its rhythm. Automatic dance generation, however, is a challenging task due to the physical constraints of human motion and rhythmic alignment with target music. Conventional…

Graphics · Computer Science 2023-08-08 Qiaosong Qi , Le Zhuo , Aixi Zhang , Yue Liao , Fei Fang , Si Liu , Shuicheng Yan

Generating human motion from text has been dominated by denoising motion models either through diffusion or generative masking process. However, these models face great limitations in usability by requiring prior knowledge of the motion…

Computer Vision and Pattern Recognition · Computer Science 2024-04-02 Ekkasit Pinyoanuntapong , Muhammad Usama Saleem , Pu Wang , Minwoo Lee , Srijan Das , Chen Chen

Natural and expressive human motion generation is the holy grail of computer animation. It is a challenging task, due to the diversity of possible motion, human perceptual sensitivity to it, and the difficulty of accurately describing it.…

Computer Vision and Pattern Recognition · Computer Science 2022-10-04 Guy Tevet , Sigal Raab , Brian Gordon , Yonatan Shafir , Daniel Cohen-Or , Amit H. Bermano

Generating the motion of orchestral conductors from a given piece of symphony music is a challenging task since it requires a model to learn semantic music features and capture the underlying distribution of real conducting motion. Prior…

Audio and Speech Processing · Electrical Eng. & Systems 2023-11-14 Zhuoran Zhao , Jinbin Bai , Delong Chen , Debang Wang , Yubo Pan

Dancing with music is always an essential human art form to express emotion. Due to the high temporal-spacial complexity, long-term 3D realist dance generation synchronized with music is challenging. Existing methods suffer from the…

Computer Vision and Pattern Recognition · Computer Science 2023-08-24 Siqi Yang , Zejun Yang , Zhisheng Wang

Text-to-motion generation has attracted increasing attention in the research community recently, with potential applications in animation, virtual reality, robotics, and human-computer interaction. Diffusion and autoregressive models are…

Computer Vision and Pattern Recognition · Computer Science 2026-04-10 Kang Ding , Hongsong Wang , Jie Gui , Liang Wang

Recent advances in motion diffusion models have substantially improved the realism of human motion synthesis. However, existing approaches either rely on full-sequence diffusion models with bidirectional generation, which limits temporal…

Computer Vision and Pattern Recognition · Computer Science 2026-02-27 Qing Yu , Akihisa Watanabe , Kent Fujiwara

In music-driven dance motion generation, most existing methods use hand-crafted features and neglect that music foundation models have profoundly impacted cross-modal content generation. To bridge this gap, we propose a diffusion-based…

Sound · Computer Science 2025-02-28 Xinran Liu , Zhenhua Feng , Diptesh Kanojia , Wenwu Wang

We introduce a method to generate temporally coherent human animation from a single image, a video, or a random noise. This problem has been formulated as modeling of an auto-regressive generation, i.e., to regress past frames to decode…

Computer Vision and Pattern Recognition · Computer Science 2024-03-25 Tserendorj Adiya , Jae Shin Yoon , Jungeun Lee , Sanghun Kim , Hwasup Lim

Dance requires skillful composition of complex movements that follow rhythmic, tonal and timbral features of music. Formally, generating dance conditioned on a piece of music can be expressed as a problem of modelling a high-dimensional…

Recently, diffusion models have shown their impressive ability in visual generation tasks. Besides static images, more and more research attentions have been drawn to the generation of realistic videos. The video generation not only has a…

Computer Vision and Pattern Recognition · Computer Science 2025-11-03 Yucheng Xing , Jinxing Yin , Xiaodong Liu

Text-driven human motion generation is a multimodal task that synthesizes human motion sequences conditioned on natural language. It requires the model to satisfy textual descriptions under varying conditional inputs, while generating…

Computer Vision and Pattern Recognition · Computer Science 2024-10-01 Xingyu Chen

Data-driven and controllable human motion synthesis and prediction are active research areas with various applications in interactive media and social robotics. Challenges remain in these fields for generating diverse motions given past…

Computer Vision and Pattern Recognition · Computer Science 2023-04-11 Wenjie Yin , Ruibo Tu , Hang Yin , Danica Kragic , Hedvig Kjellström , Mårten Björkman

Dance plays an important role as an artistic form and expression in human culture, yet automatically generating dance sequences is a significant yet challenging endeavor. Existing approaches often neglect the critical aspect of…

Computer Vision and Pattern Recognition · Computer Science 2026-03-11 Hongsong Wang , Ying Zhu , Xin Geng , Liang Wang

Diffusion models have shown promising results for a wide range of generative tasks with continuous data, such as image and audio synthesis. However, little progress has been made on using diffusion models to generate discrete symbolic music…

Sound · Computer Science 2023-10-24 Jincheng Zhang , György Fazekas , Charalampos Saitis

Advances in generative models and sequence learning have greatly promoted research in dance motion generation, yet current methods still suffer from coarse semantic control and poor coherence in long sequences. In this work, we present…

Graphics · Computer Science 2026-04-08 Oran Duan , Yinghua Shen , Yingzhu Lv , Luyang Jie , Yaxin Liu , Qiong Wu

Dance-to-music (D2M) generation aims to automatically compose music that is rhythmically and temporally aligned with dance movements. Existing methods typically rely on coarse rhythm embeddings, such as global motion features or binarized…

Sound · Computer Science 2026-03-03 Jinting Wang , Chenxing Li , Li Liu

Synthesize human motions from music, i.e., music to dance, is appealing and attracts lots of research interests in recent years. It is challenging due to not only the requirement of realistic and complex human motions for dance, but more…

Computer Vision and Pattern Recognition · Computer Science 2020-03-12 Wenlin Zhuang , Congyi Wang , Siyu Xia , Jinxiang Chai , Yangang Wang

We introduce Efficient Motion Diffusion Model (EMDM) for fast and high-quality human motion generation. Current state-of-the-art generative diffusion models have produced impressive results but struggle to achieve fast generation without…

Computer Vision and Pattern Recognition · Computer Science 2024-11-26 Wenyang Zhou , Zhiyang Dou , Zeyu Cao , Zhouyingcheng Liao , Jingbo Wang , Wenjia Wang , Yuan Liu , Taku Komura , Wenping Wang , Lingjie Liu

Automatic choreography generation is a challenging task because it often requires an understanding of two abstract concepts - music and dance - which are realized in the two different modalities, namely audio and video, respectively. In…

Multimedia · Computer Science 2018-11-05 Juheon Lee , Seohyun Kim , Kyogu Lee
‹ Prev 1 2 3 10 Next ›