跳到主要导航 跳到搜索 跳到主要内容

DiffDance: Cascaded Human Motion Diffusion Model for Dance Generation

  • Qiaosong Qi
  • , Le Zhuo
  • , Aixi Zhang*
  • , Yue Liao
  • , Fei Fang
  • , Si Liu
  • , Shuicheng Yan
  • *此作品的通讯作者
  • Alibaba Group Holding Ltd.
  • Beihang University
  • Skyworks

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

When hearing music, it is natural for people to dance to its rhythm. Automatic dance generation, however, is a challenging task due to the physical constraints of human motion and rhythmic alignment with target music. Conventional autoregressive methods introduce compounding errors during sampling and struggle to capture the long-term structure of dance sequences. To address these limitations, we present a novel cascaded motion diffusion model, DiffDance, designed for high-resolution, long-form dance generation. This model comprises a music-to-dance diffusion model and a sequence super-resolution diffusion model. To bridge the gap between music and motion for conditional generation, DiffDance employs a pretrained audio representation learning model to extract music embeddings and further align its embedding space to motion via contrastive loss. During training our cascaded diffusion model, we also incorporate multiple geometric losses to constrain the model outputs to be physically plausible and add a dynamic loss weight that adaptively changes over diffusion timesteps to facilitate sample diversity. Through comprehensive experiments performed on the benchmark dataset AIST++, we demonstrate that DiffDance is capable of generating realistic dance sequences that align effectively with the input music. These results are comparable to those achieved by state-of-the-art autoregressive methods.

源语言英语
主期刊名MM 2023 - Proceedings of the 31st ACM International Conference on Multimedia
出版商Association for Computing Machinery, Inc
1374-1382
页数9
ISBN(电子版)9798400701085
DOI
出版状态已出版 - 27 10月 2023
活动31st ACM International Conference on Multimedia, MM 2023 - Ottawa, 加拿大
期限: 29 10月 20233 11月 2023

出版系列

姓名MM 2023 - Proceedings of the 31st ACM International Conference on Multimedia

会议

会议31st ACM International Conference on Multimedia, MM 2023
国家/地区加拿大
Ottawa
时期29/10/233/11/23

指纹

探究 'DiffDance: Cascaded Human Motion Diffusion Model for Dance Generation' 的科研主题。它们共同构成独一无二的指纹。

引用此