
LawDNet: Enhanced Audio-Driven Lip Synthesis via Local Affine Warping Deformation

  • Junli Deng
  • Yihao Luo*
  • Xueting Yang
  • Siyou Li
  • Wei Wang
  • Jinyang Guo
  • Ping Shi
  • *Corresponding author for this work
  • Communication University of China
  • Imperial College London
  • The University of Hong Kong
  • Queen Mary University of London
  • Beijing University of Posts and Telecommunications

Research output: Chapter in Book/Report/Conference proceeding; Conference contribution; peer-reviewed

Abstract

In the domain of photorealistic talking head generation, the fidelity of audio-driven lip motion synthesis is essential for realistic virtual interactions. Existing methods face two key challenges: a lack of vivacity due to limited diversity in generated lip poses and noticeable anamorphose motions caused by poor temporal coherence. To address these issues, we propose LawDNet, a novel deep-learning architecture enhancing lip synthesis through a Local Affine Warping Deformation mechanism. This mechanism models the intricate lip movements in response to the audio input by controllable non-linear warping fields. These fields consist of local affine transformations focused on abstract keypoints within deep feature maps, offering a novel universal paradigm for feature warping in networks. Additionally, LawDNet incorporates a dual-stream discriminator for improved frame-to-frame continuity and employs face normalization techniques to handle pose and scene variations. Extensive evaluations demonstrate LawDNet's superior robustness and lip movement dynamism performance compared to previous methods.
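The abstract describes warping fields built from local affine transformations centred on keypoints in a feature map. The following is a minimal NumPy sketch of that general idea only; it is not the authors' implementation. The Gaussian blending of the per-keypoint transforms, the `sigma` parameter, the normalised [-1, 1] coordinate convention, and the function names `local_affine_warp_field` and `bilinear_sample` are all assumptions made for illustration.

```python
import numpy as np


def local_affine_warp_field(height, width, keypoints, affines, sigma=0.2):
    """Blend K local affine maps into one dense sampling grid (illustrative only).

    keypoints: (K, 2) keypoint centres as (x, y) in normalised [-1, 1] coordinates.
    affines:   (K, 2, 3) matrices [A | t] applied to offsets from each keypoint.
    Returns an (H, W, 2) grid of source (x, y) coordinates.
    """
    keypoints = np.asarray(keypoints, dtype=float)
    affines = np.asarray(affines, dtype=float)
    ys, xs = np.meshgrid(
        np.linspace(-1.0, 1.0, height), np.linspace(-1.0, 1.0, width), indexing="ij"
    )
    grid = np.stack([xs, ys], axis=-1)                       # (H, W, 2)
    diff = grid[None] - keypoints[:, None, None, :]          # (K, H, W, 2)

    # Assumed weighting: each affine map acts mostly near its own keypoint.
    weights = np.exp(-np.sum(diff ** 2, axis=-1) / (2.0 * sigma ** 2))  # (K, H, W)

    # Apply A to the offset, add t, then move back to absolute coordinates.
    linear = np.einsum("kij,khwj->khwi", affines[:, :, :2], diff)
    warped = linear + affines[:, None, None, :, 2] + keypoints[:, None, None, :]

    # Sum the weighted displacements; far from every keypoint the grid is unchanged.
    displacement = warped - grid[None]
    return grid + np.sum(weights[..., None] * displacement, axis=0)


def bilinear_sample(feature, grid):
    """Sample a (C, H, W) feature map at (H, W, 2) grid positions, clamping at borders."""
    _, h, w = feature.shape
    x = np.clip((grid[..., 0] + 1.0) * 0.5 * (w - 1), 0, w - 1)
    y = np.clip((grid[..., 1] + 1.0) * 0.5 * (h - 1), 0, h - 1)
    x0 = np.floor(x).astype(int)
    y0 = np.floor(y).astype(int)
    x1 = np.minimum(x0 + 1, w - 1)
    y1 = np.minimum(y0 + 1, h - 1)
    fx, fy = x - x0, y - y0
    top = feature[:, y0, x0] * (1 - fx) + feature[:, y0, x1] * fx
    bottom = feature[:, y1, x0] * (1 - fx) + feature[:, y1, x1] * fx
    return top * (1 - fy) + bottom * fy


# Usage: one keypoint at the centre whose affine map shifts sampling along x.
feature = np.random.default_rng(0).normal(size=(4, 5, 5))
shift = np.array([[[1.0, 0.0, 0.1], [0.0, 1.0, 0.0]]])
field = local_affine_warp_field(5, 5, [[0.0, 0.0]], shift)
warped_feature = bilinear_sample(feature, field)
```

In this sketch the deformation is non-linear overall even though each component is affine, because the blending weights vary across the map. In a learned setting the keypoints and affine parameters would presumably be predicted from the audio and visual features rather than fixed by hand.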

Original language: English
Title of host publication: 2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025 - Proceedings
Editors: Bhaskar D Rao, Isabel Trancoso, Gaurav Sharma, Neelesh B. Mehta
Publisher: Institute of Electrical and Electronics Engineers Inc.
ISBN (Electronic): 9798350368741
DOI
Publication status: Published - 2025
Event: 2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025 - Hyderabad, India
Duration: 6 Apr 2025 to 11 Apr 2025

Publication series

Name: ICASSP, IEEE International Conference on Acoustics, Speech and Signal Processing - Proceedings
ISSN (Print): 1520-6149

Conference

Conference: 2025 IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2025
Country/Territory: India
City: Hyderabad
Period: 6/04/25 to 11/04/25
