
Mandarin Electro-Laryngeal Speech Enhancement based on Statistical Voice Conversion and Manual Tone Control

  • Zhaopeng Qian
  • Haijun Niu*
  • Li Wang
  • Kazuhiro Kobayashi
  • Shaochuan Zhang
  • Tomoki Toda
  • *Corresponding author for this work
  • Beihang University
  • Nagoya University
  • Beijing Research Center of Urban Systems Engineering

Research output: Chapter in Book/Report/Conference proceeding, Conference contribution, peer-reviewed

Abstract

An electro-larynx can help laryngectomees regain the ability to speak, but electro-laryngeal (EL) speech suffers from poor intelligibility and naturalness. Recently, voice conversion (VC) has been applied to enhance EL speech with good results. However, the complicated tone variation rules of continuous Mandarin EL speech pose a new challenge for VC-based enhancement. In this paper, a novel framework combining manual tone control (MTC) and statistical VC is proposed to enhance continuous Mandarin EL speech. Two statistical VC methods, GMM-based VC and CLDNN-based VC, are implemented within the proposed framework. Objective and subjective evaluations are designed to validate the framework. The experimental results demonstrate that 1) the combination of MTC and statistical VC yields significant improvements in both the naturalness and the intelligibility of the enhanced Mandarin EL speech, 2) the word perception error rate of the enhanced Mandarin EL speech decreases from 11.35% for Mandarin EL speech with MTC to 5.61% with statistical VC, and 3) the proposed framework achieves an average tone accuracy 26.59% higher than that of the original continuous Mandarin EL speech.

Original language: English
Title of host publication: 2021 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2021 - Proceedings
Publisher: Institute of Electrical and Electronics Engineers Inc.
Pages: 546-552
Number of pages: 7
ISBN (Electronic): 9789881476890
Publication status: Published - 2021
Event: 2021 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2021 - Tokyo, Japan
Duration: 14 Dec 2021 to 17 Dec 2021

Publication series

Name: 2021 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2021 - Proceedings

Conference

Conference: 2021 Asia-Pacific Signal and Information Processing Association Annual Summit and Conference, APSIPA ASC 2021
Country/Territory: Japan
City: Tokyo
Period: 14/12/21 to 17/12/21
