跳到主要导航 跳到搜索 跳到主要内容

Noise-robust voice activity detector based on hidden semi-Markov models

  • Beihang University

科研成果: 书/报告/会议事项章节会议稿件同行评审

摘要

This paper concentrates on speech duration distributions that are usually invariant to noises and proposes a noise-robust and real-time voice activity detector (VAD) using the hidden semi-Markov model (HSMM) to explicitly model state durations. Motivated by statistical observations and tests on TIMIT and the IEEE sentence database, we use Weibull distributions to model state durations approximately and estimate their parameters by maximum likelihood estimators. The final VAD decision is made according to the likelihood ratio test (LRT) incorporating state prior knowledge and modified forward variables. An efficient way that recursively calculates modified forward variables is devised and a dynamic adjustment scheme is used to update parameters. Experiments on noisy speech data show that the proposed method performs more robustly and accurately than the standard ITU-T G.729B VAD and AMR2.

源语言英语
主期刊名Proceedings - 2010 20th International Conference on Pattern Recognition, ICPR 2010
出版商Institute of Electrical and Electronics Engineers Inc.
81-84
页数4
ISBN(印刷版)9780769541099
DOI
出版状态已出版 - 2010

出版系列

姓名Proceedings - International Conference on Pattern Recognition
ISSN(印刷版)1051-4651

指纹

探究 'Noise-robust voice activity detector based on hidden semi-Markov models' 的科研主题。它们共同构成独一无二的指纹。

引用此