A robust voice activity detector based on weibull and Gaussian mixture distribution

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

In this paper, we focus on the observation and state duration distributions in hidden semi-Markov model (HSMM)-based voice activity detection. To perform robustly in noisy environment, firstly, acoustic features of noisy speech are extracted by Mel-frequency cepstrum processor after filtering the raw speech with a modified Wiener filter. According to the statistic on TIMIT database, we use Gaussian Mixture distributions (GMD) for both speech and non-speech state to correlate the MFCC feature vectors and state sequences. The transition probability in HSMM is not a constant like in HMM but depends on the elapsed time in last state, and is modeled by Weibull distribution (WD) in this paper. The final VAD decision is made according to the likelihood ratio test (LRT) incorporating state prior knowledge. Also a adaptive threshold is used to achieve better detection results. Experiments on noisy speech data show that the proposed method performs more robustly and accurately than the standard ITU-T G.729B, AMR2, HMM-based VAD and VAD using Laplacian-Gaussian model.

Original languageEnglish
Title of host publicationICSPS 2010 - Proceedings of the 2010 2nd International Conference on Signal Processing Systems
PagesV226-V230
DOIs
StatePublished - 2010
Event2010 2nd International Conference on Signal Processing Systems, ICSPS 2010 - Dalian, China
Duration: 5 Jul 20107 Jul 2010

Publication series

NameICSPS 2010 - Proceedings of the 2010 2nd International Conference on Signal Processing Systems
Volume2

Conference

Conference2010 2nd International Conference on Signal Processing Systems, ICSPS 2010
Country/TerritoryChina
CityDalian
Period5/07/107/07/10

Keywords

  • Gaussian Mixture Distribution
  • Voice activity detection
  • Weibull distribution

Fingerprint

Dive into the research topics of 'A robust voice activity detector based on weibull and Gaussian mixture distribution'. Together they form a unique fingerprint.

Cite this