학술논문

Home

자료검색

학술논문

검색결과 돌아가기

검색화면

내보내기 프린트

A Self-adapting GMM based Voice Activity Detection

Resource Type: Conference
Authors: Wu, Xiukun; Zhu, Mengyao; Wu, Renjie; Zhu, Xiaoqiang
Source: 2018 IEEE 23rd International Conference on Digital Signal Processing (DSP) Digital Signal Processing (DSP), 2018 IEEE 23rd International Conference on. :1-5 Nov, 2018
Subject: Bioengineering
Communication, Networking and Broadcast Technologies
Computing and Processing
Fields, Waves and Electromagnetics
Signal Processing and Analysis
Noise measurement
Signal to noise ratio
Acoustics
Voice activity detection
Feature extraction
Smoothing methods
Additive noise
VAD
far-field GMM
adaptive threshold
SHR
NHR
Language
ISSN: 2165-3577

Online Access

Full Text (IEEE)

초록

Voice activity detection (VAD) is a very challenging problem in adverse acoustic environments (e.g. far-field and conditions with different types of noise). In this paper, we proposed a Gaussian mixture model (GMM) for log-energy distribution of noise and (noisy) speech, where the distribution of these two components can be self-adapting in non-stationary circumstances. An adaptive threshold based on the GMM parameters of these two components represents a reasonable bound between noise and speech, which can lead to an accurate VAD in various noise conditions. To further improve speech hit rate (SHR) and non-speech hit rate (NHR), some constraints are introduced to this proposed GMM for reliability. Experimental results demonstrate that the proposed method yields remarkable performance for SHR and NHR.

공지

DAU Library

학술논문

요약정보

A Self-adapting GMM based Voice Activity Detection

Online Access

초록