On Usable Speech Detection by Linear Multi-Scale Decomposition for Speaker Identification

Authors: Amel Ben Slimane; Ezzedine Ben Braiek; Wajdi Ghezaiel
Source: International Journal of Electrical and Computer Engineering (IJECE). 6:2766
Subject: Discrete wavelet transform; Speaker diarisation; Voice activity detection; General Computer Science; Computer science; Speech recognition; Speech coding; Electrical and Electronic Engineering; Usable; Speech processing; Linear predictive coding; Speaker recognition
ISSN: 2088-8708

Abstract

Usable speech is a novel concept for processing co-channel speech data. The aim is to extract minimally corrupted speech segments that remain useful for various speech processing systems. In this paper, we focus on co-channel speaker identification (SID). We employ a newly proposed usable speech extraction method based on pitch information obtained from a linear multi-scale decomposition by discrete wavelet transform. The idea is to retain the speech segments in which only one pitch is detected and remove the others. The detected usable speech is used as input to a speaker identification system. The system is evaluated on co-channel speech, and the results show a significant improvement in speaker identification across various target-to-interferer ratios (TIR).
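
The selection rule described in the abstract can be illustrated with a minimal sketch. It assumes an upstream pitch detector has already produced a list of pitch estimates for each frame; the paper derives pitch from a wavelet-based multi-scale decomposition, which is not reproduced here. The function name `select_usable_frames` and the sample pitch values are hypothetical and serve only to show the "exactly one pitch" criterion.

```python
from typing import List, Sequence


def select_usable_frames(pitches_per_frame: Sequence[Sequence[float]]) -> List[int]:
    """Return the indices of frames in which exactly one pitch was detected.

    Assumption: each entry holds the pitch estimates (in Hz) that some
    upstream detector found in that frame. An empty entry means no pitch
    was found (silence or unvoiced speech).
    """
    return [i for i, pitches in enumerate(pitches_per_frame) if len(pitches) == 1]


# Hypothetical pitch estimates for five frames of co-channel speech.
frames = [
    [120.0],         # one pitch: retained as usable
    [118.0, 210.0],  # two pitches (overlapping talkers): removed
    [],              # no pitch detected: removed
    [205.0],         # one pitch: retained as usable
    [122.0, 208.0],  # two pitches: removed
]

usable = select_usable_frames(frames)
print(usable)  # [0, 3]
```

Only the retained frames would then be passed on to the speaker identification stage; how frames are defined and how pitch is estimated are left to the method described in the paper.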
