Application of speaker modification techniques to phonetic vocoding
- Resource Type
- Conference
- Authors
- Ribeiro, C.M.; Trancoso, I.M.
- Source
- Proceedings of the Fourth International Conference on Spoken Language Processing (ICSLP '96), vol. 1, pp. 306-309, 1996
- Subject
- Signal Processing and Analysis
Communication, Networking and Broadcast Technologies
Computing and Processing
Vocoders
Bit rate
Speech recognition
Loudspeakers
Linear predictive coding
Frequency
Bandwidth
Hidden Markov models
Statistics
Polynomials
- Language
The goal of the work described in the paper is to develop a very low bit rate vocoding scheme. The vocoder is a typical LPC vocoder whose parameters are post-processed on a phone-by-phone basis, resulting in a variable bit rate segment vocoder. Given the well-known speaker recognizability problems presented by vocoders at such low bit rates, the authors have attempted to integrate a speaker modification method based on altering the formant frequencies and bandwidths of vowel segments. This is done by transmitting, for each phone, the mean value and standard deviation of the radius and angle of the poles corresponding to formant frequencies. In the decoder stage, the phone index is used to retrieve a set of normalized values from a codebook of 'typical' phones. This set is then adapted to the speaker so as to preserve the static characteristics (mean and standard deviation), while relying on the typical phone to represent dynamic characteristics such as formant trajectories.
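
The decoder-side adaptation summarized in the abstract can be illustrated with a minimal sketch. It assumes that "normalized" means zero mean and unit standard deviation per phone, which the abstract does not state explicitly, and it handles a single pole track (radius or angle) at a time. All function names are hypothetical and are not taken from the paper. The pole-to-formant conversion uses the standard LPC relations between pole angle and frequency and between pole radius and bandwidth.

```python
import math


def pole_to_formant(radius, angle, fs):
    """Map a pole (radius, angle in radians) to formant frequency and bandwidth in Hz."""
    freq = angle * fs / (2.0 * math.pi)
    bandwidth = -math.log(radius) * fs / math.pi
    return freq, bandwidth


def mean_std(values):
    """Population mean and standard deviation of a per-frame track."""
    mean = sum(values) / len(values)
    var = sum((v - mean) ** 2 for v in values) / len(values)
    return mean, math.sqrt(var)


def normalize(values):
    """Codebook side (assumed): store a typical track with zero mean and unit std."""
    mean, std = mean_std(values)
    if std == 0.0:
        return [0.0] * len(values)
    return [(v - mean) / std for v in values]


def adapt(normalized, mean, std):
    """Decoder side: impose the transmitted speaker statistics on the typical track."""
    return [mean + std * z for z in normalized]


# Illustrative values only: a typical pole-angle track for one phone (radians).
typical_angles = [0.38, 0.40, 0.43, 0.45]
codebook_entry = normalize(typical_angles)

# Statistics the encoder would transmit for this phone (made-up numbers).
speaker_mean, speaker_std = 0.42, 0.01
adapted_angles = adapt(codebook_entry, speaker_mean, speaker_std)
```

In this sketch the adapted track keeps the shape of the typical phone's trajectory, while its mean and standard deviation match the transmitted speaker values.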