Clustering speakers by their voices
- Resource Type
- Conference
- Authors
- Solomonoff, A.; Mielke, A.; Schmidt, M.; Gish, H.
- Source
- Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181), vol. 2, pp. 757-760, 1998
- Subject
- Signal Processing and Analysis
Components, Circuits, Devices and Systems
Labeling
TV broadcasting
Voice mail
Organizing
Speech recognition
Information retrieval
Speech processing
Radio broadcasting
Audio recording
Application software
- Language
- ISSN
- 1520-6149
The problem of clustering speakers by their voices is addressed. With the mushrooming of available speech data, from television broadcasts to voice mail, automatic systems for archive retrieval, organizing, and labeling by speaker are necessary. Clustering conversations by speaker is a solution to all three of the above tasks. Another application for speaker clustering is to group utterances together for speaker adaptation in speech recognition. Metrics based on purity and completeness of clusters are introduced. Next, our approach to speaker clustering is described, and finally, experimental results on a subset of the Switchboard corpus are presented.