Parallelizing Speaker-Attributed Speech Recognition for Meeting Browsing
- Resource Type
- Conference
- Authors
- Friedland, Gerald; Chong, Jike; Janin, Adam
- Source
- 2010 IEEE International Symposium on Multimedia (ISM), pp. 121-128, Dec. 2010
- Subject
- Communication, Networking and Broadcast Technologies
Computing and Processing
Speech
Speech recognition
Graphics processing unit
Engines
Runtime
Inference algorithms
Feature extraction
- Language
This article presents the Meeting Diarist, an application for browsing meeting recordings by speaker and keyword. The goal of the system is to enable browsing of the content with rich meta-data in a graphical user interface shortly after the end of a meeting, even when the application runs on a contemporary laptop. We therefore developed novel parallel methods for speaker diarization and multi-hypothesis speech recognition that are optimized to run on multicore and manycore architectures. This paper presents the underlying parallel speaker diarization and speech recognition realizations, a comparison of results based on NIST RT07 evaluation data, and a description of the final application.