학술논문

Home

자료검색

학술논문

검색결과 돌아가기

검색화면

내보내기 프린트

Mandarin-English Code-switching Speech Recognition with Self-supervised Speech Representation Models

Resource Type: Working Paper
Authors: Tseng, Liang-Hsuan; Fu, Yu-Kuan; Chang, Heng-Jui; Lee, Hung-yi
Source
Subject: Computer Science - Computation and Language
Electrical Engineering and Systems Science - Audio and Speech Processing
Language

Online Access

초록

Code-switching (CS) is common in daily conversations where more than one language is used within a sentence. The difficulties of CS speech recognition lie in alternating languages and the lack of transcribed data. Therefore, this paper uses the recently successful self-supervised learning (SSL) methods to leverage many unlabeled speech data without CS. We show that hidden representations of SSL models offer frame-level language identity even if the models are trained with English speech only. Jointly training CTC and language identification modules with self-supervised speech representations improves CS speech recognition performance. Furthermore, using multilingual speech data for pre-training obtains the best CS speech recognition.
Comment: Submitted to ICASSP 2022

공지

DAU Library

학술논문

요약정보

Mandarin-English Code-switching Speech Recognition with Self-supervised Speech Representation Models

Online Access

초록