BERT-LID: Leveraging BERT to Improve Spoken Language Identification
- Resource Type
- Working Paper
- Authors
- Nie, Yuting; Zhao, Junhong; Zhang, Wei-Qiang; Bai, Jinfeng
- Subject
- Computer Science - Computation and Language
- Computer Science - Sound
- Electrical Engineering and Systems Science - Audio and Speech Processing
Language identification is the task of automatically determining the identity of the language conveyed by a spoken segment. It has a profound impact on the multilingual interoperability of an intelligent speech system. Although language identification attains high accuracy on medium and long utterances (>3 s), performance on short utterances (<=1 s) is still far from satisfactory. We propose a BERT-based language identification system (BERT-LID) to improve language identification performance, especially on short-duration speech segments. We extend the original BERT model by taking the phonetic posteriorgrams (PPG) derived from the front-end phone recognizer as input, and then deploy a deep classifier on top of it for language identification. Our BERT-LID model improves the baseline accuracy by about 6.5% on long-segment identification and 19.9% on short-segment identification, demonstrating the effectiveness of BERT-LID for language identification.
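The abstract describes a pipeline in which frame-level phonetic posteriorgrams from a phone recognizer are fed into a BERT-style encoder, whose pooled output is classified into languages. The sketch below illustrates only the data flow and tensor shapes with NumPy; the phone-set size, number of languages, frame rate, and all parameter values are assumptions for illustration (the encoder stack itself is elided), not the authors' implementation.

```python
import numpy as np

# Assumed dimensions (not specified in the abstract):
N_PHONES = 40   # size of the phone recognizer's posterior vector
D_MODEL = 768   # BERT-base hidden size
N_LANGS = 4     # number of target languages

rng = np.random.default_rng(0)

# Stand-ins for learned parameters (random here; trained in the real system).
W_in = rng.standard_normal((N_PHONES, D_MODEL)) * 0.02   # PPG -> embedding projection
cls_emb = rng.standard_normal(D_MODEL) * 0.02            # [CLS]-style summary token
W_out = rng.standard_normal((D_MODEL, N_LANGS)) * 0.02   # language classifier head

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def bert_lid_forward(ppg):
    """Sketch of the BERT-LID data flow: a (T, N_PHONES) phonetic
    posteriorgram is projected into the encoder embedding space, a
    [CLS] token is prepended, and the pooled [CLS] state is mapped
    to language posteriors. The BERT encoder stack is elided."""
    assert ppg.shape[1] == N_PHONES
    tokens = ppg @ W_in                 # (T, D_MODEL) frame embeddings
    seq = np.vstack([cls_emb, tokens])  # (T+1, D_MODEL) input sequence
    # ... a stack of transformer encoder layers would transform `seq` here ...
    pooled = seq[0]                     # [CLS] state as utterance summary
    return softmax(pooled @ W_out)      # (N_LANGS,) language posteriors

# A ~1 s segment at an assumed 100 Hz frame rate -> 100 PPG frames.
ppg = softmax(rng.standard_normal((100, N_PHONES)))
probs = bert_lid_forward(ppg)
print(probs.shape)
```

Framing the PPG frames as a token sequence is what lets a BERT-style encoder model the phonotactic patterns that distinguish languages even in very short segments.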
Comment: accepted by ISCSLP 2022