We address the effective use of speaker information for automatic speech recognition (ASR) on speaker-imbalanced datasets. Joint speaker and speech recognition has recently been investigated in end-to-end (E2E) systems; however, in these systems the speaker information produced by speaker recognition (SRE) is not explicitly exploited for ASR. Inspired by speaker embeddings for ASR, we propose a direct connection from the SRE output to the ASR decoder. The E2E architecture allows the ASR loss to be backpropagated to the SRE decoder, resulting in joint optimisation. This architecture is beneficial for speaker-sparse data such as meetings and low-resource language settings, where speaker clustering is applied to compensate for underrepresented speakers. We also make a systematic comparison of the proposed method with other methods, including multi-task learning (MTL), adversarial learning (AL), and speaker attribute augmentation (SAug). We show that the use of speaker-cluster information improves both ASR and SRE, and that the proposed method outperforms the other methods, reducing the errors of the baseline model by 3.35% for ASR and 8.23% for SRE.
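To make the described connection concrete, the following is a minimal PyTorch sketch of one plausible realisation: a shared encoder, an SRE decoder whose speaker-cluster posteriors are concatenated into the ASR decoder input, and a joint loss so the ASR loss backpropagates into the SRE branch. All module choices, dimensions, and the interpolation weight `alpha` are illustrative assumptions, not the paper's actual configuration.

```python
# Sketch only: assumed shapes/modules, not the authors' implementation.
import torch
import torch.nn as nn


class JointSreAsr(nn.Module):
    def __init__(self, feat_dim=80, hidden=256, n_clusters=32, vocab=500):
        super().__init__()
        # Shared acoustic encoder (stand-in for the paper's E2E encoder).
        self.encoder = nn.LSTM(feat_dim, hidden, batch_first=True)
        # SRE decoder: predicts speaker-cluster posteriors per utterance.
        self.sre_head = nn.Linear(hidden, n_clusters)
        # ASR decoder: consumes encoder states concatenated with the
        # SRE output (the "direct connection" described above).
        self.asr_head = nn.Linear(hidden + n_clusters, vocab)

    def forward(self, feats):
        enc, _ = self.encoder(feats)                 # (B, T, H)
        spk_logits = self.sre_head(enc.mean(dim=1))  # utterance-level pooling
        spk_post = spk_logits.softmax(dim=-1)        # speaker-cluster posterior
        # Broadcast speaker posteriors over time and feed them to ASR;
        # gradients from the ASR loss flow back through sre_head.
        spk_tiled = spk_post.unsqueeze(1).expand(-1, enc.size(1), -1)
        asr_logits = self.asr_head(torch.cat([enc, spk_tiled], dim=-1))
        return asr_logits, spk_logits


model = JointSreAsr()
feats = torch.randn(4, 120, 80)                  # (batch, frames, features)
asr_logits, spk_logits = model(feats)
asr_tgt = torch.randint(0, 500, (4, 120))        # dummy per-frame targets
spk_tgt = torch.randint(0, 32, (4,))             # dummy speaker-cluster labels
ce = nn.CrossEntropyLoss()
alpha = 0.3                                      # assumed SRE-loss weight
loss = ce(asr_logits.transpose(1, 2), asr_tgt) + alpha * ce(spk_logits, spk_tgt)
loss.backward()                                  # joint optimisation of both branches
```

Because the speaker posteriors sit on the ASR decoder's input path, both loss terms update the SRE head, which is the joint-optimisation property the abstract attributes to the E2E architecture.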