Current endpointing (EP) solutions are trained in a supervised framework, which does not allow the model to incorporate feedback and improve in an online setting. In addition, it is common practice to rely on a costly grid search to find the best configuration for an endpointing model. In this paper, we provide a solution for adaptive endpointing by proposing an efficient method for choosing an optimal endpointing configuration from utterance-level audio features in an online setting, while avoiding hyperparameter grid search. Our method requires no ground-truth labels and learns online solely from reward signals. Specifically, we propose a deep contextual multi-armed bandit approach that combines the representational power of neural networks with the action-exploration behavior of Thompson sampling algorithms. We compare our approach to several baselines and show that our deep bandit models reduce early-cutoff errors while maintaining low latency.
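
To illustrate the kind of policy described above, the sketch below implements a generic "neural-linear" contextual bandit with Thompson sampling: utterance-level audio features are the context, each arm is a candidate endpointing configuration, and the policy is updated online from a scalar reward. The class name `NeuralLinearTS`, the random feature projection, the simulated reward, and all hyperparameters are illustrative assumptions, not the architecture or reward definition used in this paper.

```python
# Minimal sketch (assumed, not the paper's model): a neural-linear contextual
# bandit with per-arm Bayesian linear heads and Thompson sampling exploration.
import numpy as np

class NeuralLinearTS:
    def __init__(self, ctx_dim, n_arms, hidden=32, noise_var=0.25, prior_var=1.0, seed=0):
        rng = np.random.default_rng(seed)
        # Fixed random projection as a stand-in for a learned neural representation.
        self.W = rng.normal(scale=1.0 / np.sqrt(ctx_dim), size=(ctx_dim, hidden))
        self.b = rng.normal(scale=0.1, size=hidden)
        self.noise_var = noise_var
        # Per-arm Gaussian posterior over head weights: precision A_a and vector f_a = X^T y / sigma^2.
        self.A = [np.eye(hidden) / prior_var for _ in range(n_arms)]
        self.f = [np.zeros(hidden) for _ in range(n_arms)]
        self.rng = rng

    def _phi(self, x):
        # Nonlinear feature map applied to the utterance-level context vector.
        return np.tanh(x @ self.W + self.b)

    def select_arm(self, x):
        """Thompson sampling: draw one weight vector per arm from its posterior
        and pick the arm whose sampled reward estimate is highest."""
        z = self._phi(x)
        scores = []
        for A_a, f_a in zip(self.A, self.f):
            cov = np.linalg.inv(A_a)
            mu = cov @ f_a
            w = self.rng.multivariate_normal(mu, cov)
            scores.append(z @ w)
        return int(np.argmax(scores))

    def update(self, x, arm, reward):
        """Rank-one Bayesian update of the chosen arm's posterior from the observed reward."""
        z = self._phi(x)
        self.A[arm] += np.outer(z, z) / self.noise_var
        self.f[arm] += reward * z / self.noise_var

# Toy usage: contexts stand in for utterance-level features; the reward is simulated
# and would, in practice, trade off early-cutoff errors against latency.
if __name__ == "__main__":
    bandit = NeuralLinearTS(ctx_dim=8, n_arms=4)
    rng = np.random.default_rng(1)
    for _ in range(1000):
        x = rng.normal(size=8)                  # utterance-level feature vector
        arm = bandit.select_arm(x)              # index of an endpointing configuration
        reward = -abs(arm - 3 * (x[0] > 0)) + rng.normal(scale=0.1)  # simulated feedback
        bandit.update(x, arm, reward)
```

The neural-linear construction is one common way to pair a deep feature extractor with tractable Thompson sampling: keeping the posterior only over the final linear layer makes both sampling and the online update closed-form.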