3D Audio Signal Processing Systems for Speech Enhancement and Sound Localization and Detection
- Resource Type
- Conference
- Authors
- Bai, Jisheng; Huang, Siwei; Yin, Han; Jia, Yafei; Wang, Mou; Chen, Jianfeng
- Source
- ICASSP 2023 - 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) Acoustics, Speech and Signal Processing (ICASSP), ICASSP 2023 - 2023 IEEE International Conference on. :1-2 Jun, 2023
- Subject
- Bioengineering
Communication, Networking and Broadcast Technologies
Computing and Processing
Signal Processing and Analysis
Location awareness
Three-dimensional displays
Signal processing
Speech enhancement
Acoustics
3D audio
Task analysis
L3DAS23
speech enhancement
sound localization and detection
deep learning
- Language
- ISSN
- 2379-190X
The L3DAS23 of ICASSP Signal Processing Grand Challenge encourages research on 3D audio signal processing, such as 3D speech enhancement (SE) and 3D sound localization and detection (SELD). In this paper, we propose a two-stage system based on DPRNN and UNet for the SE task and a Conformer-based system for the SELD task. The proposed SE and SELD systems are evaluated on the L3DAS23 blind test sets. Results show that the proposed methods achieve state-of-the-art performance for 3D SE and SELD.