Environmental sound classification of western black-crowned gibbon habitat based on spectral subtraction and VGG16
- Resource Type
- Conference
- Authors
- Zhou, Xiaotao; Hu, Kunrong; Guan, Zhenhua
- Source
- 2022 IEEE 5th Advanced Information Management, Communicates, Electronic and Automation Control Conference (IMCEC) Advanced Information Management, Communicates, Electronic and Automation Control Conference (IMCEC), 2022 IEEE 5th. 5:578-582 Dec, 2022
- Subject
- Communication, Networking and Broadcast Technologies
Computing and Processing
Engineering Profession
Robotics and Control Systems
Training
Biological system modeling
Noise reduction
Interference
Feature extraction
Data models
Stability analysis
western black-crowned gibbon
ambient tones
spectral subtraction
mel's inverse spectral coefficient
VGG16
- Language
- ISSN
- 2693-2776
Deep learning-based approaches have been widely used for environmental sound classification tasks in recent years due to the growth and development of this field, creating the technical groundwork for the research of environmental sound classification. However, the presence of noise in the ecological environment leads to a dramatic decrease in its recognition rate. The major goal of this study is to improve the classification accuracy of the model in the presence of noise interference. It does this by proposing an ecological environmental sound classification approach in a noisy background. The input audio is first noise-reduced by spectral subtraction, and then the Mel frequency cepstral parameter features are extracted and fed into the fine-tuned VGG16 model for classification training. The classification of five environmental sounds in the western black-crowned gibbon habitat of the Mourning Mountain was identified with an accuracy of 98.76%.