Depression affects approximately 300 million people worldwide, resulting in significant suffering and economic cost. Millions of sufferers remain undiagnosed and untreated because of a shortage of trained personnel, social stigma, and expensive treatment. This study presents and compares two novel machine learning architectures for predicting depression severity from audio recordings. The data were taken from the Distress Analysis Interview Corpus, which contains recordings of 189 participant interviews together with the participants' Patient Health Questionnaire 8 (PHQ-8) depression scores. Feature extraction and feature selection were performed on the participants' speech, and two machine learning architectures were designed to provide prediction models for depression severity. In the first architecture, participants were initially classified as depressed or not depressed, and a separate regression model was trained on each class. The second architecture sorted the data into depression severity classes, which were then used alongside the original features to predict the depression scores. The second architecture outperformed the first in both the classification and regression stages, achieving an RMSE of 4.1, a substantial improvement over previous studies that reported RMSE values of 6.32 to 6.94 on the same data. The results demonstrate the potential of a speech-based depression screening tool that could assist healthcare professionals in diagnosing and monitoring patients, and provide a scalable screening method enabling individuals to recognise their illness and seek professional help.
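The second architecture described above, in which a predicted severity class is appended to the original features before regression, can be sketched as follows. This is a minimal illustration only: the actual models, features, and severity thresholds used in the study are not specified in the abstract, so random forests, 20 placeholder acoustic features, and standard PHQ-8 cut-offs (5, 10, 15, 20) are assumed here.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier, RandomForestRegressor

# Placeholder data standing in for the 189 DAIC participants:
# 20 synthetic acoustic features and synthetic PHQ-8 scores (0-24).
rng = np.random.default_rng(0)
X = rng.normal(size=(189, 20))
y = rng.integers(0, 25, size=189)

# Coarse severity classes from the standard PHQ-8 cut-offs (assumption).
severity = np.digitize(y, [5, 10, 15, 20])

# Stage 1: classify participants into severity classes.
clf = RandomForestClassifier(random_state=0).fit(X, severity)

# Stage 2: append the predicted class to the original features,
# then regress the PHQ-8 score on the augmented feature set.
X_aug = np.column_stack([X, clf.predict(X)])
reg = RandomForestRegressor(random_state=0).fit(X_aug, y)

pred = reg.predict(X_aug)
rmse = float(np.sqrt(np.mean((pred - y) ** 2)))
```

In practice the classifier and regressor would be trained and evaluated with a proper train/test split (the toy code above fits and predicts on the same data purely to show the two-stage flow).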