Automatic diagnosis of COVID-19 related respiratory diseases from speech

Kushan Shekhar; Nagaratna B Chittaragi; Shashidhar G Koolagudi

doi:10.1007/s11042-023-14923-y

Automatic diagnosis of COVID-19 related respiratory diseases from speech

Multimed Tools Appl. 2023 Mar 29:1-16. doi: 10.1007/s11042-023-14923-y. Online ahead of print.

Authors

Kushan Shekhar^#¹, Nagaratna B Chittaragi^#², Shashidhar G Koolagudi^#¹

Affiliations

¹ Department of CSE, National Institute of Technology Karnataka, Surathkal, Mangalore, Karnataka India.
² Department of ISE, Siddaganga Institute of Technology, B H Road, Tumakuru, Karnataka India.

^# Contributed equally.

Abstract

In this work, an attempt is made to propose an intelligent and automatic system to recognize COVID-19 related illnesses from mere speech samples by using automatic speech processing techniques. We used a standard crowd-sourced dataset which was collected by the University of Cambridge through a web based application and an android/iPhone app. We worked on cough and breath datasets individually, and also with a combination of both the datasets. We trained the datasets on two sets of features, one consisting of only standard audio features such as spectral and prosodic features and one combining excitation source features with standard audio features extracted, and trained our model on shallow classifiers such as ensemble classifiers and SVM classification methods. Our model has shown better performance on both breath and cough datasets, but the best results in each of the cases was obtained through different combinations of features and classifiers. We got our best result when we used only standard audio features, and combined both cough and breath data. In this case, we achieved an accuracy of 84% and an Area Under Curve (AUC) score of 84%. Intelligent systems have already started to make a mark in medical diagnosis, and this type of study can help better the health system by providing much needed assistance to the health workers.

Keywords: Breath; COVID-19; Cough; Excitation source features; Spectral features; Speech-based COVID analysis; Support vector machines.

© The Author(s), under exclusive licence to Springer Science+Business Media, LLC, part of Springer Nature 2023, Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.