Detecting cough recordings in crowdsourced data using CNN-RNN

Roneel V. Sharan, Hao Xiong, Shlomo Berkovsky

Research output: Chapter in Book/Report/Conference proceedingConference proceeding contributionpeer-review

4 Citations (Scopus)

Abstract

The sound of cough is an important indicator of the condition of the respiratory system. Automatic cough sound evaluation can aid the diagnosis of respiratory diseases. Large crowdsourced cough sound datasets have recently been used by several groups around the world to develop cough classification models. However, not all recordings in these datasets contain cough sounds. As such, it is important to screen the recordings for the presence of cough sounds before developing cough classification models. This work proposes a method to screen crowdsourced audio recordings for cough sounds using deep learning methods. The proposed approach divides the audio recording into overlapping frames and converts each frame into a mel-spectrogram representation. A pretrained convolutional neural network for audio classification is trained to learn the spectral characteristics of cough and non-cough frames from its mel-spectrogram representation. It is combined with a recurrent neural network to learn the dependencies between the sequence of frames. The proposed method is evaluated on 400 crowdsourced audio recordings, manually annotated as cough or non-cough. An accuracy of 0.9800 (AUC of 0.9973) is achieved in classifying cough and non-cough recordings using the proposed method. The trained network is used to analyze the remaining audio recordings in the dataset, identifying only about 67% of recordings as containing usable cough sounds. This shows the need to exercise caution when using crowdsourced cough data.
Original languageEnglish
Title of host publication2022 IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI)
Place of PublicationGreece
PublisherInstitute of Electrical and Electronics Engineers (IEEE)
Number of pages4
ISBN (Electronic)9781665487917
ISBN (Print)9781665487924
DOIs
Publication statusPublished - 4 Nov 2022
Event2022 IEEE-EMBS International Conference on Biomedical and Health Informatics, BHI 2022 - Ioannina, Greece
Duration: 27 Sept 202230 Sept 2022

Conference

Conference2022 IEEE-EMBS International Conference on Biomedical and Health Informatics, BHI 2022
Country/TerritoryGreece
CityIoannina
Period27/09/2230/09/22

Keywords

  • cough sound
  • crowdsourced
  • deep learning
  • mel-spectrogram
  • respiratory diseases

Fingerprint

Dive into the research topics of 'Detecting cough recordings in crowdsourced data using CNN-RNN'. Together they form a unique fingerprint.

Cite this