Deep learning and machine learning-based voice analysis for the detection of COVID-19: A proposal and comparison of architectures

IRIS

Costantini, Giovanni; Cesarini, Valerio; Robotti, Carlo; Benazzo, Marco; Pietrantonio, Filomena; Di Girolamo, Stefano; Pisani, Antonio; Canzi, Pietro; Mauramati, Simone; Bertino, Giulia; Cassaniti, Irene; Baldanti, Fausto; Saggio, Giovanni

2022-10-11

Abstract

Alongside the currently used nasal swab testing, the COVID-19 pandemic situation would gain noticeable advantages from low-cost tests that are available anytime, anywhere, at a large scale, and with real-time answers. A novel approach for COVID-19 assessment is adopted here, discriminating negative subjects from positive or recovered subjects. The aim is to identify potential discriminating features, highlight mid- and short-term effects of COVID-19 on the voice, and compare two custom algorithms. A pool of 310 subjects took part in the study; recordings were collected in a low-noise, controlled setting employing three different vocal tasks. Binary classifications followed, using two different custom algorithms. The first was based on the coupling of boosting and bagging, with an AdaBoost classifier using random forest learners. A feature selection process was employed for the training, identifying a subset of features acting as clinically relevant biomarkers. The other approach was centered on two custom CNN architectures applied to mel-spectrograms, with a custom knowledge-based data augmentation. Performances, evaluated on an independent test set, were comparable: AdaBoost and CNN differentiated COVID-19 positive from negative subjects with accuracies of 100% and 95% respectively, and recovered from negative individuals with accuracies of 86.1% and 75% respectively. This study highlights the possibility of identifying COVID-19 positive subjects, foreseeing a tool for on-site screening, while also considering recovered subjects and the effects of COVID-19 on the voice. The two proposed novel architectures allow for the identification of biomarkers and demonstrate the ongoing relevance of traditional ML versus deep learning in speech analysis. © 2022 Elsevier B.V. All rights reserved.
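
As a rough illustration of the "coupling of boosting and bagging" mentioned in the abstract, the following minimal sketch wraps a random forest inside an AdaBoost classifier using scikit-learn. It is not the authors' implementation: the synthetic data, the feature count, the hyperparameters, and the function name `boosted_forest_accuracy` are all assumptions made for illustration, and the accuracy it prints says nothing about the study's results.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import AdaBoostClassifier, RandomForestClassifier
from sklearn.metrics import accuracy_score
from sklearn.model_selection import train_test_split


def boosted_forest_accuracy(seed: int = 0) -> float:
    """Train AdaBoost over random-forest learners on synthetic data.

    Returns the accuracy on a held-out split. Hypothetical helper,
    not taken from the paper.
    """
    # Synthetic stand-in for acoustic features (NOT the study's recordings).
    X, y = make_classification(
        n_samples=310, n_features=20, n_informative=5,
        class_sep=2.0, random_state=seed,
    )
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.2, stratify=y, random_state=seed
    )
    # Bagging component: a small, shallow random forest as the base learner.
    forest = RandomForestClassifier(
        n_estimators=10, max_depth=3, random_state=seed
    )
    # Boosting component: AdaBoost reweights samples between successive forests.
    # The base learner is passed positionally because its keyword name
    # differs across scikit-learn versions.
    model = AdaBoostClassifier(forest, n_estimators=5, random_state=seed)
    model.fit(X_train, y_train)
    return float(accuracy_score(y_test, model.predict(X_test)))


if __name__ == "__main__":
    print(f"held-out accuracy: {boosted_forest_accuracy():.3f}")
```

The held-out split here only mimics the idea of an independent test set; a feature selection step such as the one described in the abstract would be fitted on the training portion only.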

Publication date: 11 Oct 2022
Publication status: Published
Article DOI: https://dx.doi.org/10.1016/j.knosys.2022.109539
Relevance: International relevance
Type: Article
Referee: Anonymous experts
Disciplinary sector of the article (valid until 24/06/2024): Sector ING-INF/01 - Electronics
Content language: English
Keywords: 1E, Vowel /e/ vocal task; 2S, Sentence vocal task; 3C, Cough vocal task; Adaboost; CFS, Correlation-based Feature Selection; CNN, Convolutional Neural Network; COVID-19; Classification; DL, Deep Learning; Deep learning; H, Healthy control subjects; ML, Machine Learning; NS, Nasal Swab; P, Positive subjects; PCR, Polymerase Chain Reaction-based molecular swabs; PvsH, Positive versus Healthy subjects comparison; R, Recovered subjects; RF, Random Forest; ROC, Receiver-Operating Curve; ReLu, Rectified Linear Unit; RvsH, Recovered versus Healthy subjects comparison; SVM, Support Vector Machine; Speech processing.
Citation: Costantini, G., Cesarini, V., Robotti, C., Benazzo, M., Pietrantonio, F., Di Girolamo, S., et al. (2022). Deep learning and machine learning-based voice analysis for the detection of COVID-19: A proposal and comparison of architectures. Knowledge-Based Systems, 253, 1-13 [10.1016/j.knosys.2022.109539].
All authors: Costantini, G; Cesarini, V; Robotti, C; Benazzo, M; Pietrantonio, F; Di Girolamo, S; Pisani, A; Canzi, P; Mauramati, S; Bertino, G; Cassaniti, I; Baldanti, ...
Category: Journal article
Appears in categories: 01 - Journal article

Files in this item:

File	Access	Type	License	Size	Format
1-s2.0-S0950705122007754-main.pdf	Authorized users only	Publisher's version (PDF)	Publisher's copyright	2.51 MB	Adobe PDF

Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.

Use this identifier to cite or link to this document: https://hdl.handle.net/2108/320885
