Machine learning- and statistical-based voice analysis of Parkinson's disease patients: a survey

IRIS

The preliminary diagnosis and evaluation of the presence and/or severity of Parkinson's disease is crucial in controlling the progress of the disease. Real-time, non-invasive methodologies based on machine learning-enhanced voice analysis are gathering more interest as the potential of this field unveils. Specifically, acoustic features are employed in many machine learning techniques, and could also function as indicators of the overall state of the subjects' voice: this review aims at identifying the most widely employed and promising feature-based machine learning methodologies, evidencing baselines and state-of-the-art solutions. A total of 102 works plus 5 review articles were selected from the IEEE Xplore, PubMed, Elsevier, and Web of Science electronic databases. A statistical assessment is performed identifying the most frequently used features as well as those deemed as most effective; an overview of algorithms, public datasets, toolboxes, and general metadata is also performed. According to our results, Jitter, Shimmer, Harmonic-to-Noise Ratio, Fundamental Frequency, and Mel Frequency Cepstral Coefficients are the mostly adopted features. In addition, it is worth noting a fair prevalence of glottal-like models and additional filtering options, such as Detrended Fluctuation Analysis.

Amato, F., Saggio, G., Cesarini, V., Olmo, G., Costantini, G. (2023). Machine learning- and statistical-based voice analysis of Parkinson's disease patients: a survey. EXPERT SYSTEMS WITH APPLICATIONS, 219 [10.1016/j.eswa.2023.119651].

Machine learning- and statistical-based voice analysis of Parkinson's disease patients: a survey

Amato, F;Saggio, G;Cesarini, V;Olmo, G;Costantini, G

2023-01-01

Abstract

The preliminary diagnosis and evaluation of the presence and/or severity of Parkinson's disease is crucial in controlling the progress of the disease. Real-time, non-invasive methodologies based on machine learning-enhanced voice analysis are gathering more interest as the potential of this field unveils. Specifically, acoustic features are employed in many machine learning techniques, and could also function as indicators of the overall state of the subjects' voice: this review aims at identifying the most widely employed and promising feature-based machine learning methodologies, evidencing baselines and state-of-the-art solutions. A total of 102 works plus 5 review articles were selected from the IEEE Xplore, PubMed, Elsevier, and Web of Science electronic databases. A statistical assessment is performed identifying the most frequently used features as well as those deemed as most effective; an overview of algorithms, public datasets, toolboxes, and general metadata is also performed. According to our results, Jitter, Shimmer, Harmonic-to-Noise Ratio, Fundamental Frequency, and Mel Frequency Cepstral Coefficients are the mostly adopted features. In addition, it is worth noting a fair prevalence of glottal-like models and additional filtering options, such as Detrended Fluctuation Analysis.

Scheda breve

Scheda completa

Scheda completa (DC)

	Data di pubblicazione
	
				2023
			
	Status di pubblicazione
	
				Pubblicato
			
	DOI dell'articolo
	
				https://dx.doi.org/10.1016/j.eswa.2023.119651
			
	Rilevanza
	
				Rilevanza internazionale
			
	Tipo
	
				Recensione
			
	Referee
	
				Esperti anonimi
			
	Settore disciplinare dell'articolo (valido fino a 24/06/2024)
	
				Settore ING-INF/01 - ELETTRONICA
			
	Lingua del contenuto
	
				English
			
	Parole chiave
	
				Parkinson's disease
Voice analysis
Machine analysis
Acoustic features
			
	Citazione
	
				Amato, F., Saggio, G., Cesarini, V., Olmo, G., Costantini, G. (2023). Machine learning- and statistical-based voice analysis of Parkinson's disease patients: a survey. EXPERT SYSTEMS WITH APPLICATIONS, 219 [10.1016/j.eswa.2023.119651].
			
	Tutti gli autori
	
						Amato, F; Saggio, G; Cesarini, V; Olmo, G; Costantini, G
					
	Tipologia
	
				Articolo su rivista
			
	Appare nelle tipologie:
	
				01 - Articolo su rivista

File in questo prodotto:

File	Dimensione	Formato
1-s2.0-S0957417423001525-main.pdf solo utenti autorizzati Tipologia: Versione Editoriale (PDF) Licenza: Copyright dell'editore Dimensione 948.88 kB Formato Adobe PDF Visualizza/Apri Richiedi una copia	948.88 kB	Adobe PDF	Visualizza/Apri Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2108/320883

Citazioni

ND

19

10

social impact