Adductor-type spasmodic dysphonia (ASD) is a task-specific speech disorder characterized by a strangled and strained voice. We have previously demonstrated that advanced voice analysis, performed with support vector machine, can objectively quantify voice impairment in dysphonic patients, also evidencing results of voice improvements due to symptomatic treatment with botulinum neurotoxin type-A injections into the vocal cords. Here, we expanded the analysis by means of three different machine learning algorithms (Support Vector Machine, Naïve Bayes and Multilayer Percept), on a cohort of 60 ASD patients, some of them also treated with botulinum neurotoxin type A therapy, and 60 age and gender-matched healthy subjects. Our analysis was based on sounds produced by speakers during the emission of /a/ and /e/ sustained vowels and a standardized sentence. As a conclusion, we report the main features with discriminatory capabilities to distinguish untreated vs. treated ASD patients vs. healthy subjects, and a comparison of the three classifiers with respect to their discriminating accuracy.
Costantini, G., Leo, P.d., Asci, F., Zarezadeh, Z., Marsili, L., Errico, V., et al. (2021). Machine learning based voice analysis in spasmodic dysphonia: an investigation of most relevant features from specific vocal tasks. In Biosignals: proceedings of the 14th International Joint Conference on Biomedical Engineering Systems and Technologies (pp.103-113). SciTePress.
Machine learning based voice analysis in spasmodic dysphonia: an investigation of most relevant features from specific vocal tasks
Costantini G.;Saggio G.
2021-01-01
Abstract
Adductor-type spasmodic dysphonia (ASD) is a task-specific speech disorder characterized by a strangled and strained voice. We have previously demonstrated that advanced voice analysis, performed with support vector machine, can objectively quantify voice impairment in dysphonic patients, also evidencing results of voice improvements due to symptomatic treatment with botulinum neurotoxin type-A injections into the vocal cords. Here, we expanded the analysis by means of three different machine learning algorithms (Support Vector Machine, Naïve Bayes and Multilayer Percept), on a cohort of 60 ASD patients, some of them also treated with botulinum neurotoxin type A therapy, and 60 age and gender-matched healthy subjects. Our analysis was based on sounds produced by speakers during the emission of /a/ and /e/ sustained vowels and a standardized sentence. As a conclusion, we report the main features with discriminatory capabilities to distinguish untreated vs. treated ASD patients vs. healthy subjects, and a comparison of the three classifiers with respect to their discriminating accuracy.File | Dimensione | Formato | |
---|---|---|---|
Machine Learning based Voice Analysis in Spasmodic Dysphonia .pdf
solo utenti autorizzati
Tipologia:
Versione Editoriale (PDF)
Licenza:
Copyright dell'editore
Dimensione
509.92 kB
Formato
Adobe PDF
|
509.92 kB | Adobe PDF | Visualizza/Apri Richiedi una copia |
I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.