Pathological voice signals, especially when affected by artifacts such as hoarseness or wetness, can be hard to analyze with traditional transform-based methods. Thus, we employed 3D phase-space attractors stemming from non-linear chaos theory, to build a pipeline for Machine Learning (ML) based classification of dysphonic versus healthy voices. Spatial and analytical features were extracted from the attractors and paired with features related to the F0 of the signal, then two feature selection methodologies were employed and fed four different ML classifiers. Results show how even a small amount of features bring promising classification accuracies, in turn confirming the potential of attractors as descriptors of pathological voices. The best-performing model is a linear SVM fed with 10 features selected using Kruskal Wallis' test, and attractors related to pathological voices are more erratic, less regular, less prone to being watertight, and with higher volume and face area.

Cesarini, V., Magliocchetti, M., Calicchia, D., Amato, F., Suppa, A., Asci, F., et al. (2024). Features of 3-D phase space attractors as descriptors of chaotic laws in voice signals for the AI-based detection of dysphonia. In 2024 IEEE International Workshop on Metrology for Industry 4.0 and IoT: proceedings (pp.435-439). New York : IEEE [10.1109/MetroInd4.0IoT61288.2024.10584185].

Features of 3-D phase space attractors as descriptors of chaotic laws in voice signals for the AI-based detection of dysphonia

Cesarini V.;Amato F.;Saggio G.;Costantini G.
2024-01-01

Abstract

Pathological voice signals, especially when affected by artifacts such as hoarseness or wetness, can be hard to analyze with traditional transform-based methods. Thus, we employed 3D phase-space attractors stemming from non-linear chaos theory, to build a pipeline for Machine Learning (ML) based classification of dysphonic versus healthy voices. Spatial and analytical features were extracted from the attractors and paired with features related to the F0 of the signal, then two feature selection methodologies were employed and fed four different ML classifiers. Results show how even a small amount of features bring promising classification accuracies, in turn confirming the potential of attractors as descriptors of pathological voices. The best-performing model is a linear SVM fed with 10 features selected using Kruskal Wallis' test, and attractors related to pathological voices are more erratic, less regular, less prone to being watertight, and with higher volume and face area.
7th IEEE International Workshop on Metrology for Industry 4.0 and IoT, MetroInd4.0 and IoT 2024
Florence, Italy
2024
7
IEEE Italy Section Affinity Group of Women in Engineering
Rilevanza internazionale
2024
Settore IIET-01/A - Elettrotecnica
English
AI
Attractors
Chaos
Dysphonia
Voice
Intervento a convegno
Cesarini, V., Magliocchetti, M., Calicchia, D., Amato, F., Suppa, A., Asci, F., et al. (2024). Features of 3-D phase space attractors as descriptors of chaotic laws in voice signals for the AI-based detection of dysphonia. In 2024 IEEE International Workshop on Metrology for Industry 4.0 and IoT: proceedings (pp.435-439). New York : IEEE [10.1109/MetroInd4.0IoT61288.2024.10584185].
Cesarini, V; Magliocchetti, M; Calicchia, D; Amato, F; Suppa, A; Asci, F; Marsili, L; Saggio, G; Costantini, G
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2108/404587
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 0
  • ???jsp.display-item.citation.isi??? ND
social impact