autism spectrum disorder (ASD) is a heterogeneous condition, characterized by complex genetic architectures and intertwined genetic/environmental interactions. novel analysis approaches to disentangle its pathophysiology by computing large amounts of data are needed. we present an advanced machine learning technique, based on a clustering analysis on genotypical/phenotypical embedding spaces, to identify biological processes that might act as pathophysiological substrates for ASD. this technique was applied to the varicarta database, which contained 187,794 variant events retrieved from 15,189 individuals with ASD. Nine clusters of ASD-related genes were identified. the 3 largest clusters included 68.6% of all individuals, consisting of 1455 (38.0%), 841 (21.9%), and 336 (8.7%) persons, respectively. enrichment analysis was applied to isolate clinically relevant ASD-associated biological processes. two of the identified clusters were characterized by individuals with an increased presence of variants linked to biological processes and cellular components, such as axon growth and guidance, synaptic membrane components, or transmission. the study also suggested other clusters with possible genotype-phenotype associations. Innovative methodologies, including machine learning, can improve our understanding of the underlying biological processes and gene variant networks that undergo the etiology and pathogenic mechanisms of ASD. future work to ascertain the reproducibility of the presented methodology is warranted.

Di Giovanni, D., Enea, R., Di Micco, V., Benvenuto, A., Curatolo, P., Emberti Gialloreti, L. (2023). Using Machine Learning to Explore Shared Genetic Pathways and Possible Endophenotypes in Autism Spectrum Disorder. GENES, 14(2), 313 [10.3390/genes14020313].

Using Machine Learning to Explore Shared Genetic Pathways and Possible Endophenotypes in Autism Spectrum Disorder

Di Giovanni, Daniele;Benvenuto, Arianna;Curatolo, Paolo;Emberti Gialloreti, Leonardo
2023-01-25

Abstract

autism spectrum disorder (ASD) is a heterogeneous condition, characterized by complex genetic architectures and intertwined genetic/environmental interactions. novel analysis approaches to disentangle its pathophysiology by computing large amounts of data are needed. we present an advanced machine learning technique, based on a clustering analysis on genotypical/phenotypical embedding spaces, to identify biological processes that might act as pathophysiological substrates for ASD. this technique was applied to the varicarta database, which contained 187,794 variant events retrieved from 15,189 individuals with ASD. Nine clusters of ASD-related genes were identified. the 3 largest clusters included 68.6% of all individuals, consisting of 1455 (38.0%), 841 (21.9%), and 336 (8.7%) persons, respectively. enrichment analysis was applied to isolate clinically relevant ASD-associated biological processes. two of the identified clusters were characterized by individuals with an increased presence of variants linked to biological processes and cellular components, such as axon growth and guidance, synaptic membrane components, or transmission. the study also suggested other clusters with possible genotype-phenotype associations. Innovative methodologies, including machine learning, can improve our understanding of the underlying biological processes and gene variant networks that undergo the etiology and pathogenic mechanisms of ASD. future work to ascertain the reproducibility of the presented methodology is warranted.
25-gen-2023
Pubblicato
Rilevanza internazionale
Articolo
Esperti anonimi
Settore MED/01 - STATISTICA MEDICA
English
Autism spectrum disorder (ASD) cluster analysis; connectivity; gene networks; genotype–phenotype embedding; machine learning
neurite morphogenesis; neurobehavioral phenotypes ; neurotransmission; patient similarity analytics; synapses
Di Giovanni, D., Enea, R., Di Micco, V., Benvenuto, A., Curatolo, P., Emberti Gialloreti, L. (2023). Using Machine Learning to Explore Shared Genetic Pathways and Possible Endophenotypes in Autism Spectrum Disorder. GENES, 14(2), 313 [10.3390/genes14020313].
Di Giovanni, D; Enea, R; Di Micco, V; Benvenuto, A; Curatolo, P; Emberti Gialloreti, L
Articolo su rivista
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2108/329229
Citazioni
  • ???jsp.display-item.citation.pmc??? 0
  • Scopus 3
  • ???jsp.display-item.citation.isi??? 1
social impact