Topic Models like Latent Dirichlet Allocation have been widely used for their robustness in estimating text models through mixtures of latent topics. Although LDA has been mostly used as a strictly lexicalized approach, it can be effectively applicable to a much richer set of linguistic structures. A novel application of LDA is here presented that acquires suitable grammatical generalizations for semantic tasks tightly dependent on NL syntax. We show how the resulting topics represent suitable generalizations over syntactic structures and lexical information as well. The evaluation on two different classification tasks, such as predicate recognition and question classification, shows that state of the art results are obtained. © 2011 Springer-Verlag Berlin Heidelberg.

Basili, R., Giannone, C., Croce, D., Domeniconi, C. (2011). Latent topic models of surface syntactic information. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (pp.225-237) [10.1007/978-3-642-23954-0_22].

Latent topic models of surface syntactic information

BASILI, ROBERTO;CROCE, DANILO;
2011-09-01

Abstract

Topic Models like Latent Dirichlet Allocation have been widely used for their robustness in estimating text models through mixtures of latent topics. Although LDA has been mostly used as a strictly lexicalized approach, it can be effectively applicable to a much richer set of linguistic structures. A novel application of LDA is here presented that acquires suitable grammatical generalizations for semantic tasks tightly dependent on NL syntax. We show how the resulting topics represent suitable generalizations over syntactic structures and lexical information as well. The evaluation on two different classification tasks, such as predicate recognition and question classification, shows that state of the art results are obtained. © 2011 Springer-Verlag Berlin Heidelberg.
12th International Conference of the Italian Association for Artificial Intelligence, AI*IA 2011
Palermo, ita
2011
University of Palermo
Rilevanza nazionale
set-2011
Settore ING-INF/05 - SISTEMI DI ELABORAZIONE DELLE INFORMAZIONI
Settore INF/01 - INFORMATICA
English
Computer Science (all); Theoretical Computer Science
Intervento a convegno
Basili, R., Giannone, C., Croce, D., Domeniconi, C. (2011). Latent topic models of surface syntactic information. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (pp.225-237) [10.1007/978-3-642-23954-0_22].
Basili, R; Giannone, C; Croce, D; Domeniconi, C
File in questo prodotto:
File Dimensione Formato  
paper#0054.pdf

solo utenti autorizzati

Licenza: Copyright dell'editore
Dimensione 293.26 kB
Formato Adobe PDF
293.26 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2108/124160
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? 0
social impact