A central topic in Natural Language Processing (NLP) is the design of effective linguistic processors suitable for the target applications. Within this scenario, Convolution Kernels provide a powerful method to directly apply Machine Learning algorithms to complex structures representing linguistic information. The main topic of this work is the definition of the semantically Smoothed Partial Tree Kernel (SPTK), a generalized formulation of one of the most performant Convolution Kernels, i.e. the Tree Kernel (TK), by extending the similarity between tree structures with node similarities. The main characteristic of SPTK is its ability to measure the similarity between syntactic tree structures, which are partially similar and whose nodes can differ but are nevertheless semantically related. One of the most important outcomes is that SPTK allows for embedding external lexical information in the kernel function only through a similarity function among lexical nodes. The SPTK has been evaluated in three complex automatic Semantic Processing tasks: Question Classification in Question Answering, Verb Classification and Semantic Role Labeling. Although these tasks address different problems, state-of-the-art results have been achieved in every evaluation.

Croce, D., Basili, R., Moschitti, A. (2015). Semantic tree kernels for statistical natural language learning. In Studies in Computational Intelligence (pp. 93-113). Springer Verlag [10.1007/978-3-319-14206-7_5].

Semantic tree kernels for statistical natural language learning

CROCE, DANILO;BASILI, ROBERTO;MOSCHITTI, ALESSANDRO
2015-03-01

Abstract

A central topic in Natural Language Processing (NLP) is the design of effective linguistic processors suitable for the target applications. Within this scenario, Convolution Kernels provide a powerful method to directly apply Machine Learning algorithms to complex structures representing linguistic information. The main topic of this work is the definition of the semantically Smoothed Partial Tree Kernel (SPTK), a generalized formulation of one of the most performant Convolution Kernels, i.e. the Tree Kernel (TK), by extending the similarity between tree structures with node similarities. The main characteristic of SPTK is its ability to measure the similarity between syntactic tree structures, which are partially similar and whose nodes can differ but are nevertheless semantically related. One of the most important outcomes is that SPTK allows for embedding external lexical information in the kernel function only through a similarity function among lexical nodes. The SPTK has been evaluated in three complex automatic Semantic Processing tasks: Question Classification in Question Answering, Verb Classification and Semantic Role Labeling. Although these tasks address different problems, state-of-the-art results have been achieved in every evaluation.
mar-2015
Settore ING-INF/05 - SISTEMI DI ELABORAZIONE DELLE INFORMAZIONI
Settore INF/01 - INFORMATICA
English
Rilevanza internazionale
Articolo scientifico in atti di convegno
classification; Kernel methods; Semantic role labeling Verb; Tree kernels; Artificial Intelligence
http://www.springer.com/series/7092
Croce, D., Basili, R., Moschitti, A. (2015). Semantic tree kernels for statistical natural language learning. In Studies in Computational Intelligence (pp. 93-113). Springer Verlag [10.1007/978-3-319-14206-7_5].
Croce, D; Basili, R; Moschitti, A
Contributo in libro
File in questo prodotto:
File Dimensione Formato  
331972_1_En_5_Chapter_Author.pdf

solo utenti autorizzati

Licenza: Copyright dell'editore
Dimensione 707.9 kB
Formato Adobe PDF
707.9 kB Adobe PDF   Visualizza/Apri   Richiedi una copia

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2108/124024
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 3
  • ???jsp.display-item.citation.isi??? ND
social impact