Croce, D., Zelenanska, A., Basili, R. (2018). Neural Learning for Question Answering in Italian. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (pp. 389-402). Springer Verlag [10.1007/978-3-030-03840-3_29].
Neural Learning for Question Answering in Italian
Croce, Danilo; Basili, Roberto
2018-11-23
Abstract
Recent breakthroughs in deep learning have led to state-of-the-art results in several NLP tasks, such as Question Answering (QA). Nevertheless, the training requirements are not satisfied in cross-linguistic settings: datasets suitable for training question answering systems in non-English languages are often unavailable, which represents a significant barrier for most neural methods. This paper explores the possibility of acquiring a large-scale, albeit lower-quality, dataset for an open-domain factoid question answering system in Italian. The dataset consists of more than 60 thousand question-answer pairs and was used to train a system able to answer factoid questions against the Italian Wikipedia. The paper describes the dataset and the experiments, which are inspired by an equivalent study for English. The results show that the performance achievable for Italian is lower, although it is already applicable to concrete QA tasks.