Deep learning for automatic image captioning in poor training conditions

IRIS

Recent advancements in Deep Learning show that the combination of Convolutional Neural Networks and Recurrent Neural Networks enables the definition of very effective methods for the automatic captioning of images. Unfortunately, this straightforward result requires the existence of large-scale corpora and they are not available for many languages. This paper describes a simple methodology to automatically acquire a large-scale corpus of 600 thousand image/sentences pairs in Italian. At the best of our knowledge, this corpus has been used to train one of the first neural systems for the same language. The experimental evaluation over a subset of validated image/captions pairs suggests that results comparable with the English counterpart can be achieved.

Masotti, C., Croce, D., Basili, R. (2017). Deep learning for automatic image captioning in poor training conditions. In CEUR Workshop Proceedings. CEUR-WS.

Deep learning for automatic image captioning in poor training conditions

Masotti, Caterina;Croce, Danilo;Basili, Roberto

2017-12-10

Abstract

Recent advancements in Deep Learning show that the combination of Convolutional Neural Networks and Recurrent Neural Networks enables the definition of very effective methods for the automatic captioning of images. Unfortunately, this straightforward result requires the existence of large-scale corpora and they are not available for many languages. This paper describes a simple methodology to automatically acquire a large-scale corpus of 600 thousand image/sentences pairs in Italian. At the best of our knowledge, this corpus has been used to train one of the first neural systems for the same language. The experimental evaluation over a subset of validated image/captions pairs suggests that results comparable with the English counterpart can be achieved.

Scheda breve

Scheda completa

Scheda completa (DC)

	Nome del convegno
	
				4th Italian Conference on Computational Linguistics, CLiC-it 2017
			
	Luogo del convegno
	
				ita
			
	Anno del convegno
	
				2017
			
	Rilevanza del convegno
	
				Rilevanza nazionale
			
	Data di pubblicazione
	
				10-dic-2017
			
	Settore disciplinare dell'intervento (valido fino a 24/06/2024)
	
				Settore ING-INF/05 - SISTEMI DI ELABORAZIONE DELLE INFORMAZIONI
Settore INF/01 - INFORMATICA
			
	Lingua del contenuto
	
				English
			
	Parole chiave
	
				Computer Science (all)
			
	URL alternativo
	
				http://ceur-ws.org/
			
	Tipologia
	
				Intervento a convegno
			
	Citazione
	
				Masotti, C., Croce, D., Basili, R. (2017). Deep learning for automatic image captioning in poor training conditions. In CEUR Workshop Proceedings. CEUR-WS.
			
	Tutti gli autori
	
						Masotti, C; Croce, D; Basili, R
					
	Appare nelle tipologie:
	
				02 - Intervento a convegno

File in questo prodotto:

Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2108/198401

Citazioni

ND

1

ND

social impact