Injecting sentiment information in context-aware Convolutional Neural Networks


Deep learning models have achieved remarkable results in Computer Vision, Speech Recognition, Natural Language Processing and Information Retrieval. In this work, we extend a Convolutional Neural Network (CNN) for the Sentiment Analysis in Twitter task, as this architecture achieved state-of-the-art results in [3, 4]. In particular, this architecture has been shown to be effective when a proper pre-training step is adopted to perform an early estimation of the network parameters: in [4] it is suggested to generate pre-training data starting from a random selection of Twitter messages annotated with simple heuristics, e.g. the presence of specific emoticons in messages. We improve the quality of this CNN architecture in two ways. First, we propose to adopt a contextual model [5] to select pre-training material from the conversations in which training messages appear, as opposed to an arbitrary selection of messages. In this way, we aim at selecting pre-training messages that better reflect the topics of the targeted data. Second, we promote the adoption of a multi-channel schema [3] for representing the input data to the CNN. The first channel accommodates lexical information provided by Distributional Models of Lexical Semantics, i.e. a vector representation of words provided by a Word Embedding. The second channel represents sentiment-oriented information as provided by a polarity lexicon. In particular, the sentiment-oriented vectors adopted in this study are taken from the automatically acquired Distributional Polarity Lexicons proposed in [1]. The experimental evaluation shows that the proposed solutions are beneficial for the targeted task in two languages, i.e. English and Italian. The full version of this paper is provided in [2] and is available in the SocialNLP@IJCAI 2016 proceedings.
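The emoticon-based annotation mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the emoticon sets, the function names `heuristic_label` and `build_pretraining_set`, and the choice to strip the emoticons from the text after labeling are all assumptions made for illustration.

```python
from typing import List, Optional, Tuple

# Hypothetical emoticon sets, chosen for illustration only.
POSITIVE_EMOTICONS = {":)", ":-)", ":D", "=)"}
NEGATIVE_EMOTICONS = {":(", ":-(", ":'("}
ALL_EMOTICONS = POSITIVE_EMOTICONS | NEGATIVE_EMOTICONS


def heuristic_label(message: str) -> Optional[str]:
    """Return 'positive', 'negative', or None when the heuristic cannot decide."""
    tokens = message.split()
    has_pos = any(tok in POSITIVE_EMOTICONS for tok in tokens)
    has_neg = any(tok in NEGATIVE_EMOTICONS for tok in tokens)
    if has_pos and not has_neg:
        return "positive"
    if has_neg and not has_pos:
        return "negative"
    return None  # no emoticon, or conflicting emoticons


def build_pretraining_set(messages: List[str]) -> List[Tuple[str, str]]:
    """Keep only the messages the heuristic can label.

    Assumption: the emoticons used as labels are removed from the text,
    so that a model cannot simply learn to look for them.
    """
    labeled = []
    for msg in messages:
        label = heuristic_label(msg)
        if label is None:
            continue
        text = " ".join(t for t in msg.split() if t not in ALL_EMOTICONS)
        labeled.append((text, label))
    return labeled
```

Under the contextual variant described in the abstract, the `messages` passed to `build_pretraining_set` would be drawn from the conversations that contain the training messages rather than from a random sample; how those conversations are retrieved is outside the scope of this sketch.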

Croce, D., Castellucci, G., Basili, R. (2016). Injecting sentiment information in context-aware Convolutional Neural Networks. In CEUR Workshop Proceedings. CEUR-WS.

Croce, Danilo; Castellucci, Giuseppe; Basili, Roberto

2016-01-01

Conference name: 7th Italian Information Retrieval Workshop, IIR 2016
Conference location: ita
Conference year: 2016
Conference relevance: National relevance
Publication date: 1 Jan 2016
Disciplinary sector of the contribution (valid until 24/06/2024): Sector ING-INF/05 - Information Processing Systems; Sector INF/01 - Computer Science
Content language: English
Keywords: Computer Science (all)
Alternative URL: http://ceur-ws.org/
Type: Conference contribution
All authors: Croce, D; Castellucci, G; Basili, R
Appears in types: 02 - Conference contribution


Use this identifier to cite or link to this document: https://hdl.handle.net/2108/189339
