Deep learning models achieved remarkable results in Computer Vision, Speech recognition, Natural Language Processing and Information Retrieval. In this work, we extend a Convolutional Neural Networks (CNNs) for the Sentiment Analysis in Twitter task, as this architecture achieved state-of-the-art results in [3, 4]. In particular, this architecture has been shown effective when a proper pre-training step is adopted to perform the early estimation of the network parameters: in [4] it is suggested to generate pre-training data starting from a randomic selection of Twitter messages annotated with simple heuristics, e.g. the presence of specific emoticons in messages. We improve the quality of such CNN architecture in two ways. First, we propose to adopt a contextual model [5] to select pre-training material from the conversations to which training messages appear, as opposed to an arbitrary selection of messages. In this way, we aim at selecting pre-training messages that could better reflect the topics of the targeted data. Second, we promote the adoption of a multi-channel schema [3] for representing the input data to the CNN. A first channel is used to accommodate lexical information provided by Distributional Models of Lexical Semantics, i.e. a vector representation of words provided by a Word Embedding. The second channel is adopted to represent sentiment oriented information as it is provided by a polarity lexicon. In particular, the sentiment oriented vectors adopted in this study refer to the automatically acquired Distributional Polarity Lexicons, as proposed in [1]. The experimental evaluation shows that the proposed solutions are beneficial w.r.t the targeted task in two languages, i.e. English and Italian. The full version of this paper is provided in [2] and it is available in the SocialNLP@IJCAI 2016 proceedings.

Croce, D., Castellucci, G., Basili, R. (2016). Injecting sentiment information in context-aware Convolutional Neural Networks. In CEUR Workshop Proceedings. CEUR-WS.

Injecting sentiment information in context-aware Convolutional Neural Networks

CROCE, DANILO;CASTELLUCCI, GIUSEPPE;BASILI, ROBERTO
2016-01-01

Abstract

Deep learning models achieved remarkable results in Computer Vision, Speech recognition, Natural Language Processing and Information Retrieval. In this work, we extend a Convolutional Neural Networks (CNNs) for the Sentiment Analysis in Twitter task, as this architecture achieved state-of-the-art results in [3, 4]. In particular, this architecture has been shown effective when a proper pre-training step is adopted to perform the early estimation of the network parameters: in [4] it is suggested to generate pre-training data starting from a randomic selection of Twitter messages annotated with simple heuristics, e.g. the presence of specific emoticons in messages. We improve the quality of such CNN architecture in two ways. First, we propose to adopt a contextual model [5] to select pre-training material from the conversations to which training messages appear, as opposed to an arbitrary selection of messages. In this way, we aim at selecting pre-training messages that could better reflect the topics of the targeted data. Second, we promote the adoption of a multi-channel schema [3] for representing the input data to the CNN. A first channel is used to accommodate lexical information provided by Distributional Models of Lexical Semantics, i.e. a vector representation of words provided by a Word Embedding. The second channel is adopted to represent sentiment oriented information as it is provided by a polarity lexicon. In particular, the sentiment oriented vectors adopted in this study refer to the automatically acquired Distributional Polarity Lexicons, as proposed in [1]. The experimental evaluation shows that the proposed solutions are beneficial w.r.t the targeted task in two languages, i.e. English and Italian. The full version of this paper is provided in [2] and it is available in the SocialNLP@IJCAI 2016 proceedings.
7th Italian Information Retrieval Workshop, IIR 2016
ita
2016
Rilevanza nazionale
1-gen-2016
Settore ING-INF/05 - SISTEMI DI ELABORAZIONE DELLE INFORMAZIONI
Settore INF/01 - INFORMATICA
English
Computer Science (all)
http://ceur-ws.org/
Intervento a convegno
Croce, D., Castellucci, G., Basili, R. (2016). Injecting sentiment information in context-aware Convolutional Neural Networks. In CEUR Workshop Proceedings. CEUR-WS.
Croce, D; Castellucci, G; Basili, R
File in questo prodotto:
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/2108/189339
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 2
  • ???jsp.display-item.citation.isi??? ND
social impact