Cross-Lingual Sentiment Quantification
Andrea Esuli, Alejandro Moreo, Fabrizio Sebastiani
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/HLT-ISTI/cl-quantOfficialIn papernone★ 0
- github.com/AlexMoreo/pydciOfficialIn papernone★ 0
- github.com/AlexMoreo/cl-quantOfficialIn papernone★ 0
Abstract
Sentiment Quantification (i.e., the task of estimating the relative frequency of sentiment-related classes -- such as Positive and Negative -- in a set of unlabelled documents) is an important topic in sentiment analysis, as the study of sentiment-related quantities and trends across a population is often of higher interest than the analysis of individual instances. In this work we propose a method for Cross-Lingual Sentiment Quantification, the task of performing sentiment quantification when training documents are available for a source language S but not for the target language T for which sentiment quantification needs to be performed. Cross-lingual sentiment quantification (and cross-lingual text quantification in general) has never been discussed before in the literature; we establish baseline results for the binary case by combining state-of-the-art quantification methods with methods capable of generating cross-lingual vectorial representations of the source and target documents involved. We present experimental results obtained on publicly available datasets for cross-lingual sentiment classification; the results show that the presented methods can perform cross-lingual sentiment quantification with a surprising level of accuracy.