Sentiment Analysis for Multilingual Corpora
2019-08-01 · WS 2019
Svitlana Galeshchuk, Ju Qiu, Julien Jourdan
Code
- github.com/GSukr/Sentiment_Analysis_Multilingual_Corpora (official, in paper)
Abstract
The paper presents a generic approach to supervised sentiment analysis of social media content in Slavic languages. The method translates documents from the original language into English with Google's Neural Translation Model. The resulting texts are then converted to vectors by averaging the vector representations of words obtained from a pre-trained English Word2Vec model. Testing the approach with several machine learning methods on Polish, Slovenian and Croatian Twitter datasets yields up to 86% classification accuracy on out-of-sample data.
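The averaging step described in the abstract can be illustrated with a minimal sketch. It is not the authors' code: the tiny `TOY_EMBEDDINGS` dictionary is a hypothetical stand-in for a pre-trained English Word2Vec model, the whitespace tokenizer and the zero-vector fallback for texts with no known words are assumptions, and the input is assumed to be already translated into English.

```python
import numpy as np

# Hypothetical toy embeddings standing in for a pre-trained English
# Word2Vec model; real vectors would typically have hundreds of dimensions.
TOY_EMBEDDINGS = {
    "good": np.array([1.0, 0.0, 0.5]),
    "bad": np.array([-1.0, 0.0, 0.5]),
    "movie": np.array([0.0, 1.0, 0.0]),
}
DIM = 3


def average_vector(text, embeddings=TOY_EMBEDDINGS, dim=DIM):
    """Represent a document as the mean of its known word vectors."""
    # Assumption: lowercase and split on whitespace; the paper's own
    # preprocessing may differ.
    tokens = text.lower().split()
    vectors = [embeddings[t] for t in tokens if t in embeddings]
    if not vectors:
        # Assumption: fall back to a zero vector when no token is known.
        return np.zeros(dim)
    return np.mean(vectors, axis=0)


if __name__ == "__main__":
    # Already-translated (English) example tweets.
    for tweet in ["Good movie", "Bad movie", "zzz"]:
        print(tweet, average_vector(tweet))
```

The resulting fixed-length vectors could then be passed to any standard classifier, which is presumably what "several machine learning methods" refers to in the abstract.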