SOTAVerified

BB_twtr at SemEval-2017 Task 4: Twitter Sentiment Analysis with CNNs and LSTMs

2017-04-20SEMEVAL 2017Code Available0· sign in to hype

Mathieu Cliche

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

In this paper we describe our attempt at producing a state-of-the-art Twitter sentiment classifier using Convolutional Neural Networks (CNNs) and Long Short Term Memory (LSTMs) networks. Our system leverages a large amount of unlabeled data to pre-train word embeddings. We then use a subset of the unlabeled data to fine tune the embeddings using distant supervision. The final CNNs and LSTMs are trained on the SemEval-2017 Twitter dataset where the embeddings are fined tuned again. To boost performances we ensemble several CNNs and LSTMs together. Our approach achieved first rank on all of the five English subtasks amongst 40 teams.

Tasks

Benchmark Results

DatasetModelMetricClaimedVerifiedStatus
SemEvalLSTMs+CNNs ensemble with multiple conv. opsF1-score0.69Unverified
SemEval 2017 Task 4-ALSTMs+CNNs ensemble with multiple conv. opsAverage Recall0.68Unverified

Reproductions