Demographic Word Embeddings for Racism Detection on Twitter

2017-11-01IJCNLP 2017Unverified0· sign in to hype

Mohammed Hasanuzzaman, Ga{\"e}l Dias, Andy Way

Unverified — Be the first to reproduce this paper.

Abstract

Most social media platforms grant users freedom of speech by allowing them to freely express their thoughts, beliefs, and opinions. Although this represents incredible and unique communication opportunities, it also presents important challenges. Online racism is such an example. In this study, we present a supervised learning strategy to detect racist language on Twitter based on word embedding that incorporate demographic (Age, Gender, and Location) information. Our methodology achieves reasonable classification accuracy over a gold standard dataset (F1=76.3\%) and significantly improves over the classification performance of demographic-agnostic models.

Tasks

Classification General Classification Word Embeddings

Demographic Word Embeddings for Racism Detection on Twitter

Abstract

Tasks

Reproductions