Multilingual Cross-domain Perspectives on Online Hate Speech

2018-09-11

Tom De Smedt, Sylvia Jaki, Eduan Kotzé, Leïla Saoud, Maja Gwóźdź, Guy De Pauw, Walter Daelemans


Abstract

In this report, we present a study of eight corpora of online hate speech and demonstrate the NLP techniques that we used to collect and analyze the jihadist, extremist, racist, and sexist content. Analysis of the multilingual corpora shows that the different contexts share certain characteristics in their hateful rhetoric. To expose the main features, we have focused on text classification, text profiling, and keyword and collocation extraction, along with manual annotation and qualitative study.

Tasks

General Classification, Text Classification
