SOTAVerified

An Interactive Exploratory Tool for the Task of Hate Speech Detection

2022-07-01NAACL (HCINLP) 2022Unverified0· sign in to hype

Angelina McMillan-Major, Amandalynne Paullada, Yacine Jernite

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

With the growth of Automatic Content Moderation (ACM) on widely used social media platforms, transparency into the design of moderation technology and policy is necessary for online communities to advocate for themselves when harms occur.In this work, we describe a suite of interactive modules to support the exploration of various aspects of this technology, and particularly of those components that rely on English models and datasets for hate speech detection, a subtask within ACM. We intend for this demo to support the various stakeholders of ACM in investigating the definitions and decisions that underpin current technologies such that those with technical knowledge and those with contextual knowledge may both better understand existing systems.

Tasks

Reproductions