Towards Automatic Generation of Messages Countering Online Hate Speech and Microaggressions

2022-07-01 · NAACL (WOAH) 2022 · Code Available

Mana Ashida, Mamoru Komachi

Abstract

With the widespread use of social media, online hate is increasing, and microaggressions are receiving growing attention. We explore the potential of using pretrained language models to automatically generate messages that counter such offensive texts. Specifically, we focus on prompting to steer model generation, as it requires less data and computation than fine-tuning. We also propose a human evaluation perspective with three criteria: offensiveness, stance, and informativeness. After obtaining 306 counterspeech and 42 microintervention messages generated by GPT-2, GPT-3, and GPT-Neo, we conducted a human evaluation using Amazon Mechanical Turk. The results indicate the potential of prompting for the proposed generation task. All the generated texts, along with the annotations, are published to encourage future research on countering hate and microaggressions online.
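As a rough illustration of what prompting (as opposed to fine-tuning) can look like, the sketch below assembles a few-shot prompt from demonstration pairs and leaves the final response open for a language model to complete. The function name `build_few_shot_prompt`, the `Post:`/`Response:` template, and the placeholder texts are assumptions made for this example; they are not taken from the paper, whose actual prompt format is not given in the abstract.

```python
from typing import List, Tuple


def build_few_shot_prompt(examples: List[Tuple[str, str]], target_post: str) -> str:
    """Join (post, response) demonstrations and end with an open response slot.

    The "Post:/Response:" template is a hypothetical format chosen for
    illustration, not the template used by the authors.
    """
    blocks = [f"Post: {post}\nResponse: {response}" for post, response in examples]
    # The last block has no response, so the model is steered to write one.
    blocks.append(f"Post: {target_post}\nResponse:")
    return "\n\n".join(blocks)


# Placeholder demonstrations (hypothetical; not drawn from the paper's data).
demos = [
    ("<offensive post A>", "<counter message A>"),
    ("<offensive post B>", "<counter message B>"),
]
prompt = build_few_shot_prompt(demos, "<new offensive post>")
print(prompt)
```

The resulting string would then be passed to a pretrained model for completion; no model weights are updated, which is why this approach needs less data and computation than fine-tuning.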
