DE-ABUSE@TamilNLP-ACL 2022: Transliteration as Data Augmentation for Abuse Detection in Tamil

2022-05-01 · DravidianLangTech (ACL) 2022

Vasanth Palanikumar, Sean Benhur, Adeep Hande, Bharathi Raja Chakravarthi

Abstract

With the rise of social media and the internet, there is a necessity to provide an inclusive space and prevent abuse directed at any gender, race, or community. This paper describes the system submitted to the ACL-2022 shared task on fine-grained abuse detection in Tamil. In our approach, we transliterated the code-mixed dataset as an augmentation technique to increase the size of the data. Using this method, we ranked 3rd on the task with a macro-average F1 score of 0.290 and a weighted F1 score of 0.590.
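The augmentation idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say which transliteration tool or direction was used, so the word table `TOY_TABLE`, the helper `toy_transliterate`, the function `augment_with_transliteration`, and the placeholder labels are all hypothetical. The sketch only shows the general pattern of adding a transliterated copy of each example, with its label unchanged, to the training set.

```python
from typing import Callable, List, Tuple

Example = Tuple[str, str]  # (text, label)

# Hypothetical toy lookup from romanized Tamil words to Tamil script.
# A real system would use a proper transliteration tool instead.
TOY_TABLE = {
    "nalla": "நல்ல",
    "padam": "படம்",
}


def toy_transliterate(text: str) -> str:
    """Replace each known romanized token with its Tamil-script form."""
    return " ".join(TOY_TABLE.get(tok.lower(), tok) for tok in text.split())


def augment_with_transliteration(
    examples: List[Example],
    transliterate: Callable[[str], str],
) -> List[Example]:
    """Return the original examples plus transliterated copies.

    A copy is added only when transliteration actually changes the text,
    so unchanged examples are not duplicated. Labels are carried over as-is.
    """
    augmented = list(examples)
    for text, label in examples:
        new_text = transliterate(text)
        if new_text != text:
            augmented.append((new_text, label))
    return augmented


# Placeholder labels; the shared task's real label set is not given here.
train = [("nalla padam", "label_a"), ("super movie", "label_b")]
augmented = augment_with_transliteration(train, toy_transliterate)
for text, label in augmented:
    print(label, text)
```

Passing the transliteration function as a parameter keeps the augmentation step independent of whichever tool performs the script conversion.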

Tasks

Abuse Detection, Data Augmentation, Transliteration
