Findings of the Shared Task on Offensive Language Identification in Tamil, Malayalam, and Kannada

2021-04-01EACL (DravidianLangTech) 2021Unverified0· sign in to hype

Bharathi Raja Chakravarthi, Ruba Priyadharshini, Navya Jose, Anand Kumar M, Thomas Mandl, Prasanna Kumar Kumaresan, Rahul Ponnusamy, Hariharan R L, John P. McCrae, Elizabeth Sherly

arXiv PDF

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

Detecting offensive language in social media in local languages is critical for moderating user-generated content. Thus, the field of offensive language identification in under-resourced Tamil, Malayalam and Kannada languages are essential. As the user-generated content is more code-mixed and not well studied for under-resourced languages, it is imperative to create resources and conduct benchmarking studies to encourage research in under-resourced Dravidian languages. We created a shared task on offensive language detection in Dravidian languages. We summarize here the dataset for this challenge which are openly available at https://competitions.codalab.org/competitions/27654, and present an overview of the methods and the results of the competing systems.

Tasks

Benchmarking Language Identification

Findings of the Shared Task on Offensive Language Identification in Tamil, Malayalam, and Kannada

Abstract

Tasks

Reproductions