Language Identification and Morphosyntactic Tagging: The Second VarDial Evaluation Campaign

2018-08-01 · COLING 2018

Marcos Zampieri, Shervin Malmasi, Preslav Nakov, Ahmed Ali, Suwon Shon, James Glass, Yves Scherrer, Tanja Samardžić, Nikola Ljubešić, Jörg Tiedemann, Chris van der Lee, Stefan Grondelaers, Nelleke Oostdijk, Dirk Speelman, Antal Van den Bosch, Ritesh Kumar, Bornini Lahiri, Mayank Jain

Abstract

We present the results and findings of the Second VarDial Evaluation Campaign on Natural Language Processing (NLP) for Similar Languages, Varieties and Dialects. The campaign was organized as part of the fifth edition of the VarDial workshop, co-located with COLING 2018. This year, the campaign comprised five shared tasks: two re-runs (Arabic Dialect Identification (ADI) and German Dialect Identification (GDI)) and three new tasks (Morphosyntactic Tagging of Tweets (MTT), Discriminating between Dutch and Flemish in Subtitles (DFS), and Indo-Aryan Language Identification (ILI)). A total of 24 teams submitted runs across the five shared tasks and contributed 22 system description papers, which were included in the VarDial workshop proceedings and are referred to in this report.

Tasks

Dependency Parsing, Dialect Identification, Language Identification
