SOTAVerified

Experiments in Language Variety Geolocation and Dialect Identification

2020-12-01VarDial (COLING) 2020Unverified0· sign in to hype

Tommi Jauhiainen, Heidi Jauhiainen, Krister Lindén

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

In this paper we describe the systems we used when participating in the VarDial Evaluation Campaign organized as part of the 7th workshop on NLP for similar languages, varieties and dialects. The shared tasks we participated in were the second edition of the Romanian Dialect Identification (RDI) and the first edition of the Social Media Variety Geolocation (SMG). The submissions of our SUKI team used generative language models based on Naive Bayes and character n-grams.

Tasks

Reproductions