Experiments in Language Variety Geolocation and Dialect Identification
2020-12-01VarDial (COLING) 2020Unverified0· sign in to hype
Tommi Jauhiainen, Heidi Jauhiainen, Krister Lindén
Unverified — Be the first to reproduce this paper.
ReproduceAbstract
In this paper we describe the systems we used when participating in the VarDial Evaluation Campaign organized as part of the 7th workshop on NLP for similar languages, varieties and dialects. The shared tasks we participated in were the second edition of the Romanian Dialect Identification (RDI) and the first edition of the Social Media Variety Geolocation (SMG). The submissions of our SUKI team used generative language models based on Naive Bayes and character n-grams.