Challenges in Neural Language Identification: NRC at VarDial 2020

2020-12-01 · VarDial (COLING) 2020

Gabriel Bernier-Colborne, Cyril Goutte

Abstract

We describe the systems developed by the National Research Council Canada for the Uralic language identification shared task at the 2020 VarDial evaluation campaign. Although our official results were well below the baseline, we show in this paper that this was not due to the neural approach to language identification in general, but to a flaw in the function we used to sample data for training and evaluation purposes. Preliminary experiments conducted after the evaluation period suggest that our neural approach to language identification can achieve state-of-the-art results on this task, although further experimentation is required.