SOTAVerified

Computationally efficient discrimination between language varieties with large feature vectors and regularized classifiers

2018-08-01COLING 2018Unverified0· sign in to hype

Adrien Barbaresi

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

The present contribution revolves around efficient approaches to language classification which have been field-tested in the Vardial evaluation campaign. The methods used in several language identification tasks comprising different language types are presented and their results are discussed, giving insights on real-world application of regularization, linear classifiers and corresponding linguistic features. The use of a specially adapted Ridge classifier proved useful in 2 tasks out of 3. The overall approach (XAC) has slightly outperformed most of the other systems on the DFS task (Dutch and Flemish) and on the ILI task (Indo-Aryan languages), while its comparative performance was poorer in on the GDI task (Swiss German dialects).

Tasks

Reproductions