SOTAVerified

Comparing Two Basic Methods for Discriminating Between Similar Languages and Varieties

2016-12-01WS 2016Unverified0· sign in to hype

Pablo Gamallo, I{\~n}aki Alegria, Jos{\'e} Ramom Pichel, Manex Agirrezabal

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

This article describes the systems submitted by the Citius\_Ixa\_Imaxin team to the Discriminating Similar Languages Shared Task 2016. The systems are based on two different strategies: classification with ranked dictionaries and Naive Bayes classifiers. The results of the evaluation show that ranking dictionaries are more sound and stable across different domains while basic bayesian models perform reasonably well on in-domain datasets, but their performance drops when they are applied on out-of-domain texts.

Tasks

Reproductions