SOTAVerified

LIUM-MIRACL Participation in the MADAR Arabic Dialect Identification Shared Task

2019-08-01WS 2019Unverified0· sign in to hype

Sam{\'e}h Kchaou, Fethi Bougares, Lamia Hadrich-Belguith

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

This paper describes the joint participation of the LIUM and MIRACL Laboratories at the Arabic dialect identification challenge of the MADAR Shared Task (Bouamor et al., 2019) conducted during the Fourth Arabic Natural Language Processing Workshop (WANLP 2019). We participated to the Travel Domain Dialect Identification subtask. We built several systems and explored different techniques including conventional machine learning methods and deep learning algorithms. Deep learning approaches did not perform well on this task. We experimented several classification systems and we were able to identify the dialect of an input sentence with an F1-score of 65.41\% on the official test set using only the training data supplied by the shared task organizers.

Tasks

Reproductions