SOTAVerified

Comparison between NMT and PBSMT Performance for Translating Noisy User-Generated Content

2019-09-01WS (NoDaLiDa) 2019Unverified0· sign in to hype

José Carlos Rosales Núñez, Djamé Seddah, Guillaume Wisniewski

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

This work compares the performances achieved by Phrase-Based Statistical Machine Translation systems (PB-SMT) and attention-based Neuronal Machine Translation systems (NMT) when translating User Generated Content (UGC), as encountered in social medias, from French to English. We show that, contrary to what could be expected, PBSMT outperforms NMT when translating non-canonical inputs. Our error analysis uncovers the specificities of UGC that are problematic for sequential NMT architectures and suggests new avenue for improving NMT models.

Tasks

Reproductions