YerevaNN’s Systems for WMT20 Biomedical Translation Task: The Effect of Fixing Misaligned Sentence Pairs
2020-11-01 · WMT (EMNLP) 2020
Karen Hambardzumyan, Hovhannes Tamoyan, Hrant Khachatrian
Abstract
This report describes YerevaNN’s neural machine translation systems and data processing pipelines developed for the WMT20 biomedical translation task. We provide systems for the English-Russian and English-German language pairs. For the English-Russian pair, our submissions achieve the best BLEU scores, with the en→ru direction outperforming the other systems by a significant margin. We attribute most of the improvements to our heavy data preprocessing pipeline, which attempts to fix poorly aligned sentences in the parallel data.