The NITS-CNLP System for the Unsupervised MT Task at WMT 2020
2020-11-01 · WMT (EMNLP) 2020
Salam Michael Singh, Thoudam Doren Singh, Sivaji Bandyopadhyay
Abstract
We describe NITS-CNLP’s submission to the WMT 2020 unsupervised machine translation shared task for German (de) to Upper Sorbian (hsb) in a constrained setting, i.e., using only the data provided by the organizers. We train our unsupervised model on monolingual data from both languages by jointly pre-training the encoder and decoder, and then fine-tune it with a back-translation loss. The final model uses the source-side (de) monolingual data and the target-side (hsb) synthetic data as pseudo-parallel data to train a pseudo-supervised system, which is tuned on the provided development set (dev set).
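The pseudo-parallel step in the abstract can be illustrated with a minimal sketch. It is not the authors' code: the function name `build_pseudo_parallel` and the `translate_de_to_hsb` callable are hypothetical, with the callable standing in for whatever unsupervised model produces the synthetic Upper Sorbian side. The abstract does not mention filtering, so dropping empty lines is also an assumption.

```python
from typing import Callable, Iterable, List, Tuple


def build_pseudo_parallel(
    de_sentences: Iterable[str],
    translate_de_to_hsb: Callable[[str], str],
) -> List[Tuple[str, str]]:
    """Pair each German sentence with its synthetic Upper Sorbian translation.

    `translate_de_to_hsb` is a hypothetical stand-in for the unsupervised
    de->hsb model; any callable mapping a string to a string works here.
    """
    pairs: List[Tuple[str, str]] = []
    for de in de_sentences:
        de = de.strip()
        if not de:
            # Assumption: skip blank source lines.
            continue
        hsb = translate_de_to_hsb(de).strip()
        if hsb:
            # Assumption: keep a pair only if the model produced output.
            pairs.append((de, hsb))
    return pairs
```

The resulting (de, synthetic hsb) pairs could then be fed to an ordinary supervised training loop, with the provided dev set used for tuning, as the abstract describes.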