SOTAVerified

CUNI Submission to MT4All Shared Task

2022-06-01SIGUL (LREC) 2022Unverified0· sign in to hype

Ivana Kvapilíková, Ondrej Bojar

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

This paper describes our submission to the MT4All Shared Task in unsupervised machine translation from English to Ukrainian, Kazakh and Georgian in the legal domain. In addition to the standard pipeline for unsupervised training (pretraining followed by denoising and back-translation), we used supervised training on a pseudo-parallel corpus retrieved from the provided mono-lingual corpora. Our system scored significantly higher than the baseline hybrid unsupervised MT system.

Tasks

Reproductions