Kakao Enterprise’s WMT21 Machine Translation Using Terminologies Task Submission
2021-11-01WMT (EMNLP) 2021Unverified0· sign in to hype
Yunju Bak, Jimin Sun, Jay Kim, Sungwon Lyu, Changmin Lee
Unverified — Be the first to reproduce this paper.
ReproduceAbstract
This paper describes Kakao Enterprise’s submission to the WMT21 shared Machine Translation using Terminologies task. We integrate terminology constraints by pre-training with target lemma annotations and fine-tuning with exact target annotations utilizing the given terminology dataset. This approach yields a model that achieves outstanding results in terms of both translation quality and term consistency, ranking first based on COMET in the En→Fr language direction. Furthermore, we explore various methods such as back-translation, explicitly training terminologies as additional parallel data, and in-domain data selection.