SOTAVerified

Kakao Enterprise’s WMT21 Machine Translation Using Terminologies Task Submission

2021-11-01WMT (EMNLP) 2021Unverified0· sign in to hype

Yunju Bak, Jimin Sun, Jay Kim, Sungwon Lyu, Changmin Lee

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

This paper describes Kakao Enterprise’s submission to the WMT21 shared Machine Translation using Terminologies task. We integrate terminology constraints by pre-training with target lemma annotations and fine-tuning with exact target annotations utilizing the given terminology dataset. This approach yields a model that achieves outstanding results in terms of both translation quality and term consistency, ranking first based on COMET in the En→Fr language direction. Furthermore, we explore various methods such as back-translation, explicitly training terminologies as additional parallel data, and in-domain data selection.

Tasks

Reproductions