DiHuTra: a Parallel Corpus to Analyse Differences between Human Translations
2022-06-01LREC 2022Code Available0· sign in to hype
Ekaterina Lapshinova-Koltunski, Maja Popović, Maarit Koponen
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/katjakaterina/dihutraOfficialIn papernone★ 3
Abstract
This project aimed to design a corpus of parallel human translations (HTs) of the same source texts by professionals and students. The resulting corpus consists of English news and reviews source texts, their translations into Russian and Croatian, and translations of the reviews into Finnish. The corpus will be valuable for both studying variation in translation and evaluating machine translation (MT) systems.