
Construction of a Multilingual Corpus Annotated with Translation Relations

2018-08-01, COLING 2018

Yuming Zhai, Aurélien Max, Anne Vilnat


Abstract

Translation relations, which distinguish literal translation from other translation techniques, constitute an important subject of study for human translators (Chuquet and Paillard, 1989). However, automatic processing techniques based on interlingual relations, such as machine translation or paraphrase generation exploiting translational equivalence, have not used these relations explicitly until now. In this work, we present a categorisation of translation relations and annotate them in a parallel multilingual (English, French, Chinese) corpus of oral presentations, the TED Talks. Our long-term objective is to detect these relations automatically so that they can serve as important features when searching for monolingual segments that stand in a relation of equivalence (paraphrases) or of entailment. The annotated corpus resulting from our work will be made available to the community.
