SOTAVerified

Using CollGram to Compare Formulaic Language in Human and Neural Machine Translation

2021-07-08Unverified0· sign in to hype

Yves Bestgen

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

A comparison of formulaic sequences in human and neural machine translation of quality newspaper articles shows that neural machine translations contain less lower-frequency, but strongly-associated formulaic sequences, and more high-frequency formulaic sequences. These differences were statistically significant and the effect sizes were almost always medium or large. These observations can be related to the differences between second language learners of various levels and between translated and untranslated texts. The comparison between the neural machine translation systems indicates that some systems produce more formulaic sequences of both types than other systems.

Tasks

Reproductions