SOTAVerified

Large-Scale English-Japanese Simultaneous Interpretation Corpus: Construction and Analyses with Sentence-Aligned Data

2021-08-01ACL (IWSLT) 2021Unverified0· sign in to hype

Kosuke Doi, Katsuhito Sudoh, Satoshi Nakamura

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

This paper describes the construction of a new large-scale English-Japanese Simultaneous Interpretation (SI) corpus and presents the results of its analysis. A portion of the corpus contains SI data from three interpreters with different amounts of experience. Some of the SI data were manually aligned with the source speeches at the sentence level. Their latency, quality, and word order aspects were compared among the SI data themselves as well as against offline translations. The results showed that (1) interpreters with more experience controlled the latency and quality better, and (2) large latency hurt the SI quality.

Tasks

Reproductions