Translation-based Supervision for Policy Generation in Simultaneous Neural Machine Translation

2021-11-01 · EMNLP 2021 · Code Available

Ashkan Alinejad, Hassan S. Shavarani, Anoop Sarkar

Abstract

In simultaneous machine translation, finding an agent whose action sequence of reads and writes maintains a high level of translation quality while minimizing the average lag in producing target tokens remains an extremely challenging problem. We propose a novel supervised learning approach for training an agent that detects the minimum number of reads required to generate each target token. During training, simultaneous translations are compared against full-sentence translations to generate oracle action sequences, which are then used to train a supervised model that generates actions at inference time. Our approach provides an alternative to current heuristic methods in simultaneous translation by introducing a new training objective that is easier to train than previous attempts at training the agent with reinforcement learning. Our experimental results show that this training method for action generation produces much higher quality translations while minimizing the average lag in simultaneous translation.
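As a rough illustration of the oracle idea described in the abstract, the sketch below derives a read/write sequence by reading source tokens until a prefix-based prediction agrees with the full-sentence translation. It is not the authors' implementation: the names `oracle_actions`, `predict_next`, and `toy_predict_next`, the exact-match comparison, and the toy alignment table are all assumptions made for this example, and a real system would query a trained translation model instead.

```python
from typing import Callable, List, Sequence

READ, WRITE = "R", "W"


def oracle_actions(
    source: Sequence[str],
    full_translation: Sequence[str],
    predict_next: Callable[[Sequence[str], Sequence[str]], str],
) -> List[str]:
    """Build a read/write sequence using the fewest reads per target token.

    Assumption: a target token may be written as soon as the prediction made
    from the source prefix matches the full-sentence translation exactly.
    """
    actions: List[str] = []
    reads = 0  # number of source tokens consumed so far (never decreases)
    for i, gold in enumerate(full_translation):
        # Keep reading until the prefix prediction agrees with the
        # full-sentence token, or the source is exhausted.
        while (
            reads < len(source)
            and predict_next(source[:reads], full_translation[:i]) != gold
        ):
            reads += 1
            actions.append(READ)
        actions.append(WRITE)
    return actions


# Hypothetical toy data: needs[i] is the index of the source token that must
# have been read before target token i can be predicted correctly.
source = ["ich", "habe", "einen", "Apfel", "gegessen"]
full_translation = ["I", "ate", "an", "apple"]
needs = [0, 4, 2, 3]


def toy_predict_next(src_prefix: Sequence[str], tgt_prefix: Sequence[str]) -> str:
    """Stand-in for a translation model queried with a partial source."""
    i = len(tgt_prefix)
    return full_translation[i] if needs[i] < len(src_prefix) else "<unk>"


actions = oracle_actions(source, full_translation, toy_predict_next)
print("".join(actions))
```

In this toy case the verb arrives last in the source, so the sketch reads to the end of the sentence before writing "ate", after which the remaining target tokens need no further reads. Sequences like this could serve as labels for a supervised action classifier, which is the role the abstract assigns to the oracle.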
