Without Further Ado: Direct and Simultaneous Speech Translation by AppTek in 2021

2021-08-01 · ACL (IWSLT) 2021

Parnia Bahar, Patrick Wilken, Mattia A. Di Gangi, Evgeny Matusov

Abstract

This paper describes the offline and simultaneous speech translation systems developed at AppTek for IWSLT 2021. Our offline ST submission includes the direct end-to-end system and the so-called posterior tight integrated model, which is akin to the cascade system but is trained in an end-to-end fashion, where all the cascaded modules are end-to-end models themselves. For simultaneous ST, we combine hybrid automatic speech recognition with a machine translation approach whose translation policy decisions are learned from statistical word alignments. Compared to last year, we improve general quality and provide a wider range of quality/latency trade-offs, both due to a data augmentation method making the MT model robust to varying chunk sizes. Finally, we present a method for ASR output segmentation into sentences that introduces a minimal additional delay.

Tasks

Automatic Speech Recognition (ASR), Data Augmentation, Machine Translation, Speech Recognition, Translation
