Enhancing Machine Translation with Dependency-Aware Self-Attention
2019-09-06 · ACL 2020 · Code Available
Emanuele Bugliarello, Naoaki Okazaki
Code
- github.com/e-bug/pascal (official PyTorch implementation)
Abstract
Most neural machine translation models rely only on pairs of parallel sentences, assuming that syntactic information is automatically learned by an attention mechanism. In this work, we investigate different approaches to incorporating syntactic knowledge into the Transformer model and also propose a novel, parameter-free, dependency-aware self-attention mechanism that improves its translation quality, especially for long sentences and in low-resource scenarios. We show the efficacy of each approach on WMT English-German and English-Turkish, and WAT English-Japanese translation tasks.
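To make the abstract's central idea concrete, below is a minimal PyTorch sketch of one way a parameter-free, dependency-aware attention head could look: standard scaled dot-product scores are biased toward each token's dependency parent by a fixed-width Gaussian. This is an illustrative reading, not the paper's exact formulation; the function name `parent_scaled_attention`, the argument names, the fixed `sigma=1.0`, and the choice to apply the bias additively in log space before the softmax are all assumptions.

```python
# Hypothetical sketch of a parameter-free, dependency-aware attention head.
# Names (parent_scaled_attention, parent_pos, sigma) are illustrative and
# not taken from the paper or the repository.
import math
import torch
import torch.nn.functional as F

def parent_scaled_attention(q, k, v, parent_pos, sigma=1.0):
    """Scaled dot-product attention biased toward each token's parent.

    q, k, v:    (batch, seq_len, d_k) projected query/key/value tensors
    parent_pos: (batch, seq_len) index of each token's dependency parent,
                e.g. obtained from an external parser
    sigma:      fixed width of the Gaussian bias (no learned parameters)
    """
    d_k = q.size(-1)
    seq_len = q.size(1)

    # Standard scaled dot-product scores: (batch, seq_len, seq_len).
    scores = q @ k.transpose(-2, -1) / math.sqrt(d_k)

    # Distance of every key position s from the parent of each query token t.
    positions = torch.arange(seq_len, device=q.device).view(1, 1, seq_len)
    dist = positions - parent_pos.unsqueeze(-1).float()

    # Adding a log-Gaussian bias before the softmax multiplies the attention
    # distribution by a Gaussian centered on the parent and renormalizes,
    # without introducing any trainable weights.
    log_gauss = -dist.pow(2) / (2.0 * sigma ** 2)
    weights = F.softmax(scores + log_gauss, dim=-1)
    return weights @ v
```

Swapping such a head into a subset of encoder heads would leave the Transformer's parameter count unchanged, consistent with the abstract's "parameter-free" claim; whether the Gaussian enters additively in log space, as assumed here, or as a multiplicative scaling of the scores is a design detail settled by the paper itself.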