Neural Machine Translation with Heterogeneous Topic Knowledge Embeddings

2021-11-01EMNLP 2021Code Available0· sign in to hype

Weixuan Wang, Wei Peng, Meng Zhang, Qun Liu

Code Available — Be the first to reproduce this paper.

Code

github.com/Vicky-Wil/topic-NMT
Officialpytorch★ 7

Abstract

Neural Machine Translation (NMT) has shown a strong ability to utilize local context to disambiguate the meaning of words. However, it remains a challenge for NMT to leverage broader context information like topics. In this paper, we propose heterogeneous ways of embedding topic information at the sentence level into an NMT model to improve translation performance. Specifically, the topic information can be incorporated as pre-encoder topic embedding, post-encoder topic embedding, and decoder topic embedding to increase the likelihood of selecting target words from the same topic of the source sentence. Experimental results show that NMT models with the proposed topic knowledge embedding outperform the baselines on the English -> German and English -> French translation tasks.

Tasks

Decoder Machine Translation NMT Sentence Topic Models Translation

Neural Machine Translation with Heterogeneous Topic Knowledge Embeddings

Code

Abstract

Tasks

Reproductions