Loss in Translation: Learning Bilingual Word Mapping with a Retrieval Criterion

2018-04-20EMNLP 2018Code Available0· sign in to hype

Armand Joulin, Piotr Bojanowski, Tomas Mikolov, Herve Jegou, Edouard Grave

Code Available — Be the first to reproduce this paper.

Code

github.com/Kelechukwu1/PidginUNMT
pytorch★ 0
github.com/facebookresearch/fastText/blob/master/docs/aligned-vectors.md
none★ 0
github.com/keleog/PidginUNMT
pytorch★ 0

Abstract

Continuous word representations learned separately on distinct languages can be aligned so that their words become comparable in a common space. Existing works typically solve a least-square regression problem to learn a rotation aligning a small bilingual lexicon, and use a retrieval criterion for inference. In this paper, we propose an unified formulation that directly optimizes a retrieval criterion in an end-to-end fashion. Our experiments on standard benchmarks show that our approach outperforms the state of the art on word translation, with the biggest improvements observed for distant language pairs such as English-Chinese.

Tasks

regression Retrieval Translation Word Translation

Loss in Translation: Learning Bilingual Word Mapping with a Retrieval Criterion

Code

Abstract

Tasks

Reproductions