Sentence-Level Multilingual Multi-modal Embedding for Natural Language Processing

2017-09-01RANLP 2017Unverified0· sign in to hype

Iacer Calixto, Qun Liu

Unverified — Be the first to reproduce this paper.

Abstract

We propose a novel discriminative ranking model that learns embeddings from multilingual and multi-modal data, meaning that our model can take advantage of images and descriptions in multiple languages to improve embedding quality. To that end, we introduce an objective function that uses pairwise ranking adapted to the case of three or more input sources. We compare our model against different baselines, and evaluate the robustness of our embeddings on image--sentence ranking (ISR), semantic textual similarity (STS), and neural machine translation (NMT). We find that the additional multilingual signals lead to improvements on all three tasks, and we highlight that our model can be used to consistently improve the adequacy of translations generated with NMT models when re-ranking n-best lists.

Tasks

Machine Translation NMT Re-Ranking Semantic Textual Similarity Sentence STS Translation

Sentence-Level Multilingual Multi-modal Embedding for Natural Language Processing

Abstract

Tasks

Reproductions