It’s not Greek to mBERT: Inducing Word-Level Translations from Multilingual BERT
Hila Gonen, Shauli Ravfogel, Yanai Elazar, Yoav Goldberg
Code: github.com/gonenhila/mbert (official PyTorch implementation)
Abstract
Recent works have demonstrated that multilingual BERT (mBERT) learns rich cross-lingual representations that allow for transfer across languages. We study the word-level translation information embedded in mBERT and present two simple methods that expose remarkable translation capabilities with no fine-tuning. The results suggest that most of this information is encoded in a non-linear way, while some of it can also be recovered with purely linear tools. As part of our analysis, we test the hypothesis that mBERT learns representations which contain both a language-encoding component and an abstract, cross-lingual component, and we explicitly identify an empirical language-identity subspace within mBERT representations.
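To make the setup concrete, here is a minimal sketch of the nearest-neighbor idea the abstract alludes to, not the paper's actual pipeline. The word lists are toy placeholders, embedding words in isolation (rather than in sentence context) is an illustrative simplification, and subtracting per-language mean vectors is just one simple way to instantiate the hypothesized language-encoding component; only the mBERT checkpoint name is the standard HuggingFace one.

```python
# Sketch: embed isolated words with mBERT, remove a per-language mean
# vector as a crude stand-in for the language-identity component, and
# translate by cosine nearest neighbor. Toy vocabularies, not the
# paper's data or method.
import torch
from transformers import BertTokenizer, BertModel

tokenizer = BertTokenizer.from_pretrained("bert-base-multilingual-cased")
model = BertModel.from_pretrained("bert-base-multilingual-cased")
model.eval()

def embed(words):
    """Mean-pool the last hidden layer over each word's subtokens."""
    vecs = []
    with torch.no_grad():
        for w in words:
            enc = tokenizer(w, return_tensors="pt")
            out = model(**enc).last_hidden_state[0]
            vecs.append(out[1:-1].mean(dim=0))  # drop [CLS] / [SEP]
    return torch.stack(vecs)

en_words = ["dog", "house", "water"]   # toy source vocabulary
es_words = ["perro", "casa", "agua"]   # toy target vocabulary

en_vecs, es_vecs = embed(en_words), embed(es_words)

# Hypothesized decomposition: representation = language component +
# cross-lingual content. Approximate the language component by each
# language's mean vector and subtract it.
en_centered = en_vecs - en_vecs.mean(dim=0)
es_centered = es_vecs - es_vecs.mean(dim=0)

# Translate each source word to its cosine nearest neighbor.
sims = torch.nn.functional.cosine_similarity(
    en_centered.unsqueeze(1), es_centered.unsqueeze(0), dim=-1
)
for i, w in enumerate(en_words):
    print(w, "->", es_words[sims[i].argmax().item()])
```

On such a tiny vocabulary the output is trivial; the point is only the shape of the procedure: pretrained contextual embeddings, an optional linear step to strip language identity, and retrieval by similarity with no fine-tuning.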