
Bilingual Alignment Pre-Training for Zero-Shot Cross-Lingual Transfer

2021-06-03 · EMNLP (MRQA) 2021 · Code Available

Ziqing Yang, Wentao Ma, Yiming Cui, Jiani Ye, Wanxiang Che, Shijin Wang


Abstract

Multilingual pre-trained models have achieved remarkable performance on cross-lingual transfer learning. Some multilingual models, such as mBERT, have been pre-trained on unlabeled corpora; as a result, the embeddings of different languages in these models may not be well aligned. In this paper, we aim to improve zero-shot cross-lingual transfer performance by proposing a pre-training task named Word-Exchange Aligning Model (WEAM), which uses statistical alignment information as prior knowledge to guide cross-lingual word prediction. We evaluate our model on the multilingual machine reading comprehension task MLQA and the natural language inference task XNLI. The results show that WEAM can significantly improve zero-shot performance.
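The abstract does not spell out WEAM's exact formulation, but it describes using statistical word alignments as prior knowledge for cross-lingual word prediction. The sketch below is only an illustration of that general idea, assuming an alignment dictionary (e.g., from a statistical aligner such as fast_align) is available for parallel sentence pairs; the function name, parameters, and swap-based objective are hypothetical and not taken from the paper.

```python
# Illustrative sketch only: one plausible "word-exchange" data-preparation step.
# Assumes word alignments for a parallel sentence pair, e.g. from fast_align.
import random

def word_exchange(src_tokens, tgt_tokens, alignment, swap_prob=0.15):
    """Swap some aligned source words for their target-language counterparts.

    src_tokens / tgt_tokens: token lists for a parallel sentence pair.
    alignment: list of (src_idx, tgt_idx) pairs from a statistical aligner.
    Returns the code-switched input plus the original words as prediction targets.
    """
    exchanged = list(src_tokens)
    targets = {}  # position -> original source word the model should predict
    for src_idx, tgt_idx in alignment:
        if random.random() < swap_prob:
            targets[src_idx] = src_tokens[src_idx]
            exchanged[src_idx] = tgt_tokens[tgt_idx]
    return exchanged, targets

# Example: English-German pair with word alignments 0-0, 1-1, 2-2.
src = ["the", "cat", "sleeps"]
tgt = ["die", "Katze", "schläft"]
align = [(0, 0), (1, 1), (2, 2)]
print(word_exchange(src, tgt, align, swap_prob=1.0))
```

A model pre-trained on such code-switched inputs is then encouraged to place aligned words from different languages close together in embedding space, which is the intuition behind the alignment objective described above.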
