MT6: Multilingual Pretrained Text-to-Text Transformer with Translation Pairs

2021-04-18EMNLP 2021Code Available1· sign in to hype

Zewen Chi, Li Dong, Shuming Ma, Shaohan Huang Xian-Ling Mao, Heyan Huang, Furu Wei

Code Available — Be the first to reproduce this paper.

Code

github.com/CZWin32768/xnlg
pytorch★ 129
github.com/pwc-1/Paper-9/tree/main/2/mt5
mindspore★ 0

Abstract

Multilingual T5 (mT5) pretrains a sequence-to-sequence model on massive monolingual texts, which has shown promising results on many cross-lingual tasks. In this paper, we improve multilingual text-to-text transfer Transformer with translation pairs (mT6). Specifically, we explore three cross-lingual text-to-text pre-training tasks, namely, machine translation, translation pair span corruption, and translation span corruption. In addition, we propose a partially non-autoregressive objective for text-to-text pre-training. We evaluate the methods on eight multilingual benchmark datasets, including sentence classification, named entity recognition, question answering, and abstractive summarization. Experimental results show that the proposed mT6 improves cross-lingual transferability over mT5.

Tasks

Abstractive Text Summarization Machine Translation named-entity-recognition Named Entity Recognition Named Entity Recognition (NER)Question Answering Sentence Sentence Classification Translation

MT6: Multilingual Pretrained Text-to-Text Transformer with Translation Pairs

Code

Abstract

Tasks

Reproductions