ERNIE-GEN: An Enhanced Multi-Flow Pre-training and Fine-tuning Framework for Natural Language Generation

2020-01-26Code Available1· sign in to hype

Dongling Xiao, Han Zhang, Yukun Li, Yu Sun, Hao Tian, Hua Wu, Haifeng Wang

Code Available — Be the first to reproduce this paper.

Code

github.com/Sharpiless/Versailles-text-generation-with-paddlepaddle
paddle★ 18
github.com/https-github-com-GiangHoang9912/ernie-gen
paddle★ 2
github.com/MindCode-4/code-10/tree/main/FDDE/train-fine
mindspore★ 0
github.com/PaddlePaddle/PaddleNLP/blob/develop/paddlenlp/transformers/ernie_gen/modeling.py
paddle★ 0
github.com/MindSpore-scientific/code-12/tree/main/FDDE/train-fine
mindspore★ 0

Abstract

Current pre-training works in natural language generation pay little attention to the problem of exposure bias on downstream tasks. To address this issue, we propose an enhanced multi-flow sequence to sequence pre-training and fine-tuning framework named ERNIE-GEN, which bridges the discrepancy between training and inference with an infilling generation mechanism and a noise-aware generation method. To make generation closer to human writing patterns, this framework introduces a span-by-span generation flow that trains the model to predict semantically-complete spans consecutively rather than predicting word by word. Unlike existing pre-training methods, ERNIE-GEN incorporates multi-granularity target sampling to construct pre-training data, which enhances the correlation between encoder and decoder. Experimental results demonstrate that ERNIE-GEN achieves state-of-the-art results with a much smaller amount of pre-training data and parameters on a range of language generation tasks, including abstractive summarization (Gigaword and CNN/DailyMail), question generation (SQuAD), dialogue generation (Persona-Chat) and generative question answering (CoQA).

Tasks

Abstractive Text Summarization Decoder Dialogue Generation Generative Question Answering Question Generation Question-Generation Text Generation Text Summarization

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
CNN / Daily Mail	ERNIE-GENBASE	ROUGE-1	42.3	—	Unverified
CNN / Daily Mail	ERNIE-GENLARGE	ROUGE-1	44.02	—	Unverified
CNN / Daily Mail	ERNIE-GENLARGE (large-scale text corpora)	ROUGE-1	44.31	—	Unverified

ERNIE-GEN: An Enhanced Multi-Flow Pre-training and Fine-tuning Framework for Natural Language Generation

Code

Abstract

Tasks

Benchmark Results

Reproductions