Learn to Copy from the Copying History: Correlational Copy Network for Abstractive Summarization

2021-11-01EMNLP 2021Code Available0· sign in to hype

Haoran Li, Song Xu, Peng Yuan, Yujia Wang, Youzheng Wu, Xiaodong He, BoWen Zhou

Code Available — Be the first to reproduce this paper.

Code

github.com/hrlinlp/coconet
Officialpytorch★ 6

Abstract

The copying mechanism has had considerable success in abstractive summarization, facilitating models to directly copy words from the input text to the output summary. Existing works mostly employ encoder-decoder attention, which applies copying at each time step independently of the former ones. However, this may sometimes lead to incomplete copying. In this paper, we propose a novel copying scheme named Correlational Copying Network (CoCoNet) that enhances the standard copying mechanism by keeping track of the copying history. It thereby takes advantage of prior copying distributions and, at each time step, explicitly encourages the model to copy the input word that is relevant to the previously copied one. In addition, we strengthen CoCoNet through pre-training with suitable corpora that simulate the copying behaviors. Experimental results show that CoCoNet can copy more accurately and achieves new state-of-the-art performances on summarization benchmarks, including CNN/DailyMail for news summarization and SAMSum for dialogue summarization. The code and checkpoint will be publicly available.

Tasks

Abstractive Text Summarization Decoder News Summarization

Learn to Copy from the Copying History: Correlational Copy Network for Abstractive Summarization

Code

Abstract

Tasks

Reproductions