SOTAVerified

Extractive Summarization of Long Documents by Combining Global and Local Context

2019-09-17IJCNLP 2019Code Available0· sign in to hype

Wen Xiao, Giuseppe Carenini

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

In this paper, we propose a novel neural single document extractive summarization model for long documents, incorporating both the global context of the whole document and the local context within the current topic. We evaluate the model on two datasets of scientific papers, Pubmed and arXiv, where it outperforms previous work, both extractive and abstractive models, on ROUGE-1, ROUGE-2 and METEOR scores. We also show that, consistently with our goal, the benefits of our method become stronger as we apply it to longer documents. Rather surprisingly, an ablation study indicates that the benefits of our model seem to come exclusively from modeling the local context, even for the longest documents.

Tasks

Benchmark Results

DatasetModelMetricClaimedVerifiedStatus
Arxiv HEP-TH citation graphExtSum-LGROUGE-143.58Unverified
PubmedExtSum-LGROUGE-144.81Unverified

Reproductions