Don't Give Me the Details, Just the Summary! Topic-Aware Convolutional Neural Networks for Extreme Summarization
Shashi Narayan, Shay B. Cohen, Mirella Lapata
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/shashiongithub/XSumOfficialIn paperpytorch★ 0
- github.com/pltrdy/extoracle_summarizationnone★ 0
- github.com/EdinburghNLP/XSumpytorch★ 0
Abstract
We introduce extreme summarization, a new single-document summarization task which does not favor extractive strategies and calls for an abstractive modeling approach. The idea is to create a short, one-sentence news summary answering the question "What is the article about?". We collect a real-world, large-scale dataset for this task by harvesting online articles from the British Broadcasting Corporation (BBC). We propose a novel abstractive model which is conditioned on the article's topics and based entirely on convolutional neural networks. We demonstrate experimentally that this architecture captures long-range dependencies in a document and recognizes pertinent content, outperforming an oracle extractive system and state-of-the-art abstractive approaches when evaluated automatically and by humans.
Tasks
Benchmark Results
| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| X-Sum | T-ConvS2S | ROUGE-1 | 31.89 | — | Unverified |
| X-Sum | Baseline : Extractive Oracle | ROUGE-1 | 29.79 | — | Unverified |
| X-Sum | PtGen | ROUGE-1 | 29.7 | — | Unverified |
| X-Sum | Seq2Seq | ROUGE-1 | 28.42 | — | Unverified |
| X-Sum | PtGen-Covg | ROUGE-1 | 28.1 | — | Unverified |
| X-Sum | Baseline : Lead-3 | ROUGE-1 | 16.3 | — | Unverified |
| X-Sum | Baseline : Random | ROUGE-1 | 15.16 | — | Unverified |