Generating Wikipedia by Summarizing Long Sequences

2018-01-30ICLR 2018Code Available0· sign in to hype

Peter J. Liu, Mohammad Saleh, Etienne Pot, Ben Goodrich, Ryan Sepassi, Lukasz Kaiser, Noam Shazeer

Code Available — Be the first to reproduce this paper.

Code

github.com/aseidelo/wiki_generator
tf★ 5
github.com/brsarah20/Alphafold2
pytorch★ 2
github.com/lucidrains/memory-compressed-attention
pytorch★ 0

Abstract

We show that generating English Wikipedia articles can be approached as a multi- document summarization of source documents. We use extractive summarization to coarsely identify salient information and a neural abstractive model to generate the article. For the abstractive model, we introduce a decoder-only architecture that can scalably attend to very long sequences, much longer than typical encoder- decoder architectures used in sequence transduction. We show that this model can generate fluent, coherent multi-sentence paragraphs and even whole Wikipedia articles. When given reference documents, we show it can extract relevant factual information as reflected in perplexity, ROUGE scores and human evaluations.

Tasks

Articles Decoder Document Summarization Extractive Summarization Multi-Document Summarization Sentence

Generating Wikipedia by Summarizing Long Sequences

Code

Abstract

Tasks

Reproductions