SOTAVerified

HowSumm: A Multi-Document Summarization Dataset Derived from WikiHow Articles

2021-10-07Code Available1· sign in to hype

Odellia Boni, Guy Feigenblat, Guy Lev, Michal Shmueli-Scheuer, Benjamin Sznajder, David Konopnicki

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

We present HowSumm, a novel large-scale dataset for the task of query-focused multi-document summarization (qMDS), which targets the use-case of generating actionable instructions from a set of sources. This use-case is different from the use-cases covered in existing multi-document summarization (MDS) datasets and is applicable to educational and industrial scenarios. We employed automatic methods, and leveraged statistics from existing human-crafted qMDS datasets, to create HowSumm from wikiHow website articles and the sources they cite. We describe the creation of the dataset and discuss the unique features that distinguish it from other summarization corpora. Automatic and human evaluations of both extractive and abstractive summarization models on the dataset reveal that there is room for improvement.

Tasks

Benchmark Results

DatasetModelMetricClaimedVerifiedStatus
HowSumm-MethodLexRank (query: method + article + steps titles)ROUGE-153.5Unverified
HowSumm-MethodCES (query: method + article + steps titles)ROUGE-152.2Unverified
HowSumm-MethodGreedyRel (query: method + article + steps titles)ROUGE-148.6Unverified
HowSumm-MethodCES (query: method title)ROUGE-148.4Unverified
HowSumm-MethodCES (query: method + article titles)ROUGE-148.3Unverified
HowSumm-MethodLexRank (query: method title)ROUGE-147.7Unverified
HowSumm-MethodLexRank (query: method + article titles)ROUGE-147.1Unverified
HowSumm-MethodGreedyRel (query: method title)ROUGE-143.4Unverified
HowSumm-MethodGreedyRel (query: method + article titles)ROUGE-142.3Unverified
HowSumm-StepLexRank (query: step title)ROUGE-139.6Unverified
HowSumm-StepCES (query: step title)ROUGE-139.3Unverified
HowSumm-StepCES (query: step + method titles)ROUGE-138.3Unverified
HowSumm-StepLexRank (query: step + method titles)ROUGE-138.2Unverified
HowSumm-StepCES (query: step + method + article titles)ROUGE-137Unverified
HowSumm-StepLexRank (query: step + method + article titles)ROUGE-136.3Unverified
HowSumm-StepGreedyRel (query: step + method titles)ROUGE-130.3Unverified
HowSumm-StepGreedyRel (query: step title)ROUGE-130.1Unverified
HowSumm-StepBM25-HierSumm (query: step + method titles)ROUGE-123Unverified
HowSumm-StepBM25-HierSumm (query: step title)ROUGE-122.3Unverified
HowSumm-StepBM25-HierSumm (query: step + method + article titles)ROUGE-121.9Unverified

Reproductions