Barch: an English Dataset of Bar Chart Summaries

2022-06-01 · LREC 2022

Iza Škrjanec, Muhammad Salman Edhi, Vera Demberg

Abstract

We present Barch, a new English dataset of human-written summaries describing bar charts. The dataset contains 47 charts based on a selection of 18 topics. Each chart is associated with one of four intended messages, expressed in the chart title. Using crowdsourcing, we collected around 20 summaries per chart, or roughly one thousand in total. The text of the summaries is aligned with the chart data as well as with analytical inferences about the data drawn by humans. Our dataset is one of the first to explore the effect of intended messages on the data descriptions in chart summaries. Additionally, it lends itself well to the task of training data-driven systems for chart-to-text generation. We provide results on the performance of state-of-the-art neural generation models trained on this dataset and discuss the strengths and shortcomings of different models.
