SOTAVerified

Facts That Matter

2018-10-01EMNLP 2018Code Available0· sign in to hype

Marco Ponza, Luciano del Corro, Gerhard Weikum

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

This work introduces fact salience: The task of generating a machine-readable representation of the most prominent information in a text document as a set of facts. We also present SalIE, the first fact salience system. SalIE is unsupervised and knowledge agnostic, based on open information extraction to detect facts in natural language text, PageRank to determine their relevance, and clustering to promote diversity. We compare SalIE with several baselines (including positional, standard for saliency tasks), and in an extrinsic evaluation, with state-of-the-art automatic text summarizers. SalIE outperforms baselines and text summarizers showing that facts are an effective way to compress information.

Tasks

Reproductions