SOTAVerified

A Graph-theoretic Summary Evaluation for ROUGE

2018-10-01EMNLP 2018Unverified0· sign in to hype

Elaheh ShafieiBavani, Mohammad Ebrahimi, Raymond Wong, Fang Chen

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

ROUGE is one of the first and most widely used evaluation metrics for text summarization. However, its assessment merely relies on surface similarities between peer and model summaries. Consequently, ROUGE is unable to fairly evaluate summaries including lexical variations and paraphrasing. We propose a graph-based approach adopted into ROUGE to evaluate summaries based on both lexical and semantic similarities. Experiment results over TAC AESOP datasets show that exploiting the lexico-semantic similarity of the words used in summaries would significantly help ROUGE correlate better with human judgments.

Tasks

Reproductions