SOTAVerified

What the F-measure doesn't measure: Features, Flaws, Fallacies and Fixes

2015-03-22Unverified0· sign in to hype

David M. W. Powers

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

The F-measure or F-score is one of the most commonly used single number measures in Information Retrieval, Natural Language Processing and Machine Learning, but it is based on a mistake, and the flawed assumptions render it unsuitable for use in most contexts! Fortunately, there are better alternatives.

Tasks

Reproductions