SOTAVerified

Countering the Influence of Essay Length in Neural Essay Scoring

2021-11-01EMNLP (sustainlp) 2021Code Available1· sign in to hype

Sungho Jeon, Michael Strube

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

Previous work has shown that automated essay scoring systems, in particular machine learning-based systems, are not capable of assessing the quality of essays, but are relying on essay length, a factor irrelevant to writing proficiency. In this work, we first show that state-of-the-art systems, recent neural essay scoring systems, might be also influenced by the correlation between essay length and scores in a standard dataset. In our evaluation, a very simple neural model shows the state-of-the-art performance on the standard dataset. To consider essay content without taking essay length into account, we introduce a simple neural model assessing the similarity of content between an input essay and essays assigned different scores. This neural model achieves performance comparable to the state of the art on a standard dataset as well as on a second dataset. Our findings suggest that neural essay scoring systems should consider the characteristics of datasets to focus on text quality.

Tasks

Reproductions