A Neural Pairwise Ranking Model for Readability Assessment

2022-03-14 · Findings (ACL) 2022 · Code Available

Justin Lee, Sowmya Vajjala

Code

github.com/jlee118/nprm
Official · In paper · tf · ★ 6

Abstract

Automatic Readability Assessment (ARA), the task of assigning a reading level to a text, is traditionally treated as a classification problem in NLP research. In this paper, we propose the first neural, pairwise ranking approach to ARA and compare it with existing classification, regression, and (non-neural) ranking methods. We establish the performance of our model by conducting experiments with three English datasets, one French dataset, and one Spanish dataset. We demonstrate that our approach performs well in monolingual single-corpus and cross-corpus testing scenarios and achieves a zero-shot cross-lingual ranking accuracy of over 80% for both French and Spanish when trained on English data. We also release a new parallel bilingual readability dataset in English and French. To our knowledge, this paper proposes the first neural pairwise ranking model for ARA and reports the first results of cross-lingual, zero-shot evaluation of ARA with neural models.
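
The abstract reports results as ranking accuracy but does not define the metric on this page. As a rough illustration only, the sketch below computes a generic pairwise ranking accuracy: the share of text pairs with different gold reading levels whose predicted scores fall in the same order as those levels. The function name, the tie handling, and the toy data are assumptions made for this example and are not taken from the paper or its code.

```python
from itertools import combinations


def pairwise_ranking_accuracy(scores, levels):
    """Fraction of text pairs with different gold reading levels whose
    predicted scores are ordered the same way as their levels.

    This is a generic definition assumed for illustration; the paper's
    exact evaluation protocol may differ.
    """
    correct = 0
    total = 0
    for i, j in combinations(range(len(levels)), 2):
        if levels[i] == levels[j]:
            continue  # equal gold levels give no ordering to check
        total += 1
        # Correct when the score difference and the level difference share a sign.
        if (scores[i] - scores[j]) * (levels[i] - levels[j]) > 0:
            correct += 1
    return correct / total if total else 0.0


# Hypothetical toy data: three texts at reading levels 0 < 1 < 2.
levels = [0, 1, 2]
scores = [0.1, 0.7, 0.4]
print(pairwise_ranking_accuracy(scores, levels))
```

In the toy data, two of the three pairs are ordered correctly, since the level-1 text is scored above the level-2 text. Pairs with tied predicted scores count as incorrect under this definition, which is one of several reasonable conventions.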

Tasks

Cross-corpus model Text Classification

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
OneStopEnglish (Readability Assessment)	NPRM-BERT	Accuracy (5-fold)	0.98	—	Unverified
