SOTAVerified

Towards a Task-Agnostic Model of Difficulty Estimation for Supervised Learning Tasks

2020-12-01Asian Chapter of the Association for Computational LinguisticsCode Available0· sign in to hype

Antonio Laverghetta Jr., Jamshidbek Mirzakhalov, John Licato

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

Curriculum learning, a training strategy where training data are ordered based on their difficulty, has been shown to improve performance and reduce training time on various NLP tasks. While much work over the years has developed novel approaches for generating curricula, these strategies are typically only suited for the task they were designed for. This work explores developing a task-agnostic model for problem difficulty and applying it to the Stanford Natural Language Inference (SNLI) dataset. Using the human responses that come with the dev set of SNLI, we train both regression and classification models to predict how many annotators will answer a question correctly and then project the difficulty estimates onto the full SNLI train set to create the curriculum. We argue that our curriculum is effectively capturing difficulty for this task through various analyses of both the model and the predicted difficulty scores.

Tasks

Reproductions