A Novel Workflow for Accurately and Efficiently Crowdsourcing Predicate Senses and Argument Labels

2020-11-01Findings of the Association for Computational LinguisticsCode Available0· sign in to hype

Youxuan Jiang, Huaiyu Zhu, Jonathan K. Kummerfeld, Yunyao Li, Walter Lasecki

Code Available — Be the first to reproduce this paper.

Code

github.com/system-t/crowdsourcingsrl
OfficialIn papernone★ 0

Abstract

Resources for Semantic Role Labeling (SRL) are typically annotated by experts at great expense. Prior attempts to develop crowdsourcing methods have either had low accuracy or required substantial expert annotation. We propose a new multi-stage crowd workflow that substantially reduces expert involvement without sacrificing accuracy. In particular, we introduce a unique filter stage based on the key observation that crowd workers are able to almost perfectly filter out incorrect options for labels. Our three-stage workflow produces annotations with 95\% accuracy for predicate labels and 93\% for argument labels, which is comparable to expert agreement. Compared to prior work on crowdsourcing for SRL, we decrease expert effort by 4x, from 56\% to 14\% of cases. Our approach enables more scalable annotation of SRL, and could enable annotation of NLP tasks that have previously been considered too complex to effectively crowdsource.

Tasks

Semantic Role Labeling

A Novel Workflow for Accurately and Efficiently Crowdsourcing Predicate Senses and Argument Labels

Code

Abstract

Tasks

Reproductions