Unsupervised Paraphrasing Consistency Training for Low Resource Named Entity Recognition

2021-11-01EMNLP 2021Unverified0· sign in to hype

Rui Wang, Ricardo Henao

Unverified — Be the first to reproduce this paper.

Abstract

Unsupervised consistency training is a way of semi-supervised learning that encourages consistency in model predictions between the original and augmented data. For Named Entity Recognition (NER), existing approaches augment the input sequence with token replacement, assuming annotations on the replaced positions unchanged. In this paper, we explore the use of paraphrasing as a more principled data augmentation scheme for NER unsupervised consistency training. Specifically, we convert Conditional Random Field (CRF) into a multi-label classification module and encourage consistency on the entity appearance between the original and paraphrased sequences. Experiments show that our method is especially effective when annotations are limited.

Tasks

Data Augmentation Low Resource Named Entity Recognition Multi-Label Classification MUlTI-LABEL-ClASSIFICATION named-entity-recognition Named Entity Recognition Named Entity Recognition (NER)NER

Unsupervised Paraphrasing Consistency Training for Low Resource Named Entity Recognition

Abstract

Tasks

Reproductions