Improving Low-Resource Named Entity Recognition using Joint Sentence and Token Labeling

2020-07-01ACL 2020Unverified0· sign in to hype

Canasai Kruengkrai, Thien Hai Nguyen, Sharifah Mahani Aljunied, Lidong Bing

Unverified — Be the first to reproduce this paper.

Abstract

Exploiting sentence-level labels, which are easy to obtain, is one of the plausible methods to improve low-resource named entity recognition (NER), where token-level labels are costly to annotate. Current models for jointly learning sentence and token labeling are limited to binary classification. We present a joint model that supports multi-class classification and introduce a simple variant of self-attention that allows the model to learn scaling factors. Our model produces 3.78\%, 4.20\%, 2.08\% improvements in F1 over the BiLSTM-CRF baseline on e-commerce product titles in three different low-resource languages: Vietnamese, Thai, and Indonesian, respectively.

Tasks

Binary Classification Classification General Classification Low Resource Named Entity Recognition Multi-class Classification named-entity-recognition Named Entity Recognition Named Entity Recognition (NER)NER Sentence

Improving Low-Resource Named Entity Recognition using Joint Sentence and Token Labeling

Abstract

Tasks

Reproductions