SOTAVerified

Ultra-Fine Entity Typing with Weak Supervision from a Masked Language Model

2021-06-08ACL 2021Code Available1· sign in to hype

Hongliang Dai, Yangqiu Song, Haixun Wang

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

Recently, there is an effort to extend fine-grained entity typing by using a richer and ultra-fine set of types, and labeling noun phrases including pronouns and nominal nouns instead of just named entity mentions. A key challenge for this ultra-fine entity typing task is that human annotated data are extremely scarce, and the annotation ability of existing distant or weak supervision approaches is very limited. To remedy this problem, in this paper, we propose to obtain training data for ultra-fine entity typing by using a BERT Masked Language Model (MLM). Given a mention in a sentence, our approach constructs an input for the BERT MLM so that it predicts context dependent hypernyms of the mention, which can be used as type labels. Experimental results demonstrate that, with the help of these automatically generated labels, the performance of an ultra-fine entity typing model can be improved substantially. We also show that our approach can be applied to improve traditional fine-grained entity typing after performing simple type mapping.

Tasks

Benchmark Results

DatasetModelMetricClaimedVerifiedStatus
Ontonotes v5 (English)MLMETF149.1Unverified
Open EntityMLMETF149.1Unverified
Open EntityMLMETF178.2Unverified

Reproductions