Ultra-Fine Entity Typing with Weak Supervision from a Masked Language Model

2021-06-08ACL 2021Code Available1· sign in to hype

Hongliang Dai, Yangqiu Song, Haixun Wang

Code Available — Be the first to reproduce this paper.

Code

github.com/HKUST-KnowComp/MLMET
OfficialIn paperpytorch★ 18

Abstract

Recently, there is an effort to extend fine-grained entity typing by using a richer and ultra-fine set of types, and labeling noun phrases including pronouns and nominal nouns instead of just named entity mentions. A key challenge for this ultra-fine entity typing task is that human annotated data are extremely scarce, and the annotation ability of existing distant or weak supervision approaches is very limited. To remedy this problem, in this paper, we propose to obtain training data for ultra-fine entity typing by using a BERT Masked Language Model (MLM). Given a mention in a sentence, our approach constructs an input for the BERT MLM so that it predicts context dependent hypernyms of the mention, which can be used as type labels. Experimental results demonstrate that, with the help of these automatically generated labels, the performance of an ultra-fine entity typing model can be improved substantially. We also show that our approach can be applied to improve traditional fine-grained entity typing after performing simple type mapping.

Tasks

Entity Typing Language Modeling Language Modelling Sentence

Benchmark Results

Dataset	Model	Metric	Claimed	Verified	Status
Ontonotes v5 (English)	MLMET	F1	49.1	—	Unverified
Open Entity	MLMET	F1	49.1	—	Unverified
Open Entity	MLMET	F1	78.2	—	Unverified

Ultra-Fine Entity Typing with Weak Supervision from a Masked Language Model

Code

Abstract

Tasks

Benchmark Results

Reproductions