ReFinED: An Efficient Zero-shot-capable Approach to End-to-End Entity Linking
Tom Ayoola, Shubhi Tyagi, Joseph Fisher, Christos Christodoulopoulos, Andrea Pierleoni
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/alexa/refinedOfficialIn paperpytorch★ 237
- github.com/amazon-research/ReFinEDOfficialpytorch★ 237
- github.com/amazon-science/ReFinEDpytorch★ 237
Abstract
We introduce ReFinED, an efficient end-to-end entity linking model which uses fine-grained entity types and entity descriptions to perform linking. The model performs mention detection, fine-grained entity typing, and entity disambiguation for all mentions within a document in a single forward pass, making it more than 60 times faster than competitive existing approaches. ReFinED also surpasses state-of-the-art performance on standard entity linking datasets by an average of 3.7 F1. The model is capable of generalising to large-scale knowledge bases such as Wikidata (which has 15 times more entities than Wikipedia) and of zero-shot entity linking. The combination of speed, accuracy and scale makes ReFinED an effective and cost-efficient system for extracting entities from web-scale datasets, for which the model has been successfully deployed. Our code and pre-trained models are available at https://github.com/alexa/ReFinED
Tasks
Benchmark Results
| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| ACE2004 | ReFinED | Micro-F1 | 91.6 | — | Unverified |
| AIDA-CoNLL | ReFinED | In-KB Accuracy | 93.9 | — | Unverified |
| AQUAINT | ReFinED | Micro-F1 | 91.8 | — | Unverified |
| MSNBC | ReFinED | Micro-F1 | 94.4 | — | Unverified |
| MSNBC | ReFinED | Micro-F1 | 71.8 | — | Unverified |
| WNED-CWEB | ReFinED | Micro-F1 | 79.4 | — | Unverified |
| WNED-WIKI | ReFinED | Micro-F1 | 88.7 | — | Unverified |