Entity Resolution
Entity resolution (also known as entity matching, record linkage, or duplicate detection) is the task of finding records that refer to the same real-world entity across different data sources (e.g., data files, books, websites, and databases). (Source: Wikipedia)
Surveys on entity resolution:
-
Christophides et al.: End-to-End Entity Resolution for Big Data: A Survey, 2020.
-
Barlaug and Gulla: Neural Networks for Entity Matching: A Survey, 2021.
The task of entity resolution is closely related to the task of entity alignment which focuses on matching entities between knowledge bases. The task of entity linking differs from entity resolution as entity linking focuses on identifying entity mentions in free text.
Papers
Showing 1–10 of 184 papers
All datasetsAmazon-GoogleAbt-BuyWDC Products-80%cc-seen-mediumWDC Computers-smallWDC Computers-xlargeWDC Products-50%cc-unseen-mediumWDC Watches-smallMusicBrainz20KWDC Watches-xlargeWDC Products-80%cc-seen-medium-multiWDC Products
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | gpt4-0613_fewshot-10 | F1 (%) | 85.21 | — | Unverified |
| 2 | gpt-4o-mini-2024-07-18_fine_tuned | F1 (%) | 80.25 | — | Unverified |
| 3 | RoBERTa-SupCon | F1 (%) | 79.28 | — | Unverified |
| 4 | RobEM | F1 (%) | 79.06 | — | Unverified |
| 5 | Random Forest | F1 (%) | 79 | — | Unverified |
| 6 | HG | F1 (%) | 76.4 | — | Unverified |
| 7 | Ditto | F1 (%) | 75.58 | — | Unverified |
| 8 | CorDEL-Sum | F1 (%) | 70.2 | — | Unverified |
| 9 | DeepMatcher - Hybrid | F1 (%) | 69.3 | — | Unverified |
| 10 | D-HAT | F1 (%) | 67.5 | — | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | gpt4-0613_zeroshot | F1 (%) | 95.78 | — | Unverified |
| 2 | RoBERTa-SupCon | F1 (%) | 94.29 | — | Unverified |
| 3 | gpt-4o-mini-2024-07-18_fine_tuned | F1 (%) | 94.09 | — | Unverified |
| 4 | gpt-4o-2024-08-06 | F1 (%) | 92.2 | — | Unverified |
| 5 | RobEM | F1 (%) | 90.9 | — | Unverified |
| 6 | HG | F1 (%) | 89.8 | — | Unverified |
| 7 | Ditto | F1 (%) | 89.33 | — | Unverified |
| 8 | gpt-4o-mini-2024-07-18 | F1 (%) | 87.68 | — | Unverified |
| 9 | Meta-Llama-3.1-8B-Instruct_fine_tuned | F1 (%) | 87.34 | — | Unverified |
| 10 | Random Forest | F1 (%) | 85 | — | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | gpt4-0613_zeroshot | F1 (%) | 89.61 | — | Unverified |
| 2 | gpt-4o-2024-08-06_fine_tuned_wdc_small | F1 (%) | 87.1 | — | Unverified |
| 3 | gpt-4o-mini-2024-07-18_structured_explanations | F1 (%) | 84.38 | — | Unverified |
| 4 | gpt-4o-mini-2024-07-18 | F1 (%) | 81.61 | — | Unverified |
| 5 | RoBERTa-SupCon | F1 (%) | 79.99 | — | Unverified |
| 6 | Llama3.1_70B_structured_explanations | F1 (%) | 76.7 | — | Unverified |
| 7 | Llama3.1_70B | F1 (%) | 75.2 | — | Unverified |
| 8 | Llama3.1_8B_error-based_example_selection | F1 (%) | 74.37 | — | Unverified |
| 9 | Llama3.1_8B_structured_explanations | F1 (%) | 74.13 | — | Unverified |
| 10 | Ditto | F1 (%) | 73.93 | — | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | BERT | F1 (%) | 96.53 | — | Unverified |
| 2 | RoBERTa-SupCon | F1 (%) | 95.21 | — | Unverified |
| 3 | HG | F1 (%) | 88.5 | — | Unverified |
| 4 | DADER-MMD | F1 (%) | 88 | — | Unverified |
| 5 | Ditto | F1 (%) | 80.76 | — | Unverified |
| 6 | JointBERT | F1 (%) | 77.55 | — | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | RoBERTa-SupCon | F1 (%) | 98.33 | — | Unverified |
| 2 | JointBERT | F1 (%) | 97.49 | — | Unverified |
| 3 | BERT | F1 (%) | 97.37 | — | Unverified |
| 4 | HG | F1 (%) | 96.5 | — | Unverified |
| 5 | Ditto | F1 (%) | 95.45 | — | Unverified |
| 6 | Random Forest | F1 (%) | 78 | — | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | RoBERTa-base | F1 (%) | 71.14 | — | Unverified |
| 2 | Ditto | F1 (%) | 70.66 | — | Unverified |
| 3 | HG | F1 (%) | 68.74 | — | Unverified |
| 4 | RoBERTa-SupCon | F1 (%) | 57.23 | — | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | HG | F1 (%) | 94 | — | Unverified |
| 2 | DADER-NoDA | F1 (%) | 88.6 | — | Unverified |
| 3 | Ditto | F1 (%) | 85.12 | — | Unverified |
| 4 | JointBERT | F1 (%) | 75.83 | — | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | ALMSER-GB | F1 | 0.95 | — | Unverified |
| 2 | FAMER-SplitMerge | F1 | 0.88 | — | Unverified |
| 3 | FAMER-Split | F1 | 0.84 | — | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | JointBERT | F1 (%) | 97.09 | — | Unverified |
| 2 | Ditto | F1 (%) | 96.53 | — | Unverified |
| 3 | HG | F1 (%) | 96.5 | — | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | RoBERTa-SupCon | F1 Micro | 88.63 | — | Unverified |
| 2 | RoBERTa-base | F1 Micro | 52.03 | — | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | gpt-4o-2024-08-06_fine_tuned_wdc_small | F1 (%) | 87.07 | — | Unverified |