SOTAVerified

Entity Resolution

Entity resolution (also known as entity matching, record linkage, or duplicate detection) is the task of finding records that refer to the same real-world entity across different data sources (e.g., data files, books, websites, and databases). (Source: Wikipedia)
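As an illustration of the task defined above, here is a minimal sketch of pairwise matching between two small record sources. It is not taken from any of the papers listed on this page: the record contents, the `resolve` helper, and the 0.8 threshold are all hypothetical, and plain string similarity from Python's standard `difflib` stands in for the learned matchers that the listed work studies.

```python
from difflib import SequenceMatcher
from itertools import product


def normalize(text: str) -> str:
    """Lowercase and collapse whitespace so trivial formatting differences vanish."""
    return " ".join(text.lower().split())


def similarity(a: str, b: str) -> float:
    """String similarity in [0, 1] based on difflib's matching-block ratio."""
    return SequenceMatcher(None, normalize(a), normalize(b)).ratio()


def resolve(source_a: dict, source_b: dict, threshold: float = 0.8) -> list:
    """Compare every record pair across the two sources and keep likely matches.

    The threshold is an illustrative assumption, not a recommended value.
    """
    matches = []
    for (id_a, name_a), (id_b, name_b) in product(source_a.items(), source_b.items()):
        score = similarity(name_a, name_b)
        if score >= threshold:
            matches.append((id_a, id_b, round(score, 2)))
    return matches


# Hypothetical records from two data sources.
source_a = {"a1": "Apple iPhone 13 128GB", "a2": "Samsung Galaxy S21"}
source_b = {"b1": "apple  iphone 13 128gb", "b2": "Sony WH-1000XM4 Headphones"}

matches = resolve(source_a, source_b)
print(matches)
```

Comparing every pair is quadratic in the number of records, so practical pipelines usually add a blocking step that first narrows the candidate pairs.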

Surveys on entity resolution:

Entity resolution is closely related to entity alignment, which focuses on matching entities between knowledge bases. Entity linking differs from entity resolution in that it focuses on identifying entity mentions in free text.

Papers

Showing 1-50 of 184 papers

Title | Status | Hype
Can Foundation Models Wrangle Your Data? | Code | 5
AutoBlock: A Hands-off Blocking Framework for Entity Matching | Code | 1
How to Evaluate Entity Resolution Systems: An Entity-Centric Framework with Application to Inventor Name Disambiguation | Code | 1
Intermediate Training of BERT for Product Matching | Code | 1
Rotom: A Meta-Learned Data Augmentation Framework for Entity Matching, Data Cleaning, Text Classification, and Beyond | Code | 1
PIZZA: A new benchmark for complex end-to-end task-oriented parsing | Code | 1
Match, Compare, or Select? An Investigation of Large Language Models for Entity Matching | Code | 1
Entity Matching using Large Language Models | Code | 1
Deep Indexed Active Learning for Matching Heterogeneous Entity Representations | Code | 1
A Critical Re-evaluation of Neural Methods for Entity Alignment | Code | 1
Deep Entity Matching with Pre-Trained Language Models | Code | 1
Cost-Effective In-Context Learning for Entity Resolution: A Design Space Exploration | Code | 1
Supervised Contrastive Learning for Product Matching | Code | 1
A Deep Learning Approach to Geographical Candidate Selection through Toponym Matching | Code | 1
Entity Resolution with Hierarchical Graph Attention Networks | Code | 1
WDC Products: A Multi-Dimensional Entity Matching Benchmark | Code | 1
Fine-tuning Large Language Models for Entity Matching | Code | 1
Estimating the Performance of Entity Resolution Algorithms: Lessons Learned Through PatentsView.org | Code | 1
Dual-Objective Fine-Tuning of BERT for Entity Matching | Code | 1
Domain Adaptation for Deep Entity Resolution: A Design Space Exploration | Code | 1
A Practioner's Guide to Evaluating Entity Resolution Results | Code | 1
Using ChatGPT for Entity Matching | Code | 1
Pre-trained Embeddings for Entity Resolution: An Experimental Analysis [Experiment, Analysis & Benchmark] | Code | 1
Unicorn: A Unified Multi-tasking Model for Supporting Matching Tasks in Data Integration | Code | 1
SC-Block: Supervised Contrastive Blocking within Entity Resolution Pipelines | Code | 0
Probing the Robustness of Pre-trained Language Models for Entity Matching | Code | 0
Profiling Entity Matching Benchmark Tasks | Code | 0
ZeroER: Entity Resolution using Zero Labeled Examples | Code | 0
Analyzing how BERT performs entity matching | Code | 0
Learning Text Representations for 500K Classification Tasks on Named Entity Disambiguation | Code | 0
In Search of an Entity Resolution OASIS: Optimal Asymptotic Sequential Importance Sampling | Code | 0
Optimal Transport-based Alignment of Learned Character Representations for String Similarity | Code | 0
Text2Tracks: Prompt-based Music Recommendation via Generative Retrieval | Code | 0
FairER: Entity Resolution with Fairness Constraints | Code | 0
Effective Explanations for Entity Resolution Models | Code | 0
FlexER: Flexible Entity Resolution for Multiple Intents | Code | 0
Deep Learning for Entity Matching: A Design Space Exploration | Code | 0
Active Gradual Machine Learning for Entity Resolution | Code | 0
ChatPD: An LLM-driven Paper-Dataset Networking System | Code | 0
EAGER: Embedding-Assisted Entity Resolution for Knowledge Graphs | Code | 0
Graph-boosted Active Learning for Multi-Source Entity Resolution | Code | 0
CEREC: A Corpus for Entity Resolution in Email Conversations | Code | 0
Crowdsourcing and Aggregating Nested Markable Annotations | Code | 0
Accelerating Column Generation via Flexible Dual Optimal Inequalities with Application to Entity Resolution | Code | 0
d-blink: Distributed End-to-End Bayesian Entity Resolution | Code | 0
Deduplication Over Heterogeneous Attribute Types (D-HAT) | Code | 0
Bonafide at LegalLens 2024 Shared Task: Using Lightweight DeBERTa Based Encoder For Legal Violation Detection and Resolution | Code | 0
Biomedical Named Entity Recognition at Scale | Code | 0
Cross-Language Learning for Entity Matching | Code | 0
A Critical Re-evaluation of Benchmark Datasets for (Deep) Learning-Based Matching Algorithms | Code | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | gpt4-0613_fewshot-10 | F1 (%) | 85.21 | | Unverified
2 | gpt-4o-mini-2024-07-18_fine_tuned | F1 (%) | 80.25 | | Unverified
3 | RoBERTa-SupCon | F1 (%) | 79.28 | | Unverified
4 | RobEM | F1 (%) | 79.06 | | Unverified
5 | Random Forest | F1 (%) | 79 | | Unverified
6 | HG | F1 (%) | 76.4 | | Unverified
7 | Ditto | F1 (%) | 75.58 | | Unverified
8 | CorDEL-Sum | F1 (%) | 70.2 | | Unverified
9 | DeepMatcher - Hybrid | F1 (%) | 69.3 | | Unverified
10 | D-HAT | F1 (%) | 67.5 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | gpt4-0613_zeroshot | F1 (%) | 95.78 | | Unverified
2 | RoBERTa-SupCon | F1 (%) | 94.29 | | Unverified
3 | gpt-4o-mini-2024-07-18_fine_tuned | F1 (%) | 94.09 | | Unverified
4 | gpt-4o-2024-08-06 | F1 (%) | 92.2 | | Unverified
5 | RobEM | F1 (%) | 90.9 | | Unverified
6 | HG | F1 (%) | 89.8 | | Unverified
7 | Ditto | F1 (%) | 89.33 | | Unverified
8 | gpt-4o-mini-2024-07-18 | F1 (%) | 87.68 | | Unverified
9 | Meta-Llama-3.1-8B-Instruct_fine_tuned | F1 (%) | 87.34 | | Unverified
10 | Random Forest | F1 (%) | 85 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | gpt4-0613_zeroshot | F1 (%) | 89.61 | | Unverified
2 | gpt-4o-2024-08-06_fine_tuned_wdc_small | F1 (%) | 87.1 | | Unverified
3 | gpt-4o-mini-2024-07-18_structured_explanations | F1 (%) | 84.38 | | Unverified
4 | gpt-4o-mini-2024-07-18 | F1 (%) | 81.61 | | Unverified
5 | RoBERTa-SupCon | F1 (%) | 79.99 | | Unverified
6 | Llama3.1_70B_structured_explanations | F1 (%) | 76.7 | | Unverified
7 | Llama3.1_70B | F1 (%) | 75.2 | | Unverified
8 | Llama3.1_8B_error-based_example_selection | F1 (%) | 74.37 | | Unverified
9 | Llama3.1_8B_structured_explanations | F1 (%) | 74.13 | | Unverified
10 | Ditto | F1 (%) | 73.93 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | BERT | F1 (%) | 96.53 | | Unverified
2 | RoBERTa-SupCon | F1 (%) | 95.21 | | Unverified
3 | HG | F1 (%) | 88.5 | | Unverified
4 | DADER-MMD | F1 (%) | 88 | | Unverified
5 | Ditto | F1 (%) | 80.76 | | Unverified
6 | JointBERT | F1 (%) | 77.55 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | RoBERTa-SupCon | F1 (%) | 98.33 | | Unverified
2 | JointBERT | F1 (%) | 97.49 | | Unverified
3 | BERT | F1 (%) | 97.37 | | Unverified
4 | HG | F1 (%) | 96.5 | | Unverified
5 | Ditto | F1 (%) | 95.45 | | Unverified
6 | Random Forest | F1 (%) | 78 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | RoBERTa-base | F1 (%) | 71.14 | | Unverified
2 | Ditto | F1 (%) | 70.66 | | Unverified
3 | HG | F1 (%) | 68.74 | | Unverified
4 | RoBERTa-SupCon | F1 (%) | 57.23 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | HG | F1 (%) | 94 | | Unverified
2 | DADER-NoDA | F1 (%) | 88.6 | | Unverified
3 | Ditto | F1 (%) | 85.12 | | Unverified
4 | JointBERT | F1 (%) | 75.83 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | ALMSER-GB | F1 | 0.95 | | Unverified
2 | FAMER-SplitMerge | F1 | 0.88 | | Unverified
3 | FAMER-Split | F1 | 0.84 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | JointBERT | F1 (%) | 97.09 | | Unverified
2 | Ditto | F1 (%) | 96.53 | | Unverified
3 | HG | F1 (%) | 96.5 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | RoBERTa-SupCon | F1 Micro | 88.63 | | Unverified
2 | RoBERTa-base | F1 Micro | 52.03 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | gpt-4o-2024-08-06_fine_tuned_wdc_small | F1 (%) | 87.07 | | Unverified
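
The leaderboards above report F1, the harmonic mean of precision and recall. This page does not say how each benchmark computes it, so the following is only a sketch under the assumption that F1 is measured over predicted match pairs against gold match pairs. The `pairwise_f1` helper and the record identifiers are hypothetical.

```python
def pairwise_f1(predicted: set, gold: set) -> tuple:
    """Return (precision, recall, f1) for predicted match pairs against gold pairs."""
    true_pos = len(predicted & gold)   # pairs correctly predicted as matches
    false_pos = len(predicted - gold)  # predicted matches that are not in the gold set
    false_neg = len(gold - predicted)  # gold matches the system missed

    precision = true_pos / (true_pos + false_pos) if true_pos + false_pos else 0.0
    recall = true_pos / (true_pos + false_neg) if true_pos + false_neg else 0.0
    if precision + recall == 0:
        return precision, recall, 0.0
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


# Hypothetical gold and predicted match pairs.
gold = {("a1", "b1"), ("a2", "b2"), ("a3", "b3"), ("a4", "b4")}
predicted = {("a1", "b1"), ("a2", "b2"), ("a3", "b3"), ("a5", "b9")}

precision, recall, f1 = pairwise_f1(predicted, gold)
print(f"precision={precision:.2f} recall={recall:.2f} F1 (%)={100 * f1:.2f}")
```

Under this assumption, a score listed as "F1 (%)" is this value multiplied by 100, while the table that lists plain "F1" reports it on the 0 to 1 scale.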