SOTAVerified

Entity Resolution

Entity resolution (also known as entity matching, record linkage, or duplicate detection) is the task of finding records that refer to the same real-world entity across different data sources (e.g., data files, books, websites, and databases). (Source: Wikipedia)

Surveys on entity resolution:

The task of entity resolution is closely related to the task of entity alignment which focuses on matching entities between knowledge bases. The task of entity linking differs from entity resolution as entity linking focuses on identifying entity mentions in free text.

Papers

Showing 150 of 184 papers

TitleStatusHype
Can Foundation Models Wrangle Your Data?Code5
Fine-tuning Large Language Models for Entity MatchingCode1
Match, Compare, or Select? An Investigation of Large Language Models for Entity MatchingCode1
How to Evaluate Entity Resolution Systems: An Entity-Centric Framework with Application to Inventor Name DisambiguationCode1
Cost-Effective In-Context Learning for Entity Resolution: A Design Space ExplorationCode1
Entity Matching using Large Language ModelsCode1
Using ChatGPT for Entity MatchingCode1
Unicorn: A Unified Multi-tasking Model for Supporting Matching Tasks in Data IntegrationCode1
Pre-trained Embeddings for Entity Resolution: An Experimental Analysis [Experiment, Analysis & Benchmark]Code1
WDC Products: A Multi-Dimensional Entity Matching BenchmarkCode1
PIZZA: A new benchmark for complex end-to-end task-oriented parsingCode1
Estimating the Performance of Entity Resolution Algorithms: Lessons Learned Through PatentsView.orgCode1
Entity Resolution with Hierarchical Graph Attention NetworksCode1
Domain Adaptation for Deep Entity Resolution: A Design Space ExplorationCode1
A Critical Re-evaluation of Neural Methods for Entity AlignmentCode1
Supervised Contrastive Learning for Product MatchingCode1
Dual-Objective Fine-Tuning of BERT for Entity MatchingCode1
Rotom: A Meta-Learned Data Augmentation Framework for Entity Matching, Data Cleaning, Text Classification, and BeyondCode1
Deep Indexed Active Learning for Matching Heterogeneous Entity RepresentationsCode1
A Deep Learning Approach to Geographical Candidate Selection through Toponym MatchingCode1
Intermediate Training of BERT for Product MatchingCode1
Deep Entity Matching with Pre-Trained Language ModelsCode1
AutoBlock: A Hands-off Blocking Framework for Entity MatchingCode1
A Practioner's Guide to Evaluating Entity Resolution ResultsCode1
ChatPD: An LLM-driven Paper-Dataset Networking SystemCode0
Automated Construction of a Knowledge Graph of Nuclear Fusion Energy for Effective Elicitation and Retrieval of Information0
Text2Tracks: Prompt-based Music Recommendation via Generative RetrievalCode0
From Documents to Dialogue: Building KG-RAG Enhanced AI Assistants0
Leveraging User-Generated Metadata of Online Videos for Cover Song Identification0
Leveraging large language models for efficient representation learning for entity resolution0
Bonafide at LegalLens 2024 Shared Task: Using Lightweight DeBERTa Based Encoder For Legal Violation Detection and ResolutionCode0
Gem: Gaussian Mixture Model Embeddings for Numerical Feature Distributions0
T-KAER: Transparency-enhanced Knowledge-Augmented Entity Resolution FrameworkCode0
Learning variant product relationship and variation attributes from e-commerce website structures0
Entity Augmentation for Efficient Classification of Vertically Partitioned Data with Limited Overlap0
Learning from Natural Language Explanations for Generalizable Entity Matching0
Towards Universal Dense Blocking for Entity ResolutionCode0
Methods for Matching English Language Addresses0
Neural Locality Sensitive Hashing for Entity Blocking0
Spatial Entity Resolution between Restaurant Locations and Transportation Destinations in Southeast Asia0
On Leveraging Large Language Models for Enhancing Entity Resolution: A Cost-efficient Approach0
Cost-Efficient Prompt Engineering for Unsupervised Entity Resolution0
Graph Representation Learning Towards Patents Network Analysis0
Labeling without Seeing? Blind Annotation for Privacy-Preserving Entity Resolution0
Revisiting Prompt Engineering via Declarative Crowdsourcing0
Named Entity Resolution in Personal Knowledge Graphs0
A Critical Re-evaluation of Benchmark Datasets for (Deep) Learning-Based Matching AlgorithmsCode0
Record Deduplication for Entity Distribution Modeling in ASR Transcripts0
Combining Global and Local Merges in Logic-based Entity Resolution0
Beyond Rule-based Named Entity Recognition and Relation Extraction for Process Model Generation from Natural Language Text0
Show:102550
← PrevPage 1 of 4Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1gpt4-0613_fewshot-10F1 (%)85.21Unverified
2gpt-4o-mini-2024-07-18_fine_tunedF1 (%)80.25Unverified
3RoBERTa-SupConF1 (%)79.28Unverified
4RobEMF1 (%)79.06Unverified
5Random ForestF1 (%)79Unverified
6HGF1 (%)76.4Unverified
7DittoF1 (%)75.58Unverified
8CorDEL-SumF1 (%)70.2Unverified
9DeepMatcher - HybridF1 (%)69.3Unverified
10D-HATF1 (%)67.5Unverified
#ModelMetricClaimedVerifiedStatus
1gpt4-0613_zeroshotF1 (%)95.78Unverified
2RoBERTa-SupConF1 (%)94.29Unverified
3gpt-4o-mini-2024-07-18_fine_tunedF1 (%)94.09Unverified
4gpt-4o-2024-08-06F1 (%)92.2Unverified
5RobEMF1 (%)90.9Unverified
6HGF1 (%)89.8Unverified
7DittoF1 (%)89.33Unverified
8gpt-4o-mini-2024-07-18F1 (%)87.68Unverified
9Meta-Llama-3.1-8B-Instruct_fine_tunedF1 (%)87.34Unverified
10Random ForestF1 (%)85Unverified
#ModelMetricClaimedVerifiedStatus
1gpt4-0613_zeroshotF1 (%)89.61Unverified
2gpt-4o-2024-08-06_fine_tuned_wdc_smallF1 (%)87.1Unverified
3gpt-4o-mini-2024-07-18_structured_explanationsF1 (%)84.38Unverified
4gpt-4o-mini-2024-07-18F1 (%)81.61Unverified
5RoBERTa-SupConF1 (%)79.99Unverified
6Llama3.1_70B_structured_explanationsF1 (%)76.7Unverified
7Llama3.1_70BF1 (%)75.2Unverified
8Llama3.1_8B_error-based_example_selectionF1 (%)74.37Unverified
9Llama3.1_8B_structured_explanationsF1 (%)74.13Unverified
10DittoF1 (%)73.93Unverified
#ModelMetricClaimedVerifiedStatus
1BERTF1 (%)96.53Unverified
2RoBERTa-SupConF1 (%)95.21Unverified
3HGF1 (%)88.5Unverified
4DADER-MMDF1 (%)88Unverified
5DittoF1 (%)80.76Unverified
6JointBERTF1 (%)77.55Unverified
#ModelMetricClaimedVerifiedStatus
1RoBERTa-SupConF1 (%)98.33Unverified
2JointBERTF1 (%)97.49Unverified
3BERTF1 (%)97.37Unverified
4HGF1 (%)96.5Unverified
5DittoF1 (%)95.45Unverified
6Random ForestF1 (%)78Unverified
#ModelMetricClaimedVerifiedStatus
1RoBERTa-baseF1 (%)71.14Unverified
2DittoF1 (%)70.66Unverified
3HGF1 (%)68.74Unverified
4RoBERTa-SupConF1 (%)57.23Unverified
#ModelMetricClaimedVerifiedStatus
1HGF1 (%)94Unverified
2DADER-NoDAF1 (%)88.6Unverified
3DittoF1 (%)85.12Unverified
4JointBERTF1 (%)75.83Unverified
#ModelMetricClaimedVerifiedStatus
1ALMSER-GBF10.95Unverified
2FAMER-SplitMergeF10.88Unverified
3FAMER-SplitF10.84Unverified
#ModelMetricClaimedVerifiedStatus
1JointBERTF1 (%)97.09Unverified
2DittoF1 (%)96.53Unverified
3HGF1 (%)96.5Unverified
#ModelMetricClaimedVerifiedStatus
1RoBERTa-SupConF1 Micro88.63Unverified
2RoBERTa-baseF1 Micro52.03Unverified
#ModelMetricClaimedVerifiedStatus
1gpt-4o-2024-08-06_fine_tuned_wdc_smallF1 (%)87.07Unverified