SOTAVerified

Blocking

Entity resolution (also known as entity matching, record linkage, or duplicate detection) is the task of finding records that refer to the same real-world entity across different data sources (e.g., data files, books, websites, and databases). (Source: Wikipedia)

Blocking is a crucial step in any entity resolution pipeline because a pair-wise comparison of all records across two data sources is infeasible. Blocking applies a computationally cheap method to generate a smaller set of candidate record pairs reducing the workload of the matcher. During matching a more expensive pair-wise matcher generates a final set of matching record pairs.

Survey on blocking:

Papers

Showing 231240 of 524 papers

TitleStatusHype
SOS: Systematic Offensive Stereotyping Bias in Word Embeddings0
Calibrating Sequence likelihood Improves Conditional Language Generation0
Multi-scale Attention Network for Single Image Super-ResolutionCode1
S^2-Transformer for Mask-Aware Hyperspectral Image ReconstructionCode1
EnergonAI: An Inference System for 10-100 Billion Parameter Transformer Models0
Virtual Control Group: Measuring Hidden Performance Metrics0
A hybrid transmission model for Plasmodium vivax accounting for superinfection, immunity and the hypnozoite reservoir0
Non-Blocking Batch A* (Technical Report)0
Rating the Crisis of Online Public Opinion Using a Multi-Level Index System0
Network Coexistence Analysis of RIS-Assisted Wireless Communications0
Show:102550
← PrevPage 24 of 53Next →

No leaderboard results yet.