SOTAVerified

Blocking

Entity resolution (also known as entity matching, record linkage, or duplicate detection) is the task of finding records that refer to the same real-world entity across different data sources (e.g., data files, books, websites, and databases). (Source: Wikipedia)

Blocking is a crucial step in any entity resolution pipeline because a pairwise comparison of all records across two data sources is usually infeasible: the number of pairs grows with the product of the two source sizes. Blocking applies a computationally cheap method to generate a smaller set of candidate record pairs, reducing the workload of the matcher. During matching, a more expensive pairwise matcher then generates the final set of matching record pairs from those candidates.
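As a rough illustration of the idea, the following minimal sketch shows one simple form of blocking, key-based (standard) blocking, in which only records sharing a cheap key become candidate pairs. The record layout, the `blocking_key` and `candidate_pairs` helpers, and the choice of a three-character name prefix as the key are all assumptions made for this example, not a method taken from any of the listed papers.

```python
from collections import defaultdict
from itertools import product


def blocking_key(record):
    """Hypothetical cheap key: the first three lowercase characters of the name."""
    return record["name"].strip().lower()[:3]


def candidate_pairs(source_a, source_b, key=blocking_key):
    """Return the set of (id_a, id_b) pairs whose records share a blocking key."""
    blocks_a = defaultdict(list)
    blocks_b = defaultdict(list)
    for record in source_a:
        blocks_a[key(record)].append(record["id"])
    for record in source_b:
        blocks_b[key(record)].append(record["id"])

    pairs = set()
    # Only keys present in both sources can produce cross-source candidates.
    for shared_key in blocks_a.keys() & blocks_b.keys():
        pairs.update(product(blocks_a[shared_key], blocks_b[shared_key]))
    return pairs


# Toy data, invented for illustration only.
source_a = [
    {"id": "a1", "name": "Jonathan Smith"},
    {"id": "a2", "name": "Maria Garcia"},
    {"id": "a3", "name": "Wei Zhang"},
]
source_b = [
    {"id": "b1", "name": "Jon Smith"},
    {"id": "b2", "name": "Mariah Garcia"},
    {"id": "b3", "name": "Peter Novak"},
]

pairs = candidate_pairs(source_a, source_b)
print(sorted(pairs))
print(f"{len(pairs)} candidate pairs instead of {len(source_a) * len(source_b)}")
```

In this toy case the expensive matcher would only need to score the surviving candidate pairs rather than every cross-source combination. The trade-off is recall: a key that is too strict (for example, a prefix broken by a typo in the first characters) can drop true matches before the matcher ever sees them.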

Survey on blocking:

Papers

Showing 421–430 of 524 papers

Title | Status | Hype
Blocking Bandits | | 0
Why Blocking Targeted Adversarial Perturbations Impairs the Ability to Learn | | 0
Generalizing from a few environments in safety-critical reinforcement learning | | 0
Dynamically Stable Matching | | 0
Cleaning Noisy and Heterogeneous Metadata for Record Linking Across Scholarly Big Datasets | Code | 0
EXmatcher: Combining Features Based on Reference Strings and Segments to Enhance Citation Matching | | 0
Coalitions in Repeated Games | | 0
Hypothetical answers to continuous queries over data streams | | 0
Percival: Making In-Browser Perceptual Ad Blocking Practical With Deep Learning | Code | 0
Cloud Storage for Multi-Service Battery Operation (Extended Version) | | 0

No leaderboard results yet.