SOTAVerified

Blocking

Entity resolution (also known as entity matching, record linkage, or duplicate detection) is the task of finding records that refer to the same real-world entity across different data sources (e.g., data files, books, websites, and databases). (Source: Wikipedia)

Blocking is a crucial step in any entity resolution pipeline because a pairwise comparison of all records across two data sources is infeasible at scale: the number of pairs grows with the product of the two source sizes. Blocking applies a computationally cheap method to generate a smaller set of candidate record pairs, reducing the workload of the matcher. During matching, a more expensive pairwise matcher generates the final set of matching record pairs from these candidates.
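As a rough illustration of the idea, the sketch below uses standard key-based blocking, which is only one of many blocking methods. The key function (first three characters of a lowercased `name` field), the record layout, and the sample data are all assumptions made for this example and do not come from any particular paper listed here.

```python
from collections import defaultdict
from itertools import product


def blocking_key(record):
    """Hypothetical cheap key: first three characters of the lowercased name."""
    return record["name"].strip().lower()[:3]


def candidate_pairs(source_a, source_b, key=blocking_key):
    """Return (id_a, id_b) pairs whose records fall into the same block."""
    # Each block holds the ids from source A and the ids from source B.
    blocks = defaultdict(lambda: ([], []))
    for rec in source_a:
        blocks[key(rec)][0].append(rec["id"])
    for rec in source_b:
        blocks[key(rec)][1].append(rec["id"])

    pairs = set()
    for ids_a, ids_b in blocks.values():
        # Only records sharing a block are ever compared by the matcher.
        pairs.update(product(ids_a, ids_b))
    return pairs


# Made-up sample records, for illustration only.
source_a = [
    {"id": "a1", "name": "Apple iPhone 12"},
    {"id": "a2", "name": "Samsung Galaxy S21"},
    {"id": "a3", "name": "Sony WH-1000XM4"},
]
source_b = [
    {"id": "b1", "name": "apple iphone 12 64GB"},
    {"id": "b2", "name": "Samsung Galaxy S21 5G"},
    {"id": "b3", "name": "Google Pixel 5"},
]

pairs = candidate_pairs(source_a, source_b)
total = len(source_a) * len(source_b)
print(sorted(pairs))
print(f"{len(pairs)} candidate pairs instead of {total}")
```

Here the matcher would only need to score the pairs that share a block rather than the full cross product. A key this crude can also miss true matches (for example, names with a leading typo), which is why blocking methods are usually judged on both how many pairs they prune and how many true matches they keep.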

Survey on blocking:

Papers

Showing 71-80 of 524 papers

Title | Status | Hype
Foundation for unbiased cross-validation of spatio-temporal models for species distribution modeling | Code | 0
ExtremeWeather: A large-scale climate dataset for semi-supervised detection, localization, and understanding of extreme weather events | Code | 0
Ethnicity sensitive author disambiguation using semi-supervised learning | Code | 0
Emergent Complexity via Multi-Agent Competition | Code | 0
Evaluating Blocking Biases in Entity Matching | Code | 0
From Neural Re-Ranking to Neural Ranking: Learning a Sparse Representation for Inverted Indexing | Code | 0
Efficient MPI-based Communication for GPU-Accelerated Dask Applications | Code | 0
AdGraph: A Graph-Based Approach to Ad and Tracker Blocking | Code | 0
DS-MLR: Exploiting Double Separability for Scaling up Distributed Multinomial Logistic Regression | Code | 0
Effective Model of Loop Extrusion Predicts Chromosomal Domains | Code | 0

No leaderboard results yet.