SOTAVerified

Blocking

Entity resolution (also known as entity matching, record linkage, or duplicate detection) is the task of finding records that refer to the same real-world entity across different data sources (e.g., data files, books, websites, and databases). (Source: Wikipedia)

Blocking is a crucial step in any entity resolution pipeline because a pair-wise comparison of all records across two data sources is usually infeasible: the number of pairs grows with the product of the two source sizes. Blocking applies a computationally cheap method to generate a smaller set of candidate record pairs, reducing the workload of the matcher. During matching, a more expensive pair-wise matcher then generates the final set of matching record pairs from those candidates.
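As a rough illustration of the idea, the sketch below uses simple key-based blocking, which is only one of many blocking methods and is not taken from any paper listed here. The record layout, the `blocking_key` function (first three lowercase characters of a name), and the sample data are all assumptions made for the example.

```python
from collections import defaultdict
from itertools import product


def blocking_key(record):
    """Hypothetical cheap key: first three lowercase characters of the name."""
    return record["name"].strip().lower()[:3]


def candidate_pairs(source_a, source_b, key=blocking_key):
    """Return (id_a, id_b) pairs whose records share a blocking key."""
    # Each block holds the ids from source A and from source B separately.
    blocks = defaultdict(lambda: ([], []))
    for rec in source_a:
        blocks[key(rec)][0].append(rec["id"])
    for rec in source_b:
        blocks[key(rec)][1].append(rec["id"])

    # Only records that land in the same block are paired up.
    pairs = []
    for ids_a, ids_b in blocks.values():
        pairs.extend(product(ids_a, ids_b))
    return pairs


# Made-up sample records, for illustration only.
source_a = [
    {"id": "a1", "name": "Johnson & Sons"},
    {"id": "a2", "name": "Acme Corp"},
    {"id": "a3", "name": "Zenith Ltd"},
]
source_b = [
    {"id": "b1", "name": "johnson and sons"},
    {"id": "b2", "name": "ACME Corporation"},
    {"id": "b3", "name": "Globex"},
]

pairs = candidate_pairs(source_a, source_b)
print(len(pairs), "candidate pairs instead of", len(source_a) * len(source_b))
```

Only the pairs returned by `candidate_pairs` would be handed to the more expensive matcher. A key this crude can also miss true matches (for example, names that differ in their first characters), which is the usual trade-off between reducing comparisons and keeping recall.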

Survey on blocking:

Papers

Showing 76-100 of 524 papers

Title | Status | Hype
Linking Cryptoasset Attribution Tags to Knowledge Graph Entities: An LLM-based Approach | Code | 0
BAARD: Blocking Adversarial Examples by Testing for Applicability, Reliability and Decidability | Code | 0
How Useful is Intermittent, Asynchronous Expert Feedback for Bayesian Optimization? | Code | 0
ISLAND: Interpolating Land Surface Temperature using land cover | Code | 0
From Neural Re-Ranking to Neural Ranking: Learning a Sparse Representation for Inverted Indexing | Code | 0
ExtremeWeather: A large-scale climate dataset for semi-supervised detection, localization, and understanding of extreme weather events | Code | 0
Evaluating Blocking Biases in Entity Matching | Code | 0
Foundation for unbiased cross-validation of spatio-temporal models for species distribution modeling | Code | 0
A Stochastic Approximation Approach for Efficient Decentralized Optimization on Random Networks | Code | 0
Blocking of the CD80/86 axis as a therapeutic approach to prevent progression to more severe forms of COVID-19 | Code | 0
Effective Tensor Completion via Element-wise Weighted Low-rank Tensor Train with Overlapping Ket Augmentation | Code | 0
Efficient MPI-based Communication for GPU-Accelerated Dask Applications | Code | 0
Emergent Complexity via Multi-Agent Competition | Code | 0
The dynamic interplay between in-context and in-weight learning in humans and neural networks | Code | 0
BlueTempNet: A Temporal Multi-network Dataset of Social Interactions in Bluesky Social | Code | 0
AdGraph: A Graph-Based Approach to Ad and Tracker Blocking | Code | 0
Ethnicity sensitive author disambiguation using semi-supervised learning | Code | 0
Detecting DGA domains with recurrent neural networks and side information | Code | 0
Learning to Customize Network Security Rules | Code | 0
An efficient deep convolutional laplacian pyramid architecture for CS reconstruction at low sampling ratios | Code | 0
DiffIM: Differentiable Influence Minimization with Surrogate Modeling and Continuous Relaxation | Code | 0
Deep Learning Meets Teleconnections: Improving S2S Predictions for European Winter Weather | Code | 0
Destruction of Image Steganography using Generative Adversarial Networks | Code | 0
DS-MLR: Exploiting Double Separability for Scaling up Distributed Multinomial Logistic Regression | Code | 0
BFRFormer: Transformer-based generator for Real-World Blind Face Restoration | Code | 0

No leaderboard results yet.