SOTAVerified

Blocking

Entity resolution (also known as entity matching, record linkage, or duplicate detection) is the task of finding records that refer to the same real-world entity across different data sources (e.g., data files, books, websites, and databases). (Source: Wikipedia)

Blocking is a crucial step in any entity resolution pipeline because a pair-wise comparison of all records across two data sources is infeasible. Blocking applies a computationally cheap method to generate a smaller set of candidate record pairs reducing the workload of the matcher. During matching a more expensive pair-wise matcher generates a final set of matching record pairs.

Survey on blocking:

Papers

Showing 501524 of 524 papers

TitleStatusHype
DS-MLR: Exploiting Double Separability for Scaling up Distributed Multinomial Logistic RegressionCode0
AdVersarial: Perceptual Ad Blocking meets Adversarial Machine LearningCode0
DiffIM: Differentiable Influence Minimization with Surrogate Modeling and Continuous RelaxationCode0
Using Explainable AI and Transfer Learning to understand and predict the maintenance of Atlantic blocking with limited observational dataCode0
System Identification with Biophysical Constraints: A Circuit Model of the Inner RetinaCode0
node2bits: Compact Time- and Attribute-aware Node Representations for User StitchingCode0
Detecting DGA domains with recurrent neural networks and side informationCode0
Concentration inequality for U-statistics of order two for uniformly ergodic Markov chainsCode0
Evaluating Blocking Biases in Entity MatchingCode0
A Rule Mining-Based Advanced Persistent Threats Detection SystemCode0
Blocking of the CD80/86 axis as a therapeutic approach to prevent progression to more severe forms of COVID-19Code0
Exploration Policies for On-the-Fly Controller Synthesis: A Reinforcement Learning ApproachCode0
Scaling the Wild: Decentralizing Hogwild!-style Shared-memory SGDCode0
Learning to Customize Network Security RulesCode0
Learning to Discriminate Perturbations for Blocking Adversarial Attacks in Text ClassificationCode0
Cleaning Noisy and Heterogeneous Metadata for Record Linking Across Scholarly Big DatasetsCode0
SC-Block: Supervised Contrastive Blocking within Entity Resolution PipelinesCode0
No-Reference Image Quality Assessment in the Spatial DomainCode0
Not All Videos Become Outdated: Short-Video Recommendation by Learning to Deconfound Release Interval BiasCode0
Destruction of Image Steganography using Generative Adversarial NetworksCode0
Linking Cryptoasset Attribution Tags to Knowledge Graph Entities: An LLM-based ApproachCode0
Ethnicity sensitive author disambiguation using semi-supervised learningCode0
Pushing the Limits of Extreme Weather: Constructing Extreme Heatwave Storylines with Differentiable Climate ModelsCode0
S-DABT: Schedule and Dependency-Aware Bug Triage in Open-Source Bug Tracking SystemsCode0
Show:102550
← PrevPage 11 of 11Next →

No leaderboard results yet.