SOTAVerified

Blocking

Entity resolution (also known as entity matching, record linkage, or duplicate detection) is the task of finding records that refer to the same real-world entity across different data sources (e.g., data files, books, websites, and databases). (Source: Wikipedia)

Blocking is a crucial step in any entity resolution pipeline because comparing every pair of records across two data sources is infeasible: the number of comparisons grows with the product of the two source sizes. Blocking applies a computationally cheap method to generate a smaller set of candidate record pairs, reducing the workload of the matcher. During matching, a more expensive pair-wise matcher then generates the final set of matching record pairs.
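The two stages described above can be sketched in a few lines of Python. This is a minimal illustration, not a reference implementation: the blocking key (first three letters of a `name` field), the similarity measure (`difflib.SequenceMatcher`), the 0.8 threshold, and the sample records are all assumptions chosen for the example.

```python
from collections import defaultdict
from difflib import SequenceMatcher
from itertools import product


def blocking_key(record):
    """Cheap key: first three characters of the lowercased name (illustrative choice)."""
    return record["name"].strip().lower()[:3]


def candidate_pairs(source_a, source_b):
    """Blocking step: return index pairs (i, j) whose records share a blocking key."""
    blocks = defaultdict(lambda: ([], []))
    for i, rec in enumerate(source_a):
        blocks[blocking_key(rec)][0].append(i)
    for j, rec in enumerate(source_b):
        blocks[blocking_key(rec)][1].append(j)
    pairs = []
    for ids_a, ids_b in blocks.values():
        # Only records inside the same block are ever compared.
        pairs.extend(product(ids_a, ids_b))
    return sorted(pairs)


def match(source_a, source_b, pairs, threshold=0.8):
    """Matching step: run the more expensive comparison on candidate pairs only."""
    matches = []
    for i, j in pairs:
        a = source_a[i]["name"].lower()
        b = source_b[j]["name"].lower()
        if SequenceMatcher(None, a, b).ratio() >= threshold:
            matches.append((i, j))
    return matches


# Hypothetical sample data, made up for this sketch.
source_a = [{"name": "Jonathan Smith"}, {"name": "Maria Garcia"}, {"name": "Wei Chen"}]
source_b = [{"name": "Jonathon Smith"}, {"name": "Mario Rossi"}, {"name": "Li Wei"}]

pairs = candidate_pairs(source_a, source_b)
matches = match(source_a, source_b, pairs)
print(len(pairs), "candidate pairs instead of", len(source_a) * len(source_b))
print("matches:", matches)
```

Here blocking cuts the nine possible comparisons down to two candidate pairs, and the matcher keeps only the pair whose names are sufficiently similar. A key this crude would miss matches whose first letters differ, which is the usual trade-off between reducing comparisons and preserving recall.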

Survey on blocking:

Papers

Showing 276–300 of 524 papers

Title | Status | Hype
Optimal and Low-Complexity Dynamic Spectrum Access for RF-Powered Ambient Backscatter System with Online Reinforcement Learning | | 0
Optimal occlusion uniformly partitions red blood cells fluxes within a microvascular network | | 0
Optimal Vaccination Policy to Prevent Endemicity: A Stochastic Model | | 0
Rail-only: A Low-Cost High-Performance Network for Training LLMs with Trillion Parameters | | 0
Optimizing Cyber Defense in Dynamic Active Directories through Reinforcement Learning | | 0
Optimizing Key-Selection for Face-based One-Time Biometrics via Morphing | | 0
Partition-based Stability of Coalitional Games | | 0
Partitioning Distributed Compute Jobs with Reinforcement Learning and Graph Neural Networks | | 0
PEA265: Perceptual Assessment of Video Compression Artifacts | | 0
Pedestrian Travel Time Estimation in Crowded Scenes | | 0
Pointwise shape-adaptive DCT for high-quality deblocking of compressed color images | | 0
Asynchronous Decentralized SGD with Quantized and Local Updates | | 0
Post-Training BatchNorm Recalibration | | 0
Potential of proteasome inhibitors to inhibit cytokine storm in critical stage COVID-19 patients | | 0
Precision-Enhanced Human-Object Contact Detection via Depth-Aware Perspective Interaction and Object Texture Restoration | | 0
Predicting Eye Fixations Under Distortion Using Bayesian Observers | | 0
Priority-Aware Preemptive Scheduling for Mixed-Priority Workloads in MoE Inference | | 0
Proactive Blockage Prediction for UAV assisted Handover in Future Wireless Network | | 0
Probabilistic Blocking with An Application to the Syrian Conflict | | 0
Probabilistic Duality for Parallel Gibbs Sampling without Graph Coloring | | 0
Protecting User Privacy in Online Settings via Supervised Learning | | 0
Qd-tree: Learning Data Layouts for Big Data Analytics | | 0
Rademacher upper bounds for cross-validation errors with an application to the lasso | | 0
Random Forest DBSCAN for USPTO Inventor Name Disambiguation | | 0
Rating the Crisis of Online Public Opinion Using a Multi-Level Index System | | 0

No leaderboard results yet.