SOTAVerified

Blocking

Entity resolution (also known as entity matching, record linkage, or duplicate detection) is the task of finding records that refer to the same real-world entity across different data sources (e.g., data files, books, websites, and databases). (Source: Wikipedia)

Blocking is a crucial step in any entity resolution pipeline because the number of pair-wise comparisons between all records of two data sources grows with the product of their sizes, which quickly becomes infeasible. Blocking applies a computationally cheap method to generate a smaller set of candidate record pairs, reducing the workload of the matcher. During matching, a more expensive pair-wise matcher generates the final set of matching record pairs from these candidates.
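
The idea can be illustrated with a minimal sketch of key-based blocking, one simple blocking scheme among many. Everything in it is an assumption made for illustration and is not taken from any paper listed on this page: the record layout (`id`, `name`), the hypothetical `blocking_key` function (first three characters of the lowercased name), and the toy data. Only records that share a key become candidate pairs for the more expensive matcher.

```python
from collections import defaultdict
from itertools import product


def blocking_key(record):
    """Cheap, hypothetical key: first three characters of the lowercased name."""
    return record["name"].strip().lower()[:3]


def candidate_pairs(source_a, source_b, key=blocking_key):
    """Return the set of (id_a, id_b) pairs whose records share a blocking key."""
    # Each block holds the ids from source A and the ids from source B
    # that were assigned the same key.
    blocks = defaultdict(lambda: ([], []))
    for rec in source_a:
        blocks[key(rec)][0].append(rec["id"])
    for rec in source_b:
        blocks[key(rec)][1].append(rec["id"])

    pairs = set()
    for ids_a, ids_b in blocks.values():
        # Only cross-source pairs inside the same block are kept.
        pairs.update(product(ids_a, ids_b))
    return pairs


# Toy data (invented for this example).
source_a = [
    {"id": "a1", "name": "Jonathan Smith"},
    {"id": "a2", "name": "Maria Garcia"},
    {"id": "a3", "name": "Wei Chen"},
]
source_b = [
    {"id": "b1", "name": "Jon Smith"},
    {"id": "b2", "name": "Mariah Garcia"},
    {"id": "b3", "name": "Peter Novak"},
]

pairs = candidate_pairs(source_a, source_b)
all_pairs = len(source_a) * len(source_b)
print(sorted(pairs), f"{len(pairs)} of {all_pairs} pairs kept")
```

On this toy data the matcher would only need to score 2 of the 9 possible pairs. The trade-off is recall: a true match whose records fall into different blocks (for example, because of a typo in the first characters) is never seen by the matcher, which is why the choice of key matters.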

Survey on blocking:

Papers

Showing 51-75 of 524 papers

Title | Status | Hype
Should Graph Convolution Trust Neighbors? A Simple Causal Inference Method | Code | 1
Learning a Single Model with a Wide Range of Quality Factors for JPEG Image Artifacts Removal | Code | 1
AMP-Net: Denoising based Deep Unfolding for Compressive Image Sensing | Code | 1
AutoBlock: A Hands-off Blocking Framework for Entity Matching | Code | 1
Neural Text Generation with Unlikelihood Training | Code | 1
An introduction to Causal Modelling | | 0
Pushing the Limits of Extreme Weather: Constructing Extreme Heatwave Storylines with Differentiable Climate Models | Code | 0
Matching Markets Meet LLMs: Algorithmic Reasoning with Ranked Preferences | | 0
Challenges in Automated Processing of Speech from Child Wearables: The Case of Voice Type Classifier | | 0
The Coupling Effect of Sensing Targets on the Environment for 3GPP ISAC Channels: Observation, Modeling, and Validation | | 0
Decoding Knowledge Attribution in Mixture-of-Experts: A Framework of Basic-Refinement Collaboration and Efficiency Analysis | | 0
Sensitivity of DC Network Representation for GIC Analysis | | 0
Derailing Non-Answers via Logit Suppression at Output Subspace Boundaries in RLHF-Aligned Language Models | | 0
Streamlining Resilient Kubernetes Autoscaling with Multi-Agent Systems via an Automated Online Design Framework | | 0
Generative RLHF-V: Learning Principles from Multi-modal Human Preference | | 0
AI-empowered Channel Estimation for Block-based Active IRS-enhanced Hybrid-field IoT Network | | 0
ELIS: Efficient LLM Iterative Scheduling System with Response Length Predictor | | 0
Non-Blocking Robustness Analysis in Discrete Event Systems | | 0
Using mathematical models of heart cells to assess the safety of new pharmaceutical drugs | | 0
Leveraging Language Models for Automated Patient Record Linkage | | 0
LithOS: An Operating System for Efficient Machine Learning on GPUs | | 0
Beamforming Design and Association Scheme for Multi-RIS Multi-User mmWave Systems Through Graph Neural Networks | | 0
Improvable Students in School Choice | | 0
Ctrl-Z: Controlling AI Agents via Resampling | | 0
Statistical Linear Regression Approach to Kalman Filtering and Smoothing under Cyber-Attacks | | 0

No leaderboard results yet.