SOTAVerified

Blocking

Entity resolution (also known as entity matching, record linkage, or duplicate detection) is the task of finding records that refer to the same real-world entity across different data sources (e.g., data files, books, websites, and databases). (Source: Wikipedia)

Blocking is a crucial step in any entity resolution pipeline because a pair-wise comparison of all records across two data sources is infeasible at scale: the number of pairs grows with the product of the two source sizes. Blocking applies a computationally cheap method to generate a smaller set of candidate record pairs, reducing the workload of the matcher. During matching, a more expensive pair-wise matcher then evaluates only those candidates and produces the final set of matching record pairs.
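As a rough illustration of the idea, the sketch below uses one of the simplest possible blocking schemes: records are grouped by a cheap key, and only records that share a key become candidate pairs. The key function (first three letters of a lowercased name), the record layout, and the sample data are all assumptions made up for this example; they do not come from any of the listed papers, and real systems typically use more robust keys or learned blockers.

```python
from collections import defaultdict


def blocking_key(record):
    """Hypothetical cheap key: first three letters of the lowercased name."""
    return record["name"].strip().lower()[:3]


def candidate_pairs(source_a, source_b, key=blocking_key):
    """Return (id_a, id_b) pairs whose records fall into the same block."""
    # Index the second source by blocking key (one pass, no comparisons).
    blocks = defaultdict(list)
    for rec in source_b:
        blocks[key(rec)].append(rec)

    # Pair each record of the first source only with records in its block.
    pairs = []
    for rec_a in source_a:
        for rec_b in blocks.get(key(rec_a), []):
            pairs.append((rec_a["id"], rec_b["id"]))
    return pairs


# Made-up sample data for illustration only.
source_a = [
    {"id": "a1", "name": "Jonathan Smith"},
    {"id": "a2", "name": "Maria Garcia"},
    {"id": "a3", "name": "Wei Chen"},
]
source_b = [
    {"id": "b1", "name": "Jon Smith"},
    {"id": "b2", "name": "Maria Garcia-Lopez"},
    {"id": "b3", "name": "Peter Novak"},
]

pairs = candidate_pairs(source_a, source_b)
print(pairs)
print(f"{len(pairs)} candidate pairs instead of {len(source_a) * len(source_b)}")
```

Only the surviving candidate pairs would be handed to the expensive matcher. The trade-off is that a key this crude can also discard true matches (for example, names with a typo in the first three letters), which is why blocking quality is usually judged on both the reduction in pairs and the share of true matches retained.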

Survey on blocking:

Papers

Showing 326-350 of 524 papers

Title | Status | Hype
IPM Move Planner: AN EFFICIENT EXPLOITING DEEP REINFORCEMENT LEARNING WITH MONTE CARLO TREE SEARCH | - | 0
Jointly Complementary&Competitive Influence Maximization with Concurrent Ally-Boosting and Rival-Preventing | - | 0
JPEG Artifacts Reduction via Deep Convolutional Sparse Coding | - | 0
Knowledge Graph Guided Evaluation of Abstention Techniques | - | 0
L0-norm Sparse Graph-regularized SVD for Biclustering | - | 0
LadaBERT: Lightweight Adaptation of BERT through Hybrid Model Compression | - | 0
Large Language Model-driven Multi-Agent Simulation for News Diffusion Under Different Network Structures | - | 0
Learned Block-based Hybrid Image Compression | - | 0
LearnedKV: Integrating LSM and Learned Index for Superior Performance on Storage | - | 0
Learned Video Compression | - | 0
Learning a Discrete Set of Optimal Allocation Rules in a Queueing System with Unknown Service Rate | - | 0
Learning a Virtual Codec Based on Deep Convolutional Neural Network to Compress Image | - | 0
Learning by Inertia: Self-supervised Monocular Visual Odometry for Road Vehicles | - | 0
Learning CNF Blocking for Large-scale Author Name Disambiguation | - | 0
Learning Dual Priors for JPEG Compression Artifacts Removal | - | 0
Learning to Succeed while Teaching to Fail: Privacy in Closed Machine Learning Systems | - | 0
Learning to Use Learners' Advice | - | 0
Leveraging Language Models for Automated Patient Record Linkage | - | 0
Leveraging large language models for efficient representation learning for entity resolution | - | 0
Leveraging Large Language Models to Predict Antibody Biological Activity Against Influenza A Hemagglutinin | - | 0
LithOS: An Operating System for Efficient Machine Learning on GPUs | - | 0
Local SGD Meets Asynchrony | - | 0
Long-distance anaphors and the blocking effect revisited-An East Asian perspective | - | 0
Look Before You Leap: Safe Model-Based Reinforcement Learning with Human Intervention | - | 0
LOS/NLOS Estimators for mmWave Cellular Systems With Blockages | - | 0

No leaderboard results yet.