Blocking

Entity resolution (also known as entity matching, record linkage, or duplicate detection) is the task of finding records that refer to the same real-world entity across different data sources (e.g., data files, books, websites, and databases). (Source: Wikipedia)

Blocking is a crucial step in any entity resolution pipeline because a pair-wise comparison of all records across two data sources is infeasible. Blocking applies a computationally cheap method to generate a smaller set of candidate record pairs reducing the workload of the matcher. During matching a more expensive pair-wise matcher generates a final set of matching record pairs.

Survey on blocking:

Papadakis et al.: Blocking and Filtering Techniques for Entity Resolution: A Survey, 2020.

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–25 of 524 papers

Title	Date	Tasks	Status	Hype
SemanticDraw: Towards Real-Time Interactive Content Creation from Image Diffusion Models	Mar 14, 2024	BlockingGPU	CodeCode Available	4
Open CaptchaWorld: A Comprehensive Web-based Platform for Testing and Benchmarking Multimodal LLM Agents	May 30, 2025	BenchmarkingBlocking	CodeCode Available	2
Efficient LLM Scheduling by Learning to Rank	Aug 28, 2024	BlockingChatbot	CodeCode Available	2
AdFlush: A Real-World Deployable Machine Learning Solution for Effective Advertisement and Web Tracker Prevention	May 13, 2024	BlockingCPU	CodeCode Available	2
ScatterFormer: Efficient Voxel Transformer with Scattered Linear Attention	Jan 1, 2024	Blocking	CodeCode Available	2
Wavelet Diffusion Models are fast and scalable Image Generators	Nov 29, 2022	BlockingImage Generation	CodeCode Available	2
SINet: Extreme Lightweight Portrait Segmentation Networks with Spatial Squeeze Modules and Information Blocking Decoder	Nov 20, 2019	BlockingDecoder	CodeCode Available	2
NoLoCo: No-all-reduce Low Communication Training Method for Large Models	Jun 12, 2025	AllBlocking	CodeCode Available	1
Progent: Programmable Privilege Control for LLM Agents	Apr 16, 2025	Blocking	CodeCode Available	1
CORBA: Contagious Recursive Blocking Attacks on Multi-Agent Systems Based on Large Language Models	Feb 20, 2025	BlockingLanguage Modeling	CodeCode Available	1
Reinforcement Learning for Dynamic Resource Allocation in Optical Networks: Hype or Hope?	Feb 18, 2025	BenchmarkingBlocking	CodeCode Available	1
Gandalf the Red: Adaptive Security for LLMs	Jan 14, 2025	BlockingLanguage Modeling	CodeCode Available	1
Multi-granularity Contrastive Cross-modal Collaborative Generation for End-to-End Long-term Video Question Answering	Oct 12, 2024	Answer GenerationBlocking	CodeCode Available	1
Queue management for slo-oriented large language model serving	Jun 5, 2024	BlockingGPU	CodeCode Available	1
Efficient Interactive LLM Serving with Proxy Model-based Sequence Length Prediction	Apr 12, 2024	BlockingManagement	CodeCode Available	1
Masked Graph Autoencoder with Non-discrete Bandwidths	Feb 6, 2024	BlockingLink Prediction	CodeCode Available	1
Boosting Multi-view Stereo with Late Cost Aggregation	Jan 22, 2024	BlockingGeometric Matching	CodeCode Available	1
AutoDAN: Interpretable Gradient-Based Adversarial Attacks on Large Language Models	Oct 23, 2023	Adversarial AttackBlocking	CodeCode Available	1
A Novel Geo-Localization Method for UAV and Satellite Images Using Cross-View Consistent Attention	Sep 23, 2023	BlockingData Augmentation	CodeCode Available	1
LinkTransformer: A Unified Package for Record Linkage with Transformer Language Models	Sep 2, 2023	BlockingLanguage Modelling	CodeCode Available	1
AltDiffusion: A Multilingual Text-to-Image Diffusion Model	Aug 19, 2023	BlockingConcept Alignment	CodeCode Available	1
O^2-Recon: Completing 3D Reconstruction of Occluded Objects in the Scene with a Pre-trained 2D Diffusion Model	Aug 18, 2023	3D ReconstructionBlocking	CodeCode Available	1
GraphSHA: Synthesizing Harder Samples for Class-Imbalanced Node Classification	Jun 16, 2023	BlockingClassification	CodeCode Available	1
Path-Specific Counterfactual Fairness for Recommender Systems	Jun 5, 2023	Blockingcounterfactual	CodeCode Available	1
Road Planning for Slums via Deep Reinforcement Learning	May 22, 2023	BlockingDeep Reinforcement Learning	CodeCode Available	1

Show:10 25 50

← PrevPage 1 of 21Next →

No leaderboard results yet.