SOTAVerified

Blocking

Entity resolution (also known as entity matching, record linkage, or duplicate detection) is the task of finding records that refer to the same real-world entity across different data sources (e.g., data files, books, websites, and databases). (Source: Wikipedia)

Blocking is a crucial step in any entity resolution pipeline because a pair-wise comparison of all records across two data sources is infeasible. Blocking applies a computationally cheap method to generate a smaller set of candidate record pairs reducing the workload of the matcher. During matching a more expensive pair-wise matcher generates a final set of matching record pairs.

Survey on blocking:

Papers

Showing 351375 of 524 papers

TitleStatusHype
Machine learning-based network intrusion detection for big and imbalanced data using oversampling, stacking feature embedding and feature extraction0
Machine Learning based Post Processing Artifact Reduction in HEVC Intra Coding0
Machine Unlearning: its nature, scope, and importance for a "delete culture"0
MagicEyes: A Large Scale Eye Gaze Estimation Dataset for Mixed Reality0
Magnetic properties of photosynthetic materials - a nano scale study0
Mass campaigns with antimalarial drugs: a modelling comparison of artemether-lumefantrine and DHA-piperaquine with and without primaquine as tools for malaria control and elimination0
Massive Dimensions Reduction and Hybridization with Meta-heuristics in Deep Learning0
Matching Markets Meet LLMs: Algorithmic Reasoning with Ranked Preferences0
Matching, Unanticipated Experiences, Divorce, Flirting, Rematching, Etc0
mFabric: An Efficient and Scalable Fabric for Mixture-of-Experts Training0
MG-DVD: A Real-time Framework for Malware Variant Detection Based on Dynamic Heterogeneous Graph Learning0
MIMO Antenna Elements Effect on Chassis Modes0
Minimizing Instability in Strategy-Proof Matching Mechanism Using A Linear Programming Approach0
Minimizing the Societal Cost of Credit Card Fraud with Limited and Imbalanced Data0
ML Estimation and CRBs for Reverberation, Speech and Noise PSDs in Rank-Deficient Noise-Field0
Medha: Efficiently Serving Multi-Million Context Length LLM Inference Requests Without Approximations0
Model Extraction Attacks on Split Federated Learning0
Modeling Retinal Ganglion Cell Population Activity with Restricted Boltzmann Machines0
Modelling the ziji Blocking Effect and Constraining Bound Variable Derivations in MC-TAG with Delayed Locality0
Minimizing Robot Navigation-Graph For Position-Based Predictability By Humans0
Model predictive control with dynamic move blocking0
Model Properties for Efficient Synthesis of Nonblocking Modular Supervisors0
MUCIC at ComMA@ICON: Multilingual Gender Biased and Communal Language Identification Using N-grams and Multilingual Sentence Encoders0
Multi-perspective Memory Enhanced Network for Identifying Key Nodes in Social Networks0
Multiple Latent Space Mapping for Compressed Dark Image Enhancement0
Show:102550
← PrevPage 15 of 21Next →

No leaderboard results yet.