SOTAVerified

Blocking

Entity resolution (also known as entity matching, record linkage, or duplicate detection) is the task of finding records that refer to the same real-world entity across different data sources (e.g., data files, books, websites, and databases). (Source: Wikipedia)

Blocking is a crucial step in any entity resolution pipeline because a pair-wise comparison of all records across two data sources is infeasible for large inputs: the number of pairs grows with the product of the two source sizes. Blocking applies a computationally cheap method to generate a smaller set of candidate record pairs, reducing the workload of the matcher. During matching, a more expensive pair-wise matcher then processes only these candidates to produce the final set of matching record pairs.
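The two stages described above can be illustrated with a minimal sketch. It assumes a simple key-based blocking scheme, which is one of many possible blocking methods. The key function (first three characters of a lowercased name), the sample records, the similarity threshold, and all function names are hypothetical and chosen only for illustration. The "expensive" matcher is stood in for by Python's standard-library `difflib.SequenceMatcher`.

```python
from collections import defaultdict
from difflib import SequenceMatcher


def blocking_key(record):
    # Hypothetical cheap key: first three characters of the lowercased name.
    return record["name"].strip().lower()[:3]


def candidate_pairs(source_a, source_b, key=blocking_key):
    """Return index pairs (i, j) whose records share the same blocking key."""
    blocks = defaultdict(list)
    for j, rec in enumerate(source_b):
        blocks[key(rec)].append(j)
    return [
        (i, j)
        for i, rec in enumerate(source_a)
        for j in blocks.get(key(rec), [])
    ]


def match(source_a, source_b, pairs, threshold=0.6):
    """Run the costlier string-similarity comparison on candidate pairs only."""
    matches = []
    for i, j in pairs:
        a = source_a[i]["name"].lower()
        b = source_b[j]["name"].lower()
        if SequenceMatcher(None, a, b).ratio() >= threshold:
            matches.append((i, j))
    return matches


# Toy data (assumed for illustration, not taken from any benchmark).
source_a = [{"name": "Acme Corporation"}, {"name": "Globex Inc"}, {"name": "Initech"}]
source_b = [{"name": "ACME Corp."}, {"name": "Initech LLC"}, {"name": "Umbrella Co"}]

pairs = candidate_pairs(source_a, source_b)
matches = match(source_a, source_b, pairs)
print(f"{len(pairs)} candidate pairs out of {len(source_a) * len(source_b)} possible")
print("matches:", matches)
```

In this sketch the matcher is invoked on the candidate pairs rather than on all nine possible pairs. The trade-off is that a key which is too strict can drop true matches before the matcher ever sees them, so the choice of key governs both cost and recall.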

Survey on blocking:

Papers

Showing 51-75 of 524 papers

Title | Status | Hype
Generate, Prune, Select: A Pipeline for Counterspeech Generation against Online Hate Speech | Code | 1
GraphSHA: Synthesizing Harder Samples for Class-Imbalanced Node Classification | Code | 1
Backdoor Attacks on Vision Transformers | Code | 1
COAST: COntrollable Arbitrary-Sampling NeTwork for Compressive Sensing | Code | 1
ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting | Code | 1
Augmenting Rule-based DNS Censorship Detection at Scale with Machine Learning | Code | 0
ML-CB: Machine Learning Canvas Block | Code | 0
A Systematic Approach to Blocking Convolutional Neural Networks | Code | 0
Learning to Customize Network Security Rules | Code | 0
Linking Cryptoasset Attribution Tags to Knowledge Graph Entities: An LLM-based Approach | Code | 0
Learning a Virtual Codec Based on Deep Convolutional Neural Network to Compress Image | Code | 0
Learning to Discriminate Perturbations for Blocking Adversarial Attacks in Text Classification | Code | 0
Multi-Channel Deep Networks for Block-Based Image Compressive Sensing | Code | 0
The dynamic interplay between in-context and in-weight learning in humans and neural networks | Code | 0
A Rule Mining-Based Advanced Persistent Threats Detection System | Code | 0
BAARD: Blocking Adversarial Examples by Testing for Applicability, Reliability and Decidability | Code | 0
How Useful is Intermittent, Asynchronous Expert Feedback for Bayesian Optimization? | Code | 0
ISLAND: Interpolating Land Surface Temperature using land cover | Code | 0
From Neural Re-Ranking to Neural Ranking: Learning a Sparse Representation for Inverted Indexing | Code | 0
A Stochastic Approximation Approach for Efficient Decentralized Optimization on Random Networks | Code | 0
Foundation for unbiased cross-validation of spatio-temporal models for species distribution modeling | Code | 0
AdVersarial: Perceptual Ad Blocking meets Adversarial Machine Learning | Code | 0
Effective Tensor Completion via Element-wise Weighted Low-rank Tensor Train with Overlapping Ket Augmentation | Code | 0
Efficient MPI-based Communication for GPU-Accelerated Dask Applications | Code | 0
Emergent Complexity via Multi-Agent Competition | Code | 0

No leaderboard results yet.