SOTAVerified

Blocking

Entity resolution (also known as entity matching, record linkage, or duplicate detection) is the task of finding records that refer to the same real-world entity across different data sources (e.g., data files, books, websites, and databases). (Source: Wikipedia)

Blocking is a crucial step in any entity resolution pipeline because a pair-wise comparison of all records across two data sources is infeasible. Blocking applies a computationally cheap method to generate a smaller set of candidate record pairs reducing the workload of the matcher. During matching a more expensive pair-wise matcher generates a final set of matching record pairs.

Survey on blocking:

Papers

Showing 51100 of 524 papers

TitleStatusHype
Sudowoodo: Contrastive Self-supervised Learning for Multi-purpose Data Integration and PreparationCode1
Boosting Multi-view Stereo with Late Cost AggregationCode1
Path-Specific Counterfactual Fairness for Recommender SystemsCode1
Time-Ordered Recent Event (TORE) Volumes for Event CamerasCode1
ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text SpottingCode1
Robust one-shot estimation over shared networks in the presence of denial-of-service attacksCode0
Augmenting Rule-based DNS Censorship Detection at Scale with Machine LearningCode0
Relation-aware Compositional Zero-shot Learning for Attribute-Object Pair RecognitionCode0
A Systematic Approach to Blocking Convolutional Neural NetworksCode0
Pragmatic Fairness: Developing Policies with Outcome Disparity ControlCode0
Reinforcement Learning of Self Enhancing Camera Image and Signal ProcessingCode0
SAH: Shifting-aware Asymmetric Hashing for Reverse k-Maximum Inner Product SearchCode0
A Rule Mining-Based Advanced Persistent Threats Detection SystemCode0
Wide-AdGraph: Detecting Ad Trackers with a Wide Dependency Chain GraphCode0
On Calibration of LLM-based Guard Models for Reliable Content ModerationCode0
Percival: Making In-Browser Perceptual Ad Blocking Practical With Deep LearningCode0
No-Reference Image Quality Assessment in the Spatial DomainCode0
Not All Videos Become Outdated: Short-Video Recommendation by Learning to Deconfound Release Interval BiasCode0
Multi-Channel Deep Networks for Block-Based Image Compressive SensingCode0
AdVersarial: Perceptual Ad Blocking meets Adversarial Machine LearningCode0
node2bits: Compact Time- and Attribute-aware Node Representations for User StitchingCode0
Physics-Informed Heterogeneous Graph Neural Networks for DC Blocker PlacementCode0
Exploration Policies for On-the-Fly Controller Synthesis: A Reinforcement Learning ApproachCode0
Linking Cryptoasset Attribution Tags to Knowledge Graph Entities: An LLM-based ApproachCode0
Learning to Customize Network Security RulesCode0
BAARD: Blocking Adversarial Examples by Testing for Applicability, Reliability and DecidabilityCode0
How Useful is Intermittent, Asynchronous Expert Feedback for Bayesian Optimization?Code0
ISLAND: Interpolating Land Surface Temperature using land coverCode0
From Neural Re-Ranking to Neural Ranking: Learning a Sparse Representation for Inverted IndexingCode0
ExtremeWeather: A large-scale climate dataset for semi-supervised detection, localization, and understanding of extreme weather eventsCode0
Evaluating Blocking Biases in Entity MatchingCode0
Foundation for unbiased cross-validation of spatio-temporal models for species distribution modelingCode0
A Stochastic Approximation Approach for Efficient Decentralized Optimization on Random NetworksCode0
Effective Tensor Completion via Element-wise Weighted Low-rank Tensor Train with Overlapping Ket AugmentationCode0
Blocking of the CD80/86 axis as a therapeutic approach to prevent progression to more severe forms of COVID-19Code0
Efficient MPI-based Communication for GPU-Accelerated Dask ApplicationsCode0
Emergent Complexity via Multi-Agent CompetitionCode0
AdGraph: A Graph-Based Approach to Ad and Tracker BlockingCode0
The dynamic interplay between in-context and in-weight learning in humans and neural networksCode0
BlueTempNet: A Temporal Multi-network Dataset of Social Interactions in Bluesky SocialCode0
Ethnicity sensitive author disambiguation using semi-supervised learningCode0
Detecting DGA domains with recurrent neural networks and side informationCode0
An efficient deep convolutional laplacian pyramid architecture for CS reconstruction at low sampling ratiosCode0
Learning to Discriminate Perturbations for Blocking Adversarial Attacks in Text ClassificationCode0
DiffIM: Differentiable Influence Minimization with Surrogate Modeling and Continuous RelaxationCode0
Deep Learning Meets Teleconnections: Improving S2S Predictions for European Winter WeatherCode0
Destruction of Image Steganography using Generative Adversarial NetworksCode0
DS-MLR: Exploiting Double Separability for Scaling up Distributed Multinomial Logistic RegressionCode0
BFRFormer: Transformer-based generator for Real-World Blind Face RestorationCode0
Deep Convolution Networks for Compression Artifacts ReductionCode0
Show:102550
← PrevPage 2 of 11Next →

No leaderboard results yet.