SOTAVerified

Blocking

Entity resolution (also known as entity matching, record linkage, or duplicate detection) is the task of finding records that refer to the same real-world entity across different data sources (e.g., data files, books, websites, and databases). (Source: Wikipedia)

Blocking is a crucial step in any entity resolution pipeline because a pair-wise comparison of all records across two data sources is usually infeasible: the number of pairs grows with the product of the two source sizes. Blocking applies a computationally cheap method to generate a smaller set of candidate record pairs, reducing the workload of the matcher. During matching, a more expensive pair-wise matcher then produces the final set of matching record pairs from those candidates.
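
The two stages described above can be illustrated with a minimal Python sketch. It is an illustration only, not a method taken from any of the listed papers: the blocking key (the first three characters of a name), the record fields `id` and `name`, the sample records, and the 0.8 similarity threshold are all assumptions made for the example, and `difflib.SequenceMatcher` merely stands in for whatever expensive matcher a real pipeline would use.

```python
from collections import defaultdict
from difflib import SequenceMatcher


def blocking_key(record):
    # Assumed cheap key: first three lowercase characters of the name.
    return record["name"].strip().lower()[:3]


def candidate_pairs(source_a, source_b):
    """Blocking: pair records only when they share a blocking key."""
    blocks = defaultdict(list)
    for rec_b in source_b:
        blocks[blocking_key(rec_b)].append(rec_b)
    for rec_a in source_a:
        for rec_b in blocks.get(blocking_key(rec_a), []):
            yield rec_a, rec_b


def match(pairs, threshold=0.8):
    """Matching: run the costlier string comparison on candidates only."""
    matches = []
    for rec_a, rec_b in pairs:
        score = SequenceMatcher(
            None, rec_a["name"].lower(), rec_b["name"].lower()
        ).ratio()
        if score >= threshold:
            matches.append((rec_a["id"], rec_b["id"]))
    return matches


# Hypothetical sample records from two data sources.
source_a = [
    {"id": "a1", "name": "Acme Corporation"},
    {"id": "a2", "name": "Globex Inc"},
]
source_b = [
    {"id": "b1", "name": "Acme Corporation Ltd"},
    {"id": "b2", "name": "Initech"},
    {"id": "b3", "name": "Globex Incorporated"},
]

candidates = list(candidate_pairs(source_a, source_b))
matches = match(candidates)
print(len(candidates), "candidate pairs out of", len(source_a) * len(source_b))
print(matches)
```

The sketch also shows the usual trade-off: a key this crude keeps the candidate set small, but any true match whose names differ in the first three characters would never reach the matcher.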

Survey on blocking:

Papers

Showing 51–100 of 524 papers

Title | Status | Hype
Neural Text Generation with Unlikelihood Training | Code | 1
O^2-Recon: Completing 3D Reconstruction of Occluded Objects in the Scene with a Pre-trained 2D Diffusion Model | Code | 1
Sudowoodo: Contrastive Self-supervised Learning for Multi-purpose Data Integration and Preparation | Code | 1
Self-Destructing Models: Increasing the Costs of Harmful Dual Uses of Foundation Models | Code | 1
ABINet++: Autonomous, Bidirectional and Iterative Language Modeling for Scene Text Spotting | Code | 1
Relation-aware Compositional Zero-shot Learning for Attribute-Object Pair Recognition | Code | 0
Augmenting Rule-based DNS Censorship Detection at Scale with Machine Learning | Code | 0
Reinforcement Learning of Self Enhancing Camera Image and Signal Processing | Code | 0
A Systematic Approach to Blocking Convolutional Neural Networks | Code | 0
Pragmatic Fairness: Developing Policies with Outcome Disparity Control | Code | 0
Percival: Making In-Browser Perceptual Ad Blocking Practical With Deep Learning | Code | 0
Physics-Informed Heterogeneous Graph Neural Networks for DC Blocker Placement | Code | 0
Robust one-shot estimation over shared networks in the presence of denial-of-service attacks | Code | 0
Wide-AdGraph: Detecting Ad Trackers with a Wide Dependency Chain Graph | Code | 0
A Rule Mining-Based Advanced Persistent Threats Detection System | Code | 0
Not All Videos Become Outdated: Short-Video Recommendation by Learning to Deconfound Release Interval Bias | Code | 0
On Calibration of LLM-based Guard Models for Reliable Content Moderation | Code | 0
node2bits: Compact Time- and Attribute-aware Node Representations for User Stitching | Code | 0
Neural Video Compression with Feature Modulation | Code | 0
ML-CB: Machine Learning Canvas Block | Code | 0
AdVersarial: Perceptual Ad Blocking meets Adversarial Machine Learning | Code | 0
Multi-Channel Deep Networks for Block-Based Image Compressive Sensing | Code | 0
No-Reference Image Quality Assessment in the Spatial Domain | Code | 0
Learning to Discriminate Perturbations for Blocking Adversarial Attacks in Text Classification | Code | 0
Learning a Virtual Codec Based on Deep Convolutional Neural Network to Compress Image | Code | 0
Linking Cryptoasset Attribution Tags to Knowledge Graph Entities: An LLM-based Approach | Code | 0
BAARD: Blocking Adversarial Examples by Testing for Applicability, Reliability and Decidability | Code | 0
How Useful is Intermittent, Asynchronous Expert Feedback for Bayesian Optimization? | Code | 0
ISLAND: Interpolating Land Surface Temperature using land cover | Code | 0
From Neural Re-Ranking to Neural Ranking: Learning a Sparse Representation for Inverted Indexing | Code | 0
ExtremeWeather: A large-scale climate dataset for semi-supervised detection, localization, and understanding of extreme weather events | Code | 0
Evaluating Blocking Biases in Entity Matching | Code | 0
Foundation for unbiased cross-validation of spatio-temporal models for species distribution modeling | Code | 0
A Stochastic Approximation Approach for Efficient Decentralized Optimization on Random Networks | Code | 0
Blocking of the CD80/86 axis as a therapeutic approach to prevent progression to more severe forms of COVID-19 | Code | 0
Effective Tensor Completion via Element-wise Weighted Low-rank Tensor Train with Overlapping Ket Augmentation | Code | 0
Efficient MPI-based Communication for GPU-Accelerated Dask Applications | Code | 0
Emergent Complexity via Multi-Agent Competition | Code | 0
The dynamic interplay between in-context and in-weight learning in humans and neural networks | Code | 0
BlueTempNet: A Temporal Multi-network Dataset of Social Interactions in Bluesky Social | Code | 0
AdGraph: A Graph-Based Approach to Ad and Tracker Blocking | Code | 0
Ethnicity sensitive author disambiguation using semi-supervised learning | Code | 0
Detecting DGA domains with recurrent neural networks and side information | Code | 0
Learning to Customize Network Security Rules | Code | 0
An efficient deep convolutional laplacian pyramid architecture for CS reconstruction at low sampling ratios | Code | 0
DiffIM: Differentiable Influence Minimization with Surrogate Modeling and Continuous Relaxation | Code | 0
Deep Learning Meets Teleconnections: Improving S2S Predictions for European Winter Weather | Code | 0
Destruction of Image Steganography using Generative Adversarial Networks | Code | 0
DS-MLR: Exploiting Double Separability for Scaling up Distributed Multinomial Logistic Regression | Code | 0
BFRFormer: Transformer-based generator for Real-World Blind Face Restoration | Code | 0

No leaderboard results yet.