SOTAVerified

Blocking

Entity resolution (also known as entity matching, record linkage, or duplicate detection) is the task of finding records that refer to the same real-world entity across different data sources (e.g., data files, books, websites, and databases). (Source: Wikipedia)

Blocking is a crucial step in any entity resolution pipeline because a pairwise comparison of all records across two data sources is usually infeasible: the number of pairs grows with the product of the two source sizes. Blocking applies a computationally cheap method to generate a smaller set of candidate record pairs, reducing the workload of the matcher. During matching, a more expensive pairwise matcher generates the final set of matching record pairs from those candidates.
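The two-stage idea described above can be illustrated with a minimal sketch. It assumes a simple key-based blocking scheme (the first three letters of a name) and uses a string-similarity ratio from Python's standard library as a stand-in for the expensive matcher. The record layout, the `blocking_key`, `candidate_pairs`, and `is_match` helpers, and the 0.8 threshold are all hypothetical choices for illustration, not a method taken from any of the listed papers.

```python
from collections import defaultdict
from difflib import SequenceMatcher
from itertools import product


def blocking_key(record):
    """Cheap key: first three letters of the lowercased name (illustrative choice)."""
    return record["name"].strip().lower()[:3]


def candidate_pairs(source_a, source_b):
    """Return cross-source pairs (a, b) whose records share a blocking key."""
    # Each block holds the records from source A and from source B with that key.
    blocks = defaultdict(lambda: ([], []))
    for rec in source_a:
        blocks[blocking_key(rec)][0].append(rec)
    for rec in source_b:
        blocks[blocking_key(rec)][1].append(rec)
    pairs = []
    for left, right in blocks.values():
        pairs.extend(product(left, right))
    return pairs


def is_match(a, b, threshold=0.8):
    """Stand-in for the expensive matcher: name similarity above an assumed threshold."""
    ratio = SequenceMatcher(None, a["name"].lower(), b["name"].lower()).ratio()
    return ratio >= threshold


# Hypothetical toy data: 3 x 3 = 9 possible cross-source pairs.
source_a = [
    {"id": "a1", "name": "Jonathan Smith"},
    {"id": "a2", "name": "Maria Garcia"},
    {"id": "a3", "name": "Wei Chen"},
]
source_b = [
    {"id": "b1", "name": "Jonathon Smith"},
    {"id": "b2", "name": "Marianne Gold"},
    {"id": "b3", "name": "Peter Novak"},
]

# Stage 1 (blocking): only pairs that share a key become candidates.
candidates = candidate_pairs(source_a, source_b)
# Stage 2 (matching): the costlier comparison runs on the candidates only.
matches = [(a["id"], b["id"]) for a, b in candidates if is_match(a, b)]

print(len(candidates), "candidate pairs out of", len(source_a) * len(source_b))
print(matches)
```

In this toy run the matcher is invoked on two candidate pairs rather than all nine. The trade-off is recall: a true match whose records fall under different keys (for example, a typo in the first three letters) never reaches the matcher, which is why the choice of blocking method matters.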

Survey on blocking:

Papers

Showing 76–100 of 524 papers

Title | Status | Hype
On-Chip and Off-Chip TIA Amplifiers for Nanopore Signal Readout Design, Performance and Challenges: A Review |  | 0
Deep Learning Meets Teleconnections: Improving S2S Predictions for European Winter Weather | Code | 0
Generative Classifier for Domain Generalization |  | 0
Matching, Unanticipated Experiences, Divorce, Flirting, Rematching, Etc |  | 0
Priority-Aware Preemptive Scheduling for Mixed-Priority Workloads in MoE Inference |  | 0
Identification of Minimally Restrictive Assembly Sequences using Supervisory Control Theory |  | 0
Fault Localization and State Estimation of Power Grid under Parallel Cyber-Physical Attacks |  | 0
Autellix: An Efficient Serving Engine for LLM Agents as General Programs |  | 0
Observability-Blocking Controls for Double-Integrator and Higher Order Integrator Networks |  | 0
Minimizing Instability in Strategy-Proof Matching Mechanism Using A Linear Programming Approach |  | 0
Evolving Hate Speech Online: An Adaptive Framework for Detection and Mitigation |  | 0
Linking Cryptoasset Attribution Tags to Knowledge Graph Entities: An LLM-based Approach | Code | 0
Democratizing AI: Open-source Scalable LLM Training on GPU-based Supercomputers |  | 0
DiffIM: Differentiable Influence Minimization with Surrogate Modeling and Continuous Relaxation | Code | 0
Leveraging Large Language Models to Predict Antibody Biological Activity Against Influenza A Hemagglutinin |  | 0
Enhancing Model Defense Against Jailbreaks with Proactive Safety Reasoning |  | 0
Replacing the Gallium Oxide Shell with Conductive Ag: Toward a Printable and Recyclable Composite for Highly Stretchable Electronics, Electromagnetic Shielding, and Thermal Interfaces |  | 0
Foundation for unbiased cross-validation of spatio-temporal models for species distribution modeling | Code | 0
CAMEO: Autocorrelation-Preserving Line Simplification for Lossy Time Series Compression |  | 0
Hybrid Parallel Collaborative Simulation Framework Integrating Device Physics with Circuit Dynamics for PDAE-Modeled Power Electronic Equipment |  | 0
Broadband measurements and analysis of human blocking in a 60 GHz indoor radio channel |  | 0
Network Diffuser for Placing-Scheduling Service Function Chains with Inverse Demonstration |  | 0
mFabric: An Efficient and Scalable Fabric for Mixture-of-Experts Training |  | 0
ABACUS: A FinOps Service for Cloud Cost Optimization |  | 0
Block-Based Multi-Scale Image Rescaling |  | 0

No leaderboard results yet.