SOTAVerified

Data Integration

Data integration (also called information integration) is the process of consolidating data from a set of heterogeneous data sources into a single uniform data set (materialized integration) or view on the data (virtual integration). Data integration pipelines involve subtasks such as schema matching, table annotation, entity resolution, value normalization, data cleansing, and data fusion. Application domains of data integration include data warehousing, data lakes, and knowledge base consolidation. Surveys on Data integration:

Papers

Showing 5175 of 431 papers

TitleStatusHype
Transformer-based Entity Legal Form ClassificationCode1
Web-Scale Academic Name Disambiguation: the WhoIsWho Benchmark, Leaderboard, and ToolkitCode1
Integrated community occupancy models: A framework to assess occurrence and biodiversity dynamics using multiple data sourcesCode0
Alternative Telescopic Displacement: An Efficient Multimodal Alignment MethodCode0
Integrating Heterogeneous Gene Expression Data through Knowledge Graphs for Improving Diabetes PredictionCode0
Heter-LP: A heterogeneous label propagation algorithm and its application in drug repositioningCode0
IAM: Enhancing RGB-D Instance Segmentation with New BenchmarksCode0
A Systematic Approach to Featurization for Cancer Drug Sensitivity Predictions with Deep LearningCode0
GraphSeqLM: A Unified Graph Language Framework for Omic Graph LearningCode0
Reconstructing Nonlinear Dynamical Systems from Multi-Modal Time SeriesCode0
A Unified Joint Matrix Factorization Framework for Data IntegrationCode0
AdapterEM: Pre-trained Language Model Adaptation for Generalized Entity Matching using Adapter-tuningCode0
Integrating Weather Station Data and Radar for Precipitation Nowcasting: SmaAt-fUsion and SmaAt-Krige-GNetCode0
Gaussian Process Emulators for Few-Shot Segmentation in Cardiac MRICode0
A Survey of Pipeline Tools for Data EngineeringCode0
Generalized probabilistic canonical correlation analysis for multi-modal data integration with full or partial observationsCode0
From Swath to Full-Disc: Advancing Precipitation Retrieval with Multimodal Knowledge ExpansionCode0
Gaussian Copula Models for Nonignorable Missing Data Using Auxiliary Marginal QuantilesCode0
Graph Integration for Diffusion-Based Manifold AlignmentCode0
Federated Learning in Chemical Engineering: A Tutorial on a Framework for Privacy-Preserving Collaboration Across Distributed Data SourcesCode0
Evaluating approaches for supervised semantic labelingCode0
Evaluating Blocking Biases in Entity MatchingCode0
From Classical Machine Learning to Emerging Foundation Models: Review on Multimodal Data Integration for Cancer ResearchCode0
Intermediate triple table: A general architecture for virtual knowledge graphsCode0
Elastic Coupled Co-clustering for Single-Cell Genomic DataCode0
Show:102550
← PrevPage 3 of 18Next →

No leaderboard results yet.