SOTAVerified

Data Integration

Data integration (also called information integration) is the process of consolidating data from a set of heterogeneous data sources into a single uniform data set (materialized integration) or view on the data (virtual integration). Data integration pipelines involve subtasks such as schema matching, table annotation, entity resolution, value normalization, data cleansing, and data fusion. Application domains of data integration include data warehousing, data lakes, and knowledge base consolidation. Surveys on Data integration:

Papers

Showing 1120 of 431 papers

TitleStatusHype
Boosting Knowledge Graph Generation from Tabular Data with RML ViewsCode2
Morph-KGC: Scalable knowledge graph materialization with mapping partitionsCode2
Graph Neural Networks for Multimodal Single-Cell Data IntegrationCode2
scSSL-Bench: Benchmarking Self-Supervised Learning for Single-Cell DataCode1
KramaBench: A Benchmark for AI Systems on Data-to-Insight Pipelines over Data LakesCode1
FusDreamer: Label-efficient Remote Sensing World Model for Multimodal Data ClassificationCode1
RecKG: Knowledge Graph for Recommender SystemsCode1
Column Property Annotation using Large Language ModelsCode1
Towards Unified Molecule-Enhanced Pathology Image Representation Learning via Integrating Spatial TranscriptomicsCode1
Fine-tuning Large Language Models for Entity MatchingCode1
Show:102550
← PrevPage 2 of 44Next →

No leaderboard results yet.