SOTAVerified

Data Integration

Data integration (also called information integration) is the process of consolidating data from a set of heterogeneous data sources into a single uniform data set (materialized integration) or view on the data (virtual integration). Data integration pipelines involve subtasks such as schema matching, table annotation, entity resolution, value normalization, data cleansing, and data fusion. Application domains of data integration include data warehousing, data lakes, and knowledge base consolidation. Surveys on Data integration:

Papers

Showing 251260 of 431 papers

TitleStatusHype
Ring-lite: Scalable Reasoning via C3PO-Stabilized Reinforcement Learning for LLMs0
Scalable Similarity Joins of Tokenized Strings0
Scaling Data Science Solutions with Semantics and Machine Learning: Bosch Case0
scICML: Information-theoretic Co-clustering-based Multi-view Learning for the Integrative Analysis of Single-cell Multi-omics data0
Secure and Differentially Private Bayesian Learning on Distributed Data0
Segment-based fusion of multi-sensor multi-scale satellite soil moisture retrievals0
Semantic Annotation for Tabular Data0
Semantic Data Management in Data Lakes0
Siamese Graph Neural Networks for Data Integration0
Simplifying Data Integration: SLM-Driven Systems for Unified Semantic Queries Across Heterogeneous Databases0
Show:102550
← PrevPage 26 of 44Next →

No leaderboard results yet.