SOTAVerified

Data Integration

Data integration (also called information integration) is the process of consolidating data from a set of heterogeneous data sources into a single uniform data set (materialized integration) or view on the data (virtual integration). Data integration pipelines involve subtasks such as schema matching, table annotation, entity resolution, value normalization, data cleansing, and data fusion. Application domains of data integration include data warehousing, data lakes, and knowledge base consolidation. Surveys on Data integration:

Papers

Showing 110 of 431 papers

TitleStatusHype
EasySpider: A No-Code Visual System for Crawling the WebCode7
TableGPT2: A Large Multimodal Model with Tabular Data IntegrationCode4
Exploring the Feasibility of Automated Data Standardization using Large Language Models for Seamless PositioningCode3
Declarative generation of RDF-star graphs from heterogeneous dataCode3
Intervention-Aware Forecasting: Breaking Historical Limits from a System PerspectiveCode3
An RML-FNML module for Python user-defined functions in Morph-KGCCode3
Docs2KG: Unified Knowledge Graph Construction from Heterogeneous Documents Assisted by Large Language ModelsCode3
CARTE: Pretraining and Transfer for Tabular LearningCode2
Adaptive Multi-Scale Decomposition Framework for Time Series ForecastingCode2
Boosting Knowledge Graph Generation from Tabular Data with RML ViewsCode2
Show:102550
← PrevPage 1 of 44Next →

No leaderboard results yet.