SOTAVerified

Data Integration

Data integration (also called information integration) is the process of consolidating data from a set of heterogeneous data sources into a single uniform data set (materialized integration) or a view on the data (virtual integration). Data integration pipelines involve subtasks such as schema matching, table annotation, entity resolution, value normalization, data cleansing, and data fusion. Application domains of data integration include data warehousing, data lakes, and knowledge base consolidation.
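As a rough illustration of three of the subtasks named above (value normalization, entity resolution, and data fusion), the following minimal Python sketch performs a materialized integration of two small record sets. The function names `normalize` and `integrate`, the sample sources `crm` and `erp`, and the matching and fusion rules are all assumptions chosen for illustration; real pipelines typically use far more robust matching than an exact comparison of normalized keys.

```python
from collections import defaultdict


def normalize(value):
    """Value normalization (simplified): trim, collapse whitespace, lowercase."""
    if value is None:
        return None
    return " ".join(str(value).split()).lower()


def integrate(sources, key_field):
    """Materialized integration of several record lists into one list.

    Hypothetical helper: records are plain dicts that all contain key_field.
    """
    clusters = defaultdict(list)
    for records in sources:
        for record in records:
            # Entity resolution (simplified): records whose normalized key
            # is identical are assumed to describe the same entity.
            clusters[normalize(record[key_field])].append(record)

    fused = []
    for key, records in clusters.items():
        merged = {}
        for record in records:
            for field, value in record.items():
                # Data fusion (simplified): keep the first non-empty value
                # seen for each field.
                if merged.get(field) in (None, "") and value not in (None, ""):
                    merged[field] = value
        merged[key_field] = key  # store the normalized key
        fused.append(merged)
    return fused


# Two hypothetical heterogeneous sources describing overlapping entities.
crm = [{"name": "Acme  Corp", "city": "Berlin", "phone": None}]
erp = [
    {"name": "acme corp", "city": "", "phone": "555-0100"},
    {"name": "Globex", "city": "Paris", "phone": None},
]

result = integrate([crm, erp], key_field="name")
```

Here the two "Acme" records are merged into a single entity that takes its city from one source and its phone number from the other, while "Globex" remains a separate record.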

Papers

Showing 151–175 of 431 papers

Title | Status | Hype
Easy Semantification of Bioassays | | 0
Bi-Encoders based Species Normalization -- Pairwise Sentence Learning to Rank | | 0
Efficient Joinable Table Discovery in Data Lakes: A High-Dimensional Similarity-Based Approach | | 0
A big data intelligence marketplace and secure analytics experimentation platform for the aviation industry | | 0
Analytical Engines With Context-Rich Processing: Towards Efficient Next-Generation Analytics | | 0
Business Entity Matching with Siamese Graph Convolutional Networks | | 0
Embedding-based Multimodal Learning on Pan-Squamous Cell Carcinomas for Improved Survival Outcomes | | 0
Empowering Cognitive Digital Twins with Generative Foundation Models: Developing a Low-Carbon Integrated Freight Transportation System | | 0
Empowering Digital Agriculture: A Privacy-Preserving Framework for Data Sharing and Collaborative Research | | 0
Enhancing Bagging Ensemble Regression with Data Integration for Time Series-Based Diabetes Prediction | | 0
Address-Specific Sustainable Accommodation Choice Through Real-World Data Integration | | 0
Enhancing Function-Calling Capabilities in LLMs: Strategies for Prompt Formats, Data Integration, and Multilingual Translation | | 0
An Ontology for Defect Detection in Metal Additive Manufacturing | | 0
Enhancing streamflow forecast and extracting insights using long-short term memory networks with data integration at continental scales | | 0
Federated Learning: A new frontier in the exploration of multi-institutional medical imaging data | | 0
CDE-Mapper: Using Retrieval-Augmented Language Models for Linking Clinical Data Elements to Controlled Vocabularies | | 0
EquiNMF: Graph Regularized Multiview Nonnegative Matrix Factorization | | 0
EraW-Net: Enhance-Refine-Align W-Net for Scene-Associated Driver Attention Estimation | | 0
EUR-USD Exchange Rate Forecasting Based on Information Fusion with Large Language Models and Deep Learning Methods | | 0
Clinical Trials Ontology Engineering with Large Language Models | | 0
Disambiguate Entity Matching using Large Language Models through Relation Discovery | | 0
DINGO: an ontology for projects and grants linked data | | 0
Exploring Artificial Intelligence Methods for Energy Prediction in Healthcare Facilities: An In-Depth Extended Systematic Review | | 0
Exploring LLM Agents for Cleaning Tabular Machine Learning Datasets | | 0
Diffusion Transport Alignment | | 0

No leaderboard results yet.