SOTAVerified

Data Integration

Data integration (also called information integration) is the process of consolidating data from a set of heterogeneous data sources into a single uniform data set (materialized integration) or a view on the data (virtual integration). Data integration pipelines involve subtasks such as schema matching, table annotation, entity resolution, value normalization, data cleansing, and data fusion. Application domains of data integration include data warehousing, data lakes, and knowledge base consolidation.
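As a rough illustration of how some of the subtasks named above fit together in a materialized integration, the following Python sketch runs schema matching, value normalization, entity resolution, and data fusion over two tiny in-memory sources. Everything in it is an assumption made for illustration: the sample records, the hand-written `SCHEMA_MAP`, the helper names, the exact-match resolution key, and the "first non-null value wins" fusion rule. Real pipelines typically use learned or similarity-based matchers instead.

```python
from collections import defaultdict

# Hypothetical source records with heterogeneous schemas (illustrative data only).
source_a = [{"name": "ACME Corp.", "city": "Berlin", "phone": None}]
source_b = [{"company": "acme corp", "location": "Berlin", "tel": "+49 30 1234"}]

# Schema matching: a hand-written mapping from source attributes to a unified schema.
SCHEMA_MAP = {"company": "name", "location": "city", "tel": "phone"}


def match_schema(record):
    """Rename source attributes to the unified schema (unknown keys pass through)."""
    return {SCHEMA_MAP.get(key, key): value for key, value in record.items()}


def normalize(value):
    """Value normalization: trim, drop a trailing period, lowercase, collapse spaces."""
    if value is None:
        return None
    return " ".join(value.strip().rstrip(".").lower().split())


def integrate(*sources):
    """Materialize one fused record per resolved entity."""
    # Entity resolution: records sharing a normalized (name, city) key are
    # treated as the same real-world entity (a deliberately naive rule).
    clusters = defaultdict(list)
    for source in sources:
        for record in source:
            unified = match_schema(record)
            key = (normalize(unified.get("name")), normalize(unified.get("city")))
            clusters[key].append(unified)

    # Data fusion: for each attribute, keep the first non-null value seen.
    fused = []
    for records in clusters.values():
        merged = {}
        for record in records:
            for attr, value in record.items():
                if merged.get(attr) is None:
                    merged[attr] = value
        fused.append(merged)
    return fused


result = integrate(source_a, source_b)
print(result)
```

With the sample data, both records resolve to a single entity, and the fused record takes its phone number from the second source because the first source has none.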

Papers

Showing 401-431 of 431 papers

Title | Status | Hype
Disambiguate Entity Matching using Large Language Models through Relation Discovery | | 0
Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction | | 0
DREIFLUSS: A Minimalist Approach for Table Matching | | 0
Driving Digital Engineering Integration and Interoperability Through Semantic Integration of Models with Ontologies | | 0
Drug response prediction by inferring pathway-response associations with Kernelized Bayesian Matrix Factorization | | 0
Dynamical models for metabolomics data integration | | 0
Dynamic Basis Function Interpolation for Adaptive In Situ Data Integration in Ocean Modeling | | 0
Easy Semantification of Bioassays | | 0
Efficient Joinable Table Discovery in Data Lakes: A High-Dimensional Similarity-Based Approach | | 0
Embedding-based Multimodal Learning on Pan-Squamous Cell Carcinomas for Improved Survival Outcomes | | 0
Empowering Cognitive Digital Twins with Generative Foundation Models: Developing a Low-Carbon Integrated Freight Transportation System | | 0
Empowering Digital Agriculture: A Privacy-Preserving Framework for Data Sharing and Collaborative Research | | 0
Enhancing Bagging Ensemble Regression with Data Integration for Time Series-Based Diabetes Prediction | | 0
Enhancing End Stage Renal Disease Outcome Prediction: A Multi-Sourced Data-Driven Approach | | 0
Enhancing Function-Calling Capabilities in LLMs: Strategies for Prompt Formats, Data Integration, and Multilingual Translation | | 0
Enhancing streamflow forecast and extracting insights using long-short term memory networks with data integration at continental scales | | 0
EquiNMF: Graph Regularized Multiview Nonnegative Matrix Factorization | | 0
EraW-Net: Enhance-Refine-Align W-Net for Scene-Associated Driver Attention Estimation | | 0
EUR-USD Exchange Rate Forecasting Based on Information Fusion with Large Language Models and Deep Learning Methods | | 0
Exploring Artificial Intelligence Methods for Energy Prediction in Healthcare Facilities: An In-Depth Extended Systematic Review | | 0
Exploring LLM Agents for Cleaning Tabular Machine Learning Datasets | | 0
FadMan: Federated Anomaly Detection across Multiple Attributed Networks | | 0
Causal Feature Selection for Algorithmic Fairness | | 0
Fast Record Linkage for Company Entities | | 0
Feature Selection for Data Integration with Mixed Multi-view Data | | 0
Federated Learning: A new frontier in the exploration of multi-institutional medical imaging data | | 0
Federated Learning for Coronary Artery Plaque Detection in Atherosclerosis Using IVUS Imaging: A Multi-Hospital Collaboration | | 0
Federated Learning over Harmonized Data Silos | | 0
Federated Multi-View Learning for Private Medical Data Integration and Analysis | | 0
Deep Learning and NLP in Cryptocurrency Forecasting: Integrating Financial, Blockchain, and Social Media Data | | 0
From Automation to Autonomy in Smart Manufacturing: A Bayesian Optimization Framework for Modeling Multi-Objective Experimentation and Sequential Decision Making | | 0

No leaderboard results yet.