SOTAVerified

Data Integration

Data integration (also called information integration) is the process of consolidating data from a set of heterogeneous data sources into a single uniform data set (materialized integration) or view on the data (virtual integration). Data integration pipelines involve subtasks such as schema matching, table annotation, entity resolution, value normalization, data cleansing, and data fusion. Application domains of data integration include data warehousing, data lakes, and knowledge base consolidation. Surveys on Data integration:

Papers

Showing 151200 of 431 papers

TitleStatusHype
Driving Digital Engineering Integration and Interoperability Through Semantic Integration of Models with Ontologies0
DREIFLUSS: A Minimalist Approach for Table Matching0
Efficient Joinable Table Discovery in Data Lakes: A High-Dimensional Similarity-Based Approach0
A big data intelligence marketplace and secure analytics experimentation platform for the aviation industry0
BIM-to-BRICK: Using graph modeling for IoT/BMS and spatial semantic data interoperability within digital data models of buildings0
Business Entity Matching with Siamese Graph Convolutional Networks0
Embedding-based Multimodal Learning on Pan-Squamous Cell Carcinomas for Improved Survival Outcomes0
Empowering Cognitive Digital Twins with Generative Foundation Models: Developing a Low-Carbon Integrated Freight Transportation System0
Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction0
Enhancing Bagging Ensemble Regression with Data Integration for Time Series-Based Diabetes Prediction0
Bi-Encoders based Species Normalization -- Pairwise Sentence Learning to Rank0
Enhancing Function-Calling Capabilities in LLMs: Strategies for Prompt Formats, Data Integration, and Multilingual Translation0
An Ontology for Defect Detection in Metal Additive Manufacturing0
Enhancing streamflow forecast and extracting insights using long-short term memory networks with data integration at continental scales0
Analytical Engines With Context-Rich Processing: Towards Efficient Next-Generation Analytics0
CDE-Mapper: Using Retrieval-Augmented Language Models for Linking Clinical Data Elements to Controlled Vocabularies0
EquiNMF: Graph Regularized Multiview Nonnegative Matrix Factorization0
EraW-Net: Enhance-Refine-Align W-Net for Scene-Associated Driver Attention Estimation0
EUR-USD Exchange Rate Forecasting Based on Information Fusion with Large Language Models and Deep Learning Methods0
Clinical Trials Ontology Engineering with Large Language Models0
Address-Specific Sustainable Accommodation Choice Through Real-World Data Integration0
Cluster Quilting: Spectral Clustering for Patchwork Learning0
Exploring Artificial Intelligence Methods for Energy Prediction in Healthcare Facilities: An In-Depth Extended Systematic Review0
Exploring LLM Agents for Cleaning Tabular Machine Learning Datasets0
GridMind: A Multi-Agent NLP Framework for Unified, Cross-Modal NFL Data Insights0
Disambiguate Entity Matching using Large Language Models through Relation Discovery0
DINGO: an ontology for projects and grants linked data0
Fast Record Linkage for Company Entities0
Feature Selection for Data Integration with Mixed Multi-view Data0
Federated Learning: A new frontier in the exploration of multi-institutional medical imaging data0
Federated Learning for Coronary Artery Plaque Detection in Atherosclerosis Using IVUS Imaging: A Multi-Hospital Collaboration0
Application of Artificial Intelligence in Schizophrenia Rehabilitation Management: A Systematic Scoping Review0
Federated Learning over Harmonized Data Silos0
Federated Multi-View Learning for Private Medical Data Integration and Analysis0
Diffusion Transport Alignment0
Deep Learning and NLP in Cryptocurrency Forecasting: Integrating Financial, Blockchain, and Social Media Data0
From Automation to Autonomy in Smart Manufacturing: A Bayesian Optimization Framework for Modeling Multi-Objective Experimentation and Sequential Decision Making0
A Pseudo-Likelihood Approach to Linear Regression with Partially Shuffled Data0
Beyond designer's knowledge: Generating materials design hypotheses via large language models0
Functional Integrative Bayesian Analysis of High-dimensional Multiplatform Genomic Data0
Analyses and Concerns in Precision Medicine: A Statistical Perspective0
Diagnostic data integration using deep neural networks for real-time plasma analysis0
GAME: Generalized deep learning model towards multimodal data integration for early screening of adolescent mental disorders0
Conditionally Invariant Representation Learning for Disentangling Cellular Heterogeneity0
Context-Aware Analytics in MOM Applications0
A robust kernel machine regression towards biomarker selection in multi-omics datasets of osteoporosis for drug discovery0
GeoFault: A well-founded fault ontology for interoperability in geological modeling0
Geospatial Narratives and their Spatio-Temporal Dynamics: Commonsense Reasoning for High-level Analyses in Geographic Information Systems0
Gesture Recognition in Robotic Surgery: a Review0
Development of Semantics-Based Distributed Middleware for Heterogeneous Data Integration and its Application for Drought0
Show:102550
← PrevPage 4 of 9Next →

No leaderboard results yet.