SOTAVerified

Data Integration

Data integration (also called information integration) is the process of consolidating data from a set of heterogeneous data sources into a single uniform data set (materialized integration) or view on the data (virtual integration). Data integration pipelines involve subtasks such as schema matching, table annotation, entity resolution, value normalization, data cleansing, and data fusion. Application domains of data integration include data warehousing, data lakes, and knowledge base consolidation. Surveys on Data integration:

Papers

Showing 301350 of 431 papers

TitleStatusHype
K-shell decomposition reveals hierarchical cortical organization of the human brain0
Large Scale Record Linkage in the Presence of Missing Data0
LEAPME: Learning-based Property Matching with Embeddings0
Learnings from Data Integration for Augmented Language Models0
Leveraging Language Models for Automated Patient Record Linkage0
Leveraging Large Language Models for Integrated Satellite-Aerial-Terrestrial Networks: Recent Advances and Future Directions0
Leveraging Large Language Models for Entity Matching0
Leveraging MIMIC Datasets for Better Digital Health: A Review on Open Problems, Progress Highlights, and Future Promises0
Linear Regression Evaluation of Search Engine Automatic Search Performance Based on Hadoop and R0
Linking Graph Entities with Multiplicity and Provenance0
Local Embeddings for Relational Data Integration0
Machine Learning for Integrating Data in Biology and Medicine: Principles, Practice, and Opportunities0
Making Table Understanding Work in Practice0
Mapping-equivalence and oid-equivalence of single-function object-creating conjunctive queries0
Matchmaker: Self-Improving Large Language Model Programs for Schema Matching0
Maximum Temperature Prediction Using Remote Sensing Data Via Convolutional Neural Network0
metasnf: Meta Clustering with Similarity Network Fusion in R0
Mind the Data Gap: Bridging LLMs to Enterprise Data Integration0
MISFEAT: Feature Selection for Subgroups with Systematic Missing Data0
MMREC: LLM Based Multi-Modal Recommender System0
Modeling multi-scale data via a network of networks0
Multicriteria Analysis Model in Sustainable Corn Farming Area Planning0
Multifaceted Context Representation using Dual Attention for Ontology Alignment0
Multi-faceted Neuroimaging Data Integration via Analysis of Subspaces0
Multi-Graph based Multi-Scenario Recommendation in Large-scale Online Video Services0
Multi-Kernel LS-SVM Based Bio-Clinical Data Integration: Applications to Ovarian Cancer0
Multi-layer matrix factorization for cancer subtyping using full and partial multi-omics dataset0
Multilayer network approaches to omics data integration in Digital Twins for cancer research0
Multi-Layer Privacy-Preserving Record Linkage with Clerical Review based on gradual information disclosure0
Multimodal Alignment and Fusion: A Survey0
Multimodal Data Integration for Oncology in the Era of Deep Neural Networks: A Review0
Multimodal Data Integration for Precision Oncology: Challenges and Future Directions0
Multimodal Data Integration for Sustainable Indoor Gardening: Tracking Anyplant with Time Series Foundation Model0
Multi-Modal Dataset Creation for Federated Learning with DICOM Structured Reports0
Multimodal Doctor-in-the-Loop: A Clinically-Guided Explainable Framework for Predicting Pathological Response in Non-Small Cell Lung Cancer0
Multimodal Generative AI for Story Point Estimation in Software Development0
Multimodal Machine Learning in Image-Based and Clinical Biomedicine: Survey and Prospects0
Multi-object Data Integration in the Study of Primary Progressive Aphasia0
Multi-omics data integration for early diagnosis of hepatocellular carcinoma (HCC) using machine learning0
Semantic interoperability based on the European Materials and Modelling Ontology and its ontological paradigm: Mereosemiotics0
Multi-task Learning for Heterogeneous Data via Integrating Shared and Task-Specific Encodings0
Multitask Learning without Label Correspondences0
Multi-View Variational Autoencoder for Missing Value Imputation in Untargeted Metabolomics0
NCBO Ontology Recommender 2.0: An Enhanced Approach for Biomedical Ontology Recommendation0
Need for Design Patterns: Interoperability Issues and Modelling Challenges for Observational Data0
NEMA: Automatic Integration of Large Network Management Databases0
Nested Deep Learning Model Towards A Foundation Model for Brain Signal Data0
Neural decoding from stereotactic EEG: accounting for electrode variability across subjects0
Novel Architecture for Distributed Travel Data Integration and Service Provision Using Microservices0
On the effects of alternative optima in context-specific metabolic model predictions0
Show:102550
← PrevPage 7 of 9Next →

No leaderboard results yet.