SOTAVerified

Data Integration

Data integration (also called information integration) is the process of consolidating data from a set of heterogeneous data sources into a single uniform data set (materialized integration) or view on the data (virtual integration). Data integration pipelines involve subtasks such as schema matching, table annotation, entity resolution, value normalization, data cleansing, and data fusion. Application domains of data integration include data warehousing, data lakes, and knowledge base consolidation. Surveys on Data integration:

Papers

Showing 176200 of 431 papers

TitleStatusHype
Interactive Ontology Matching with Cost-Efficient Learning0
Adaptive Affinity-Based Generalization For MRI Imaging Segmentation Across Resource-Limited Settings0
Multicriteria Analysis Model in Sustainable Corn Farming Area Planning0
Detection of bromochloro alkanes in indoor dust using a novel CP-Seeker data integration tool0
iMD4GC: Incomplete Multimodal Data Integration to Advance Precise Treatment Response Prediction and Survival Analysis for Gastric CancerCode1
An RML-FNML module for Python user-defined functions in Morph-KGCCode3
Supervised Multiple Kernel Learning approaches for multi-omics data integrationCode0
Disambiguate Entity Matching using Large Language Models through Relation Discovery0
Declarative generation of RDF-star graphs from heterogeneous dataCode3
Advancing Explainable Autonomous Vehicle Systems: A Comprehensive Review and Research Roadmap0
A2CI: A Cloud-based, Service-oriented Geospatial Cyberinfrastructure to Support Atmospheric Research0
Developing and Deploying Industry Standards for Artificial Intelligence in Education (AIED): Challenges, Strategies, and Future Directions0
Data augmentation with automated machine learning: approaches and performance comparison with classical data augmentation methods0
ReMatch: Retrieval Enhanced Schema Matching with LLMsCode0
preon: Fast and accurate entity normalization for drug names and cancer types in precision oncologyCode0
CARTE: Pretraining and Transfer for Tabular LearningCode2
Statistical Agnostic Regression: a machine learning method to validate regression models0
Patient-Centric Knowledge Graphs: A Survey of Current Methods, Challenges, and Applications0
An Adaptive System Architecture for Multimodal Intelligent Transportation Systems0
P3LS: Partial Least Squares under Privacy Preservation0
eipy: An Open-Source Python Package for Multi-modal Data Integration using Heterogeneous EnsemblesCode1
Integrate Any Omics: Towards genome-wide data integration for patient stratificationCode2
TemporalAugmenter: An Ensemble Recurrent Based Deep Learning Approach for Signal Classification0
Analyses and Concerns in Precision Medicine: A Statistical Perspective0
Data Integration Framework for Virtual Reality Enabled Digital Twins0
Show:102550
← PrevPage 8 of 18Next →

No leaderboard results yet.