SOTAVerified

Data Integration

Data integration (also called information integration) is the process of consolidating data from a set of heterogeneous data sources into a single uniform data set (materialized integration) or view on the data (virtual integration). Data integration pipelines involve subtasks such as schema matching, table annotation, entity resolution, value normalization, data cleansing, and data fusion. Application domains of data integration include data warehousing, data lakes, and knowledge base consolidation.
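As a rough illustration of how some of these subtasks fit together, the following minimal sketch runs a materialized integration over two toy sources. Everything in it is an assumption made for illustration: the sample records, the hand-written attribute correspondences in `MAPPINGS`, the choice of the normalized email as the matching key, and the "first non-empty value wins" fusion rule. Real pipelines typically learn or infer the schema correspondences and use fuzzy matching for entity resolution.

```python
# Hypothetical toy sources with different schemas (not from any real dataset).
SOURCE_A = [
    {"full_name": "Ada Lovelace ", "mail": "ADA@example.org", "city": "London"},
    {"full_name": "Alan Turing", "mail": "alan@example.org", "city": ""},
]
SOURCE_B = [
    {"name": "ada lovelace", "email": "ada@example.org", "phone": "555-0100"},
    {"name": "Grace Hopper", "email": "grace@example.org", "phone": "555-0199"},
]

# Schema matching: hand-written attribute correspondences to a target schema.
MAPPINGS = {
    "A": {"full_name": "name", "mail": "email", "city": "city"},
    "B": {"name": "name", "email": "email", "phone": "phone"},
}
TARGET = ("name", "email", "city", "phone")


def to_target(record, mapping):
    """Rename source attributes to the target schema and normalize values."""
    out = {attr: "" for attr in TARGET}
    for src_attr, value in record.items():
        out[mapping[src_attr]] = " ".join(value.split())  # collapse whitespace
    out["name"] = out["name"].title()
    out["email"] = out["email"].lower()
    return out


def integrate(sources):
    """Entity resolution on the normalized email, then fusion per entity."""
    entities = {}
    for source_id, records in sources.items():
        for record in records:
            rec = to_target(record, MAPPINGS[source_id])
            merged = entities.setdefault(rec["email"], dict(rec))
            # Data fusion: keep the first non-empty value seen per attribute.
            for attr in TARGET:
                if not merged[attr]:
                    merged[attr] = rec[attr]
    return list(entities.values())


integrated = integrate({"A": SOURCE_A, "B": SOURCE_B})
for row in integrated:
    print(row)
```

Here the two "Ada Lovelace" records collapse into one entity that carries the city from the first source and the phone number from the second, while the unmatched records pass through unchanged.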

Papers

Showing 51–100 of 431 papers

Title | Status | Hype
eipy: An Open-Source Python Package for Multi-modal Data Integration using Heterogeneous Ensembles | Code | 1
Towards Unified Molecule-Enhanced Pathology Image Representation Learning via Integrating Spatial Transcriptomics | Code | 1
A Cautionary Tale on Integrating Studies with Disparate Outcome Measures for Causal Inference | | 0
A Comparison of Neuroelectrophysiology Databases | | 0
Current and future directions in network biology | | 0
A Theoretical Framework for Graph-based Digital Twins for Supply Chain Management and Optimization | | 0
A technical review of multi-omics data integration methods: from classical statistical to deep generative approaches | | 0
AI-driven multi-omics integration for multi-scale predictive modeling of causal genotype-environment-phenotype relationships | | 0
A Threefold Review on Deep Semantic Segmentation: Efficiency-oriented, Temporal and Depth-aware design | | 0
AtomAgents: Alloy design and discovery through physics-aware multi-modal multi-agent artificial intelligence | | 0
A machine-compiled macroevolutionary history of Phanerozoic life | | 0
Segmentation in large-scale cellular electron microscopy with deep learning: A literature survey | | 0
A Systematic Decade Review of Trip Route Planning with Travel Time Estimation based on User Preferences and Behavior | | 0
A High Precision Pipeline for Financial Knowledge Graph Construction | | 0
Reasoning about disclosure in data integration in the presence of source constraints | | 0
Data augmentation with automated machine learning: approaches and performance comparison with classical data augmentation methods | | 0
A survey on deep learning approaches for data integration in autonomous driving system | | 0
A health telemonitoring platform based on data integration from different sources | | 0
A Circular Construction Product Ontology for End-of-Life Decision-Making | | 0
A Survey of Data Quality Measurement and Monitoring Tools | | 0
Assumption-Lean Post-Integrated Inference with Negative Control Outcomes | | 0
A Gap in Time: The Challenge of Processing Heterogeneous IoT Data in Digitalized Buildings | | 0
Accu-Help: A Machine Learning based Smart Healthcare Framework for Accurate Detection of Obsessive Compulsive Disorder | | 0
Assessing the Reproducibility of Machine-learning-based Biomarker Discovery in Parkinson's Disease | | 0
A Semantic Analyzer for the Comprehension of the Spontaneous Arabic Speech | | 0
A Framework for Accurate Drought Forecasting System Using Semantics-Based Data Integration Middleware | | 0
A robust kernel machine regression towards biomarker selection in multi-omics datasets of osteoporosis for drug discovery | | 0
A review of machine learning approaches, challenges and prospects for computational tumor pathology | | 0
Adverse Childhood Experiences Ontology for Mental Health Surveillance, Research, and Evaluation: Advanced Knowledge Representation and Semantic Web Techniques | | 0
Cross-Format Retrieval-Augmented Generation in XR with LLMs for Context-Aware Maintenance Assistance | | 0
Crowd Safety Manager: Towards Data-Driven Active Decision Support for Planning and Control of Crowd Events | | 0
Data Fusion by Matrix Factorization | | 0
Advancing Precision Oncology Through Modeling of Longitudinal and Multimodal Data | | 0
A Primer on the Data Cleaning Pipeline | | 0
CrashSage: A Large Language Model-Centered Framework for Contextual and Interpretable Traffic Crash Analysis | | 0
Application of Artificial Intelligence in Schizophrenia Rehabilitation Management: A Systematic Scoping Review | | 0
Combining exome and gene expression datasets in one graphical model of disease to empower the discovery of disease mechanisms | | 0
Advancing Explainable Autonomous Vehicle Systems: A Comprehensive Review and Research Roadmap | | 0
Acceleration of Deep Neural Network Training with Resistive Cross-Point Devices | | 0
Crop Knowledge Discovery Based on Agricultural Big Data Integration | | 0
Common Foundations for SHACL, ShEx, and PG-Schema | | 0
Community-Based Data Integration of Course and Job Data in Support of Personalized Career-Education Recommendations | | 0
A Pseudo-Likelihood Approach to Linear Regression with Partially Shuffled Data | | 0
Computer-Assisted Analysis of Biomedical Images | | 0
Cognitive network science for understanding online social cognitions: A brief review | | 0
Conditionally Invariant Representation Learning for Disentangling Cellular Heterogeneity | | 0
Context-Aware Analytics in MOM Applications | | 0
Contextual Data Integration for Bike-sharing Demand Prediction with Graph Neural Networks in Degraded Weather Conditions | | 0
Contrastive Entity Linkage: Mining Variational Attributes from Large Catalogs for Entity Linkage | | 0
Cluster Quilting: Spectral Clustering for Patchwork Learning | | 0
Page 2 of 9

No leaderboard results yet.