SOTAVerified

Data Integration

Data integration (also called information integration) is the process of consolidating data from a set of heterogeneous data sources into a single uniform data set (materialized integration) or view on the data (virtual integration). Data integration pipelines involve subtasks such as schema matching, table annotation, entity resolution, value normalization, data cleansing, and data fusion. Application domains of data integration include data warehousing, data lakes, and knowledge base consolidation. Surveys on Data integration:

Papers

Showing 151200 of 431 papers

TitleStatusHype
Graph Integration for Diffusion-Based Manifold AlignmentCode0
Multimodal Quantum Natural Language Processing: A Novel Framework for using Quantum Methods to Analyse Real DataCode0
Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction0
metasnf: Meta Clustering with Similarity Network Fusion in R0
Development of CODO: A Comprehensive Tool for COVID-19 Data Representation, Analysis, and Visualization0
The S2 Hierarchical Discrete Global Grid as a Nexus for Data Representation, Integration, and Querying Across Geospatial Knowledge Graphs0
KcMF: A Knowledge-compliant Framework for Schema and Entity Matching with Fine-tuning-free LLMs0
Unified Representation of Genomic and Biomedical Concepts through Multi-Task, Multi-Source Contrastive Learning0
InstructBioMol: Advancing Biomolecule Understanding and Design Following Human Instructions0
Multilayer network approaches to omics data integration in Digital Twins for cancer research0
Empowering Cognitive Digital Twins with Generative Foundation Models: Developing a Low-Carbon Integrated Freight Transportation System0
Assumption-Lean Post-Integrated Inference with Negative Control Outcomes0
Deep learning-based Visual Measurement Extraction within an Adaptive Digital Twin Framework from Limited Data Using Transfer Learning0
Comparative Analysis of Multi-Omics Integration Using Advanced Graph Neural Networks for Cancer ClassificationCode0
Nested Deep Learning Model Towards A Foundation Model for Brain Signal Data0
Enhancing End Stage Renal Disease Outcome Prediction: A Multi-Sourced Data-Driven Approach0
Evaluating Blocking Biases in Entity MatchingCode0
Design and Evaluation of a CDSS for Drug Allergy Management Using LLMs and Pharmaceutical Data Integration0
Multi-omics data integration for early diagnosis of hepatocellular carcinoma (HCC) using machine learning0
Beyond designer's knowledge: Generating materials design hypotheses via large language models0
Multi-faceted Neuroimaging Data Integration via Analysis of Subspaces0
Prompt-Matcher: Leveraging Large Models to Reduce Uncertainty in Schema Matching Results0
EUR-USD Exchange Rate Forecasting Based on Information Fusion with Large Language Models and Deep Learning Methods0
Cross-Species Data Integration for Enhanced Layer Segmentation in Kidney PathologyCode0
EraW-Net: Enhance-Refine-Align W-Net for Scene-Associated Driver Attention Estimation0
Personalized graph feature-based multi-omics data integration for cancer subtype identification0
MMREC: LLM Based Multi-Modal Recommender System0
Enhancing Medical Learning and Reasoning Systems: A Boxology-Based Comparative Analysis of Design Patterns0
VITAL: Interactive Few-Shot Imitation Learning via Visual Human-in-the-Loop Corrections0
Intelligent Cross-Organizational Process Mining: A Survey and New Perspectives0
AtomAgents: Alloy design and discovery through physics-aware multi-modal multi-agent artificial intelligence0
Multi-Modal Dataset Creation for Federated Learning with DICOM Structured Reports0
DALL-M: Context-Aware Clinical Data Augmentation with LLMsCode0
TCKAN:A Novel Integrated Network Model for Predicting Mortality Risk in Sepsis Patients0
AI-driven multi-omics integration for multi-scale predictive modeling of causal genotype-environment-phenotype relationships0
Leveraging Large Language Models for Integrated Satellite-Aerial-Terrestrial Networks: Recent Advances and Future Directions0
Entropic Optimal Transport Eigenmaps for Nonlinear Alignment and Joint Embedding of High-Dimensional DatasetsCode0
Multimodal Data Integration for Precision Oncology: Challenges and Future Directions0
Optimal Transport for Latent Integration with An Application to Heterogeneous Neuronal Activity Data0
Multi-object Data Integration in the Study of Primary Progressive Aphasia0
Moment-based parameter inference with error guarantees for stochastic reaction networksCode0
Data Issues in Industrial AI System: A Meta-Review and Research Strategy0
IoT-Based Preventive Mental Health Using Knowledge Graphs and Standards for Better Well-Being0
Cluster Quilting: Spectral Clustering for Patchwork Learning0
A Survey of Pipeline Tools for Data EngineeringCode0
Embedding-based Multimodal Learning on Pan-Squamous Cell Carcinomas for Improved Survival Outcomes0
Multimodal Contextualized Semantic Parsing from SpeechCode0
Gaussian Copula Models for Nonignorable Missing Data Using Auxiliary Marginal QuantilesCode0
Combining Experimental and Historical Data for Policy EvaluationCode0
Leveraging Large Language Models for Entity Matching0
Show:102550
← PrevPage 4 of 9Next →

No leaderboard results yet.