SOTAVerified

Data Integration

Data integration (also called information integration) is the process of consolidating data from a set of heterogeneous data sources into a single uniform data set (materialized integration) or view on the data (virtual integration). Data integration pipelines involve subtasks such as schema matching, table annotation, entity resolution, value normalization, data cleansing, and data fusion. Application domains of data integration include data warehousing, data lakes, and knowledge base consolidation. Surveys on Data integration:

Papers

Showing 101150 of 431 papers

TitleStatusHype
Ontology Population using LLMs0
Neural decoding from stereotactic EEG: accounting for electrode variability across subjects0
Capturing and Anticipating User Intents in Data Analytics via Knowledge Graphs0
Matchmaker: Self-Improving Large Language Model Programs for Schema Matching0
Novel Architecture for Distributed Travel Data Integration and Service Provision Using Microservices0
Graph Integration for Diffusion-Based Manifold AlignmentCode0
Multimodal Quantum Natural Language Processing: A Novel Framework for using Quantum Methods to Analyse Real DataCode0
Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction0
metasnf: Meta Clustering with Similarity Network Fusion in R0
Development of CODO: A Comprehensive Tool for COVID-19 Data Representation, Analysis, and Visualization0
The S2 Hierarchical Discrete Global Grid as a Nexus for Data Representation, Integration, and Querying Across Geospatial Knowledge Graphs0
KcMF: A Knowledge-compliant Framework for Schema and Entity Matching with Fine-tuning-free LLMs0
Unified Representation of Genomic and Biomedical Concepts through Multi-Task, Multi-Source Contrastive Learning0
InstructBioMol: Advancing Biomolecule Understanding and Design Following Human Instructions0
Empowering Cognitive Digital Twins with Generative Foundation Models: Developing a Low-Carbon Integrated Freight Transportation System0
Multilayer network approaches to omics data integration in Digital Twins for cancer research0
Deep learning-based Visual Measurement Extraction within an Adaptive Digital Twin Framework from Limited Data Using Transfer Learning0
Assumption-Lean Post-Integrated Inference with Negative Control Outcomes0
Comparative Analysis of Multi-Omics Integration Using Advanced Graph Neural Networks for Cancer ClassificationCode0
Nested Deep Learning Model Towards A Foundation Model for Brain Signal Data0
Enhancing End Stage Renal Disease Outcome Prediction: A Multi-Sourced Data-Driven Approach0
Evaluating Blocking Biases in Entity MatchingCode0
Design and Evaluation of a CDSS for Drug Allergy Management Using LLMs and Pharmaceutical Data Integration0
Multi-omics data integration for early diagnosis of hepatocellular carcinoma (HCC) using machine learning0
Fine-tuning Large Language Models for Entity MatchingCode1
Beyond designer's knowledge: Generating materials design hypotheses via large language models0
AnyMatch -- Efficient Zero-Shot Entity Matching with a Small Language ModelCode1
Multi-faceted Neuroimaging Data Integration via Analysis of Subspaces0
Prompt-Matcher: Leveraging Large Models to Reduce Uncertainty in Schema Matching Results0
EUR-USD Exchange Rate Forecasting Based on Information Fusion with Large Language Models and Deep Learning Methods0
Exploring the Feasibility of Automated Data Standardization using Large Language Models for Seamless PositioningCode3
Cross-Species Data Integration for Enhanced Layer Segmentation in Kidney PathologyCode0
Personalized graph feature-based multi-omics data integration for cancer subtype identification0
EraW-Net: Enhance-Refine-Align W-Net for Scene-Associated Driver Attention Estimation0
MMREC: LLM Based Multi-Modal Recommender System0
Enhancing Medical Learning and Reasoning Systems: A Boxology-Based Comparative Analysis of Design Patterns0
Conformal Trajectory Prediction with Multi-View Data Integration in Cooperative DrivingCode1
MSMA: Multi-agent Trajectory Prediction in Connected and Autonomous Vehicle Environment with Multi-source Data IntegrationCode1
VITAL: Interactive Few-Shot Imitation Learning via Visual Human-in-the-Loop Corrections0
Intelligent Cross-Organizational Process Mining: A Survey and New Perspectives0
AtomAgents: Alloy design and discovery through physics-aware multi-modal multi-agent artificial intelligence0
Multi-Modal Dataset Creation for Federated Learning with DICOM Structured Reports0
DALL-M: Context-Aware Clinical Data Augmentation with LLMsCode0
TCKAN:A Novel Integrated Network Model for Predicting Mortality Risk in Sepsis Patients0
AI-driven multi-omics integration for multi-scale predictive modeling of causal genotype-environment-phenotype relationships0
Leveraging Large Language Models for Integrated Satellite-Aerial-Terrestrial Networks: Recent Advances and Future Directions0
Entropic Optimal Transport Eigenmaps for Nonlinear Alignment and Joint Embedding of High-Dimensional DatasetsCode0
LLM See, LLM Do: Guiding Data Generation to Target Non-Differentiable ObjectivesCode1
Multimodal Data Integration for Precision Oncology: Challenges and Future Directions0
Optimal Transport for Latent Integration with An Application to Heterogeneous Neuronal Activity Data0
Show:102550
← PrevPage 3 of 9Next →

No leaderboard results yet.