SOTAVerified

Data Integration

Data integration (also called information integration) is the process of consolidating data from a set of heterogeneous data sources into a single uniform data set (materialized integration) or view on the data (virtual integration). Data integration pipelines involve subtasks such as schema matching, table annotation, entity resolution, value normalization, data cleansing, and data fusion. Application domains of data integration include data warehousing, data lakes, and knowledge base consolidation. Surveys on Data integration:

Papers

Showing 151175 of 431 papers

TitleStatusHype
Graph Integration for Diffusion-Based Manifold AlignmentCode0
Multimodal Quantum Natural Language Processing: A Novel Framework for using Quantum Methods to Analyse Real DataCode0
Document Parsing Unveiled: Techniques, Challenges, and Prospects for Structured Information Extraction0
metasnf: Meta Clustering with Similarity Network Fusion in R0
Development of CODO: A Comprehensive Tool for COVID-19 Data Representation, Analysis, and Visualization0
The S2 Hierarchical Discrete Global Grid as a Nexus for Data Representation, Integration, and Querying Across Geospatial Knowledge Graphs0
KcMF: A Knowledge-compliant Framework for Schema and Entity Matching with Fine-tuning-free LLMs0
Unified Representation of Genomic and Biomedical Concepts through Multi-Task, Multi-Source Contrastive Learning0
InstructBioMol: Advancing Biomolecule Understanding and Design Following Human Instructions0
Multilayer network approaches to omics data integration in Digital Twins for cancer research0
Empowering Cognitive Digital Twins with Generative Foundation Models: Developing a Low-Carbon Integrated Freight Transportation System0
Assumption-Lean Post-Integrated Inference with Negative Control Outcomes0
Deep learning-based Visual Measurement Extraction within an Adaptive Digital Twin Framework from Limited Data Using Transfer Learning0
Comparative Analysis of Multi-Omics Integration Using Advanced Graph Neural Networks for Cancer ClassificationCode0
Nested Deep Learning Model Towards A Foundation Model for Brain Signal Data0
Enhancing End Stage Renal Disease Outcome Prediction: A Multi-Sourced Data-Driven Approach0
Evaluating Blocking Biases in Entity MatchingCode0
Design and Evaluation of a CDSS for Drug Allergy Management Using LLMs and Pharmaceutical Data Integration0
Multi-omics data integration for early diagnosis of hepatocellular carcinoma (HCC) using machine learning0
Beyond designer's knowledge: Generating materials design hypotheses via large language models0
Multi-faceted Neuroimaging Data Integration via Analysis of Subspaces0
Prompt-Matcher: Leveraging Large Models to Reduce Uncertainty in Schema Matching Results0
EUR-USD Exchange Rate Forecasting Based on Information Fusion with Large Language Models and Deep Learning Methods0
Cross-Species Data Integration for Enhanced Layer Segmentation in Kidney PathologyCode0
EraW-Net: Enhance-Refine-Align W-Net for Scene-Associated Driver Attention Estimation0
Show:102550
← PrevPage 7 of 18Next →

No leaderboard results yet.