SOTAVerified

Data Integration

Data integration (also called information integration) is the process of consolidating data from a set of heterogeneous data sources into a single uniform data set (materialized integration) or view on the data (virtual integration). Data integration pipelines involve subtasks such as schema matching, table annotation, entity resolution, value normalization, data cleansing, and data fusion. Application domains of data integration include data warehousing, data lakes, and knowledge base consolidation. Surveys on Data integration:

Papers

Showing 51100 of 431 papers

TitleStatusHype
TabulaTime: A Novel Multimodal Deep Learning Framework for Advancing Acute Coronary Syndrome Prediction through Environmental and Clinical Data Integration0
Intermediate triple table: A general architecture for virtual knowledge graphsCode0
Update hydrological states or meteorological forcings? Comparing data assimilation methods for differentiable hydrologic models0
Integrating Weather Station Data and Radar for Precipitation Nowcasting: SmaAt-fUsion and SmaAt-Krige-GNetCode0
Cross-Format Retrieval-Augmented Generation in XR with LLMs for Context-Aware Maintenance Assistance0
Advancing Precision Oncology Through Modeling of Longitudinal and Multimodal Data0
LLMs in Software Security: A Survey of Vulnerability Detection Techniques and InsightsCode0
Synthetic Poisoning Attacks: The Impact of Poisoned MRI Image on U-Net Brain Tumor Segmentation0
Ilargi: a GPU Compatible Factorized ML Model Training Framework0
Common Foundations for SHACL, ShEx, and PG-Schema0
Targeted Data Fusion for Causal Survival Analysis Under Distribution Shift0
A technical review of multi-omics data integration methods: from classical statistical to deep generative approaches0
PulmoFusion: Advancing Pulmonary Health with Efficient Multi-Modal FusionCode0
Fuzzy Integration of Data Lake Tables0
Transforming Social Science Research with Transfer Learning: Social Science Survey Data Integration with AI0
RecKG: Knowledge Graph for Recommender SystemsCode1
Graph-based Retrieval Augmented Generation for Dynamic Few-shot Text Classification0
IAM: Enhancing RGB-D Instance Segmentation with New BenchmarksCode0
Real-time Cross-modal Cybersickness Prediction in Virtual Reality0
Column Property Annotation using Large Language ModelsCode1
Mind the Data Gap: Bridging LLMs to Enterprise Data Integration0
Semantic Web: Past, Present, and FutureCode0
Learn2Mix: Training Neural Networks Using Adaptive Data IntegrationCode0
GraphSeqLM: A Unified Graph Language Framework for Omic Graph LearningCode0
Federated Learning for Coronary Artery Plaque Detection in Atherosclerosis Using IVUS Imaging: A Multi-Hospital Collaboration0
Clinical Trials Ontology Engineering with Large Language Models0
Knowledge Graphs: The Future of Data Integration and Insightful Discovery0
Demonstrating Data-to-Knowledge Pipelines for Connecting Production Sites in the World Wide Lab0
Data Integration with Fusion Searchlight: Classifying Brain States from Resting-state fMRICode0
Advancements in Machine Learning and Deep Learning for Early Detection and Management of Mental Health Disorder0
MISFEAT: Feature Selection for Subgroups with Systematic Missing Data0
Multi-Layer Privacy-Preserving Record Linkage with Clerical Review based on gradual information disclosure0
Adult Glioma Segmentation in Sub-Saharan Africa using Transfer Learning on Stratified Finetuning Data0
Contextual Data Integration for Bike-sharing Demand Prediction with Graph Neural Networks in Degraded Weather Conditions0
Enhancing Function-Calling Capabilities in LLMs: Strategies for Prompt Formats, Data Integration, and Multilingual Translation0
Towards Unified Molecule-Enhanced Pathology Image Representation Learning via Integrating Spatial TranscriptomicsCode1
Skeleton Detection Using Dual Radars with Integration of Dual-View CNN Models and mmPose0
Multimodal Alignment and Fusion: A Survey0
Optimal Estimation of Shared Singular Subspaces across Multiple Noisy Matrices0
Federated Learning in Chemical Engineering: A Tutorial on a Framework for Privacy-Preserving Collaboration Across Distributed Data SourcesCode0
Unlocking Historical Clinical Trial Data with ALIGN: A Compositional Large Language Model System for Medical Coding0
IoT-Based 3D Pose Estimation and Motion Optimization for Athletes: Application of C3D and OpenPose0
Multi-layer matrix factorization for cancer subtyping using full and partial multi-omics dataset0
Benchmarking pre-trained text embedding models in aligning built asset informationCode0
Multi-Task Adversarial Variational Autoencoder for Estimating Biological Brain Age with Multimodal NeuroimagingCode0
Weakly-Supervised Multimodal Learning on MIMIC-CXRCode0
Gaussian Process Emulators for Few-Shot Segmentation in Cardiac MRICode0
TableGPT2: A Large Multimodal Model with Tabular Data IntegrationCode4
Enhancing Glucose Level Prediction of ICU Patients through Hierarchical Modeling of Irregular Time-SeriesCode0
Reducing Biases in Record Matching Through Scores CalibrationCode0
Show:102550
← PrevPage 2 of 9Next →

No leaderboard results yet.