SOTAVerified

Data Integration

Data integration (also called information integration) is the process of consolidating data from a set of heterogeneous data sources into a single uniform data set (materialized integration) or view on the data (virtual integration). Data integration pipelines involve subtasks such as schema matching, table annotation, entity resolution, value normalization, data cleansing, and data fusion. Application domains of data integration include data warehousing, data lakes, and knowledge base consolidation. Surveys on Data integration:

Papers

Showing 201250 of 431 papers

TitleStatusHype
RDGCL: Reaction-Diffusion Graph Contrastive Learning for Recommendation0
Towards a Microservice-based Middleware for a Multi-hazard Early Warning System0
Cost-Effective In-Context Learning for Entity Resolution: A Design Space ExplorationCode1
Knowledge Graph Reasoning Based on Attention GCN0
A deep learning pipeline for cross-sectional and longitudinal multiview data integrationCode0
Exploring Artificial Intelligence Methods for Energy Prediction in Healthcare Facilities: An In-Depth Extended Systematic Review0
The Battleship Approach to the Low Resource Entity Matching ProblemCode0
mvlearnR and Shiny App for multiview learningCode0
Deep Learning and NLP in Cryptocurrency Forecasting: Integrating Financial, Blockchain, and Social Media Data0
Regression-Based Analysis of Multimodal Single-Cell Data Integration Strategies0
DREIFLUSS: A Minimalist Approach for Table Matching0
Knowledge Graph Representations to enhance Intensive Care Time-Series Predictions0
Multimodal Machine Learning in Image-Based and Clinical Biomedicine: Survey and Prospects0
Semantic Data Management in Data Lakes0
Bi-Encoders based Species Normalization -- Pairwise Sentence Learning to Rank0
Transformer-based Entity Legal Form ClassificationCode1
Uncertainty in Automated Ontology Matching: Lessons Learned from an Empirical Experimentation0
Entity Matching using Large Language ModelsCode1
Graph Neural Network approaches for single-cell data: A recent overview0
Multi-View Variational Autoencoder for Missing Value Imputation in Untargeted Metabolomics0
Temporally Aligning Long Audio Interviews with Questions: A Case Study in Multimodal Data IntegrationCode0
Know2BIO: A Comprehensive Dual-View Benchmark for Evolving Biomedical Knowledge GraphsCode1
MapperGPT: Large Language Models for Linking and Mapping EntitiesCode1
Building Flexible, Scalable, and Machine Learning-ready Multimodal Oncology DatasetsCode0
Smart City Digital Twin Framework for Real-Time Multi-Data Integration and Wide Public Distribution0
GAME: Generalized deep learning model towards multimodal data integration for early screening of adolescent mental disorders0
Current and future directions in network biology0
When Geoscience Meets Foundation Models: Towards General Geoscience Artificial Intelligence System0
A Multimodal Learning Framework for Comprehensive 3D Mineral Prospectivity Modeling with Jointly Learned Structure-Fluid Relationships0
Towards Lightweight Data Integration using Multi-workflow Provenance and Data ObservabilityCode1
Is your data alignable? Principled and interpretable alignability testing and integration of single-cell dataCode1
Scaling Data Science Solutions with Semantics and Machine Learning: Bosch Case0
Crowd Safety Manager: Towards Data-Driven Active Decision Support for Planning and Control of Crowd Events0
BIM-to-BRICK: Using graph modeling for IoT/BMS and spatial semantic data interoperability within digital data models of buildings0
A Primer on the Data Cleaning Pipeline0
Conditionally Invariant Representation Learning for Disentangling Cellular Heterogeneity0
Alternative Telescopic Displacement: An Efficient Multimodal Alignment MethodCode0
A Comparison of Neuroelectrophysiology Databases0
A survey on deep learning approaches for data integration in autonomous driving system0
Cross Modal Data Discovery over Structured and Unstructured Data LakesCode0
Column Type Annotation using ChatGPTCode1
AdapterEM: Pre-trained Language Model Adaptation for Generalized Entity Matching using Adapter-tuningCode0
Stochastic Biological System-of-Systems Modelling for iPSC Culture0
Incomplete Multimodal Learning for Complex Brain Disorders Prediction0
Boosting Knowledge Graph Generation from Tabular Data with RML ViewsCode2
Federated Learning over Harmonized Data Silos0
Patchwork Learning: A Paradigm Towards Integrative Analysis across Diverse Biomedical Data Sources0
Using ChatGPT for Entity MatchingCode1
Unicorn: A Unified Multi-tasking Model for Supporting Matching Tasks in Data IntegrationCode1
EasySpider: A No-Code Visual System for Crawling the WebCode7
Show:102550
← PrevPage 5 of 9Next →

No leaderboard results yet.