SOTAVerified

Data Integration

Data integration (also called information integration) is the process of consolidating data from a set of heterogeneous data sources into a single uniform data set (materialized integration) or view on the data (virtual integration). Data integration pipelines involve subtasks such as schema matching, table annotation, entity resolution, value normalization, data cleansing, and data fusion. Application domains of data integration include data warehousing, data lakes, and knowledge base consolidation. Surveys on Data integration:

Papers

Showing 351400 of 431 papers

TitleStatusHype
Profiling Entity Matching Benchmark TasksCode0
Multifaceted Context Representation using Dual Attention for Ontology Alignment0
Survive the Schema Changes: Integration of Unmanaged Data Using Deep Learning0
BayReL: Bayesian Relational Learning for Multi-omics Data IntegrationCode1
LEAPME: Learning-based Property Matching with Embeddings0
SumGNN: Multi-typed Drug Interaction Prediction via Efficient Knowledge Graph SummarizationCode1
Kernel learning approaches for summarising and combining posterior similarity matricesCode0
Towards a Modular Ontology for Space Weather Research0
Proceedings 36th International Conference on Logic Programming (Technical Communications)0
Understanding Reflection Needs for Personal Health Data in Diabetes0
Community-Based Data Integration of Course and Job Data in Support of Personalized Career-Education Recommendations0
DINGO: an ontology for projects and grants linked data0
Petri Nets with Parameterised Data: Modelling and Verification (Extended Version)0
Causal Feature Selection for Algorithmic Fairness0
An Empirical Meta-analysis of the Life Sciences (Linked?) Open Data on the WebCode0
NEMA: Automatic Integration of Large Network Management Databases0
Secure and Differentially Private Bayesian Learning on Distributed Data0
Consistent and Flexible Selectivity Estimation for High-Dimensional DataCode0
The scalable Birth-Death MCMC Algorithm for Mixed Graphical Model Learning with Application to Genomic Data IntegrationCode0
A Systematic Approach to Featurization for Cancer Drug Sensitivity Predictions with Deep LearningCode0
Elastic Coupled Co-clustering for Single-Cell Genomic DataCode0
Semantic interoperability based on the European Materials and Modelling Ontology and its ontological paradigm: Mereosemiotics0
Crop Knowledge Discovery Based on Agricultural Big Data Integration0
Data Warehouse and Decision Support on Integrated Crop Big Data0
Contrastive Entity Linkage: Mining Variational Attributes from Large Catalogs for Entity Linkage0
Data integration and prediction models of photovoltaic production from Brazilian northeastern0
Siamese Graph Neural Networks for Data Integration0
Enhancing streamflow forecast and extracting insights using long-short term memory networks with data integration at continental scales0
Adverse Childhood Experiences Ontology for Mental Health Surveillance, Research, and Evaluation: Advanced Knowledge Representation and Semantic Web Techniques0
Leveraging Legacy Data to Accelerate Materials Design via Preference LearningCode0
Privacy-preserving Federated Bayesian Learning of a Generative Model for Imbalanced Classification of Clinical Data0
A Pseudo-Likelihood Approach to Linear Regression with Partially Shuffled Data0
Ontology Based Information Integration: A Survey0
Proceedings 35th International Conference on Logic Programming (Technical Communications)0
Local Embeddings for Relational Data Integration0
Supervised prediction of aging-related genes from a context-specific protein interaction subnetwork0
Linking Graph Entities with Multiplicity and Provenance0
Fast Record Linkage for Company Entities0
A Survey of Data Quality Measurement and Monitoring Tools0
Reasoning about disclosure in data integration in the presence of source constraints0
Designing and Implementing Data Warehouse for Agricultural Big Data0
Feature Selection for Data Integration with Mixed Multi-view Data0
Scalable Similarity Joins of Tokenized Strings0
Hybrid Modelling in Oncology: Sucesses, Challenges and Hopes0
TweetsKB: A Public and Large-Scale RDF Corpus of Annotated Tweets0
Reuse and Adaptation for Entity Resolution through Transfer Learning0
Machine Learning for Integrating Data in Biology and Medicine: Principles, Practice, and Opportunities0
Specifying, Monitoring, and Executing Workflows in Linked Data Environments0
K-shell decomposition reveals hierarchical cortical organization of the human brain0
Joint Estimation and Inference for Data Integration Problems based on Multiple Multi-layered Gaussian Graphical ModelsCode0
Show:102550
← PrevPage 8 of 9Next →

No leaderboard results yet.