SOTAVerified

Data Integration

Data integration (also called information integration) is the process of consolidating data from a set of heterogeneous data sources into a single uniform data set (materialized integration) or view on the data (virtual integration). Data integration pipelines involve subtasks such as schema matching, table annotation, entity resolution, value normalization, data cleansing, and data fusion. Application domains of data integration include data warehousing, data lakes, and knowledge base consolidation. Surveys on Data integration:

Papers

Showing 376400 of 431 papers

TitleStatusHype
Data integration and prediction models of photovoltaic production from Brazilian northeastern0
Siamese Graph Neural Networks for Data Integration0
Enhancing streamflow forecast and extracting insights using long-short term memory networks with data integration at continental scales0
Adverse Childhood Experiences Ontology for Mental Health Surveillance, Research, and Evaluation: Advanced Knowledge Representation and Semantic Web Techniques0
Leveraging Legacy Data to Accelerate Materials Design via Preference LearningCode0
Privacy-preserving Federated Bayesian Learning of a Generative Model for Imbalanced Classification of Clinical Data0
A Pseudo-Likelihood Approach to Linear Regression with Partially Shuffled Data0
Ontology Based Information Integration: A Survey0
Proceedings 35th International Conference on Logic Programming (Technical Communications)0
Local Embeddings for Relational Data Integration0
Supervised prediction of aging-related genes from a context-specific protein interaction subnetwork0
Linking Graph Entities with Multiplicity and Provenance0
Fast Record Linkage for Company Entities0
A Survey of Data Quality Measurement and Monitoring Tools0
Reasoning about disclosure in data integration in the presence of source constraints0
Designing and Implementing Data Warehouse for Agricultural Big Data0
Feature Selection for Data Integration with Mixed Multi-view Data0
Scalable Similarity Joins of Tokenized Strings0
Hybrid Modelling in Oncology: Sucesses, Challenges and Hopes0
TweetsKB: A Public and Large-Scale RDF Corpus of Annotated Tweets0
Reuse and Adaptation for Entity Resolution through Transfer Learning0
Machine Learning for Integrating Data in Biology and Medicine: Principles, Practice, and Opportunities0
Specifying, Monitoring, and Executing Workflows in Linked Data Environments0
K-shell decomposition reveals hierarchical cortical organization of the human brain0
Joint Estimation and Inference for Data Integration Problems based on Multiple Multi-layered Gaussian Graphical ModelsCode0
Show:102550
← PrevPage 16 of 18Next →

No leaderboard results yet.