SOTAVerified

Data Integration

Data integration (also called information integration) is the process of consolidating data from a set of heterogeneous data sources into a single uniform data set (materialized integration) or view on the data (virtual integration). Data integration pipelines involve subtasks such as schema matching, table annotation, entity resolution, value normalization, data cleansing, and data fusion. Application domains of data integration include data warehousing, data lakes, and knowledge base consolidation. Surveys on Data integration:

Papers

Showing 351375 of 431 papers

TitleStatusHype
Bi-Encoders based Species Normalization -- Pairwise Sentence Learning to Rank0
BIM-to-BRICK: Using graph modeling for IoT/BMS and spatial semantic data interoperability within digital data models of buildings0
Biodiversity data standards for the organization and dissemination of complex research projects and digital twins: a guide0
Brain Imaging Foundation Models, Are We There Yet? A Systematic Review of Foundation Models for Brain Imaging and Biomedical Research0
Business Entity Matching with Siamese Graph Convolutional Networks0
Can Large Language Models Unveil the Mysteries? An Exploration of Their Ability to Unlock Information in Complex Scenarios0
Capturing and Anticipating User Intents in Data Analytics via Knowledge Graphs0
CDE-Mapper: Using Retrieval-Augmented Language Models for Linking Clinical Data Elements to Controlled Vocabularies0
CLCLSA: Cross-omics Linked embedding with Contrastive Learning and Self Attention for multi-omics integration with incomplete multi-omics data0
Clinical Trials Ontology Engineering with Large Language Models0
Cluster Quilting: Spectral Clustering for Patchwork Learning0
Cognitive network science for understanding online social cognitions: A brief review0
Combining exome and gene expression datasets in one graphical model of disease to empower the discovery of disease mechanisms0
Common Foundations for SHACL, ShEx, and PG-Schema0
Community-Based Data Integration of Course and Job Data in Support of Personalized Career-Education Recommendations0
Computer-Assisted Analysis of Biomedical Images0
Conditionally Invariant Representation Learning for Disentangling Cellular Heterogeneity0
Context-Aware Analytics in MOM Applications0
Contextual Data Integration for Bike-sharing Demand Prediction with Graph Neural Networks in Degraded Weather Conditions0
Contrastive Entity Linkage: Mining Variational Attributes from Large Catalogs for Entity Linkage0
Control of Renewable Energy Communities using AI and Real-World Data0
Corynebacterium glutamicum regulation beyond transcription: Organizing principles and reconstruction of an extended regulatory network incorporating regulations mediated by small RNA and protein-protein interactions0
Prompt-Matcher: Leveraging Large Models to Reduce Uncertainty in Schema Matching Results0
CrashSage: A Large Language Model-Centered Framework for Contextual and Interpretable Traffic Crash Analysis0
Crop Knowledge Discovery Based on Agricultural Big Data Integration0
Show:102550
← PrevPage 15 of 18Next →

No leaderboard results yet.