SOTAVerified

Data Integration

Data integration (also called information integration) is the process of consolidating data from a set of heterogeneous data sources into a single uniform data set (materialized integration) or view on the data (virtual integration). Data integration pipelines involve subtasks such as schema matching, table annotation, entity resolution, value normalization, data cleansing, and data fusion. Application domains of data integration include data warehousing, data lakes, and knowledge base consolidation. Surveys on Data integration:

Papers

Showing 251275 of 431 papers

TitleStatusHype
Ring-lite: Scalable Reasoning via C3PO-Stabilized Reinforcement Learning for LLMs0
Scalable Similarity Joins of Tokenized Strings0
Scaling Data Science Solutions with Semantics and Machine Learning: Bosch Case0
scICML: Information-theoretic Co-clustering-based Multi-view Learning for the Integrative Analysis of Single-cell Multi-omics data0
Secure and Differentially Private Bayesian Learning on Distributed Data0
Segment-based fusion of multi-sensor multi-scale satellite soil moisture retrievals0
Semantic Annotation for Tabular Data0
Semantic Data Management in Data Lakes0
Siamese Graph Neural Networks for Data Integration0
Simplifying Data Integration: SLM-Driven Systems for Unified Semantic Queries Across Heterogeneous Databases0
Skeleton Detection Using Dual Radars with Integration of Dual-View CNN Models and mmPose0
Smart City Digital Twin Framework for Real-Time Multi-Data Integration and Wide Public Distribution0
Specifying, Monitoring, and Executing Workflows in Linked Data Environments0
Statistical Agnostic Regression: a machine learning method to validate regression models0
Stochastic Biological System-of-Systems Modelling for iPSC Culture0
Stratified Data Integration0
Streamlining Knowledge Graph Creation with PyRML0
Structured Matrix Completion with Applications to Genomic Data Integration0
Supervised prediction of aging-related genes from a context-specific protein interaction subnetwork0
Survive the Schema Changes: Integration of Unmanaged Data Using Deep Learning0
Synthetic Poisoning Attacks: The Impact of Poisoned MRI Image on U-Net Brain Tumor Segmentation0
Systematic Literature Review on Clinical Trial Eligibility Matching0
TabulaTime: A Novel Multimodal Deep Learning Framework for Advancing Acute Coronary Syndrome Prediction through Environmental and Clinical Data Integration0
Targeted Data Fusion for Causal Survival Analysis Under Distribution Shift0
Targeting Underrepresented Populations in Precision Medicine: A Federated Transfer Learning Approach0
Show:102550
← PrevPage 11 of 18Next →

No leaderboard results yet.