SOTAVerified

Data Integration

Data integration (also called information integration) is the process of consolidating data from a set of heterogeneous data sources into a single uniform data set (materialized integration) or view on the data (virtual integration). Data integration pipelines involve subtasks such as schema matching, table annotation, entity resolution, value normalization, data cleansing, and data fusion. Application domains of data integration include data warehousing, data lakes, and knowledge base consolidation. Surveys on Data integration:

Papers

Showing 251275 of 431 papers

TitleStatusHype
Inferring High-level Geographical Concepts via Knowledge Graph and Multi-scale Data Integration: A Case Study of C-shaped Building Pattern Recognition0
CLCLSA: Cross-omics Linked embedding with Contrastive Learning and Self Attention for multi-omics integration with incomplete multi-omics data0
Learnings from Data Integration for Augmented Language Models0
Scalable Randomized Kernel Methods for Multiview Data Integration and PredictionCode0
Assessing the Reproducibility of Machine-learning-based Biomarker Discovery in Parkinson's Disease0
Multimodal Data Integration for Oncology in the Era of Deep Neural Networks: A Review0
Mining the contribution of intensive care clinical course to outcome after traumatic brain injuryCode0
A Threefold Review on Deep Semantic Segmentation: Efficiency-oriented, Temporal and Depth-aware design0
Web-Scale Academic Name Disambiguation: the WhoIsWho Benchmark, Leaderboard, and ToolkitCode1
GeoFault: A well-founded fault ontology for interoperability in geological modeling0
A Biomedical Knowledge Graph for Biomarker Discovery in Cancer0
Unsupervised Entity Alignment for Temporal Knowledge GraphsCode1
WDC Products: A Multi-Dimensional Entity Matching BenchmarkCode1
Dynamic Basis Function Interpolation for Adaptive In Situ Data Integration in Ocean Modeling0
SOTAB: The WDC Schema.org Table Annotation BenchmarkCode0
Functional Integrative Bayesian Analysis of High-dimensional Multiplatform Genomic Data0
Integrating Multimodal Data for Joint Generative Modeling of Complex DynamicsCode1
Analytical Engines With Context-Rich Processing: Towards Efficient Next-Generation Analytics0
Accu-Help: A Machine Learning based Smart Healthcare Framework for Accurate Detection of Obsessive Compulsive Disorder0
Segment-based fusion of multi-sensor multi-scale satellite soil moisture retrievals0
Graph Neural Networks for Breast Cancer Data Integration0
OPTION: OPTImization Algorithm Benchmarking ONtology0
Privacy-preserving Deep Learning based Record Linkage0
Lipschitz-regularized gradient flows and generative particle algorithms for high-dimensional scarce dataCode0
Efficient Vertical Federated Learning Method for Ridge Regression of Large-Scale Samples via Least-Squares SolutionCode0
Show:102550
← PrevPage 11 of 18Next →

No leaderboard results yet.