SOTAVerified

Data Integration

Data integration (also called information integration) is the process of consolidating data from a set of heterogeneous data sources into a single uniform data set (materialized integration) or view on the data (virtual integration). Data integration pipelines involve subtasks such as schema matching, table annotation, entity resolution, value normalization, data cleansing, and data fusion. Application domains of data integration include data warehousing, data lakes, and knowledge base consolidation. Surveys on Data integration:

Papers

Showing 101125 of 431 papers

TitleStatusHype
Mining the contribution of intensive care clinical course to outcome after traumatic brain injuryCode0
Reducing Biases in Record Matching Through Scores CalibrationCode0
LLMs in Software Security: A Survey of Vulnerability Detection Techniques and InsightsCode0
Multi-dataset and Transfer Learning Using Gene Expression Knowledge GraphsCode0
An Empirical Meta-analysis of the Life Sciences (Linked?) Open Data on the WebCode0
Federated Learning in Chemical Engineering: A Tutorial on a Framework for Privacy-Preserving Collaboration Across Distributed Data SourcesCode0
A Survey of Pipeline Tools for Data EngineeringCode0
Multi-Task Adversarial Variational Autoencoder for Estimating Biological Brain Age with Multimodal NeuroimagingCode0
Building Flexible, Scalable, and Machine Learning-ready Multimodal Oncology DatasetsCode0
Neuro-symbolic representation learning on biological knowledge graphsCode0
An attention model to analyse the risk of agitation and urinary tract infections in people with dementiaCode0
A deep learning pipeline for cross-sectional and longitudinal multiview data integrationCode0
A Systematic Approach to Featurization for Cancer Drug Sensitivity Predictions with Deep LearningCode0
Profiling Entity Matching Benchmark TasksCode0
DALL-M: Context-Aware Clinical Data Augmentation with LLMsCode0
DANAE: a denoising autoencoder for underwater attitude estimationCode0
RetSTA: An LLM-Based Approach for Standardizing Clinical Fundus Image ReportsCode0
Scalable Randomized Kernel Methods for Multiview Data Integration and PredictionCode0
Evaluating AI capabilities in detecting conspiracy theories on YouTubeCode0
Evaluating approaches for supervised semantic labelingCode0
Enhancing Glucose Level Prediction of ICU Patients through Hierarchical Modeling of Irregular Time-SeriesCode0
Entropic Optimal Transport Eigenmaps for Nonlinear Alignment and Joint Embedding of High-Dimensional DatasetsCode0
Data Integration with Fusion Searchlight: Classifying Brain States from Resting-state fMRICode0
A Unified Joint Matrix Factorization Framework for Data IntegrationCode0
Evaluating Blocking Biases in Entity MatchingCode0
Show:102550
← PrevPage 5 of 18Next →

No leaderboard results yet.