SOTAVerified

Data Integration

Data integration (also called information integration) is the process of consolidating data from a set of heterogeneous data sources into a single uniform data set (materialized integration) or a view over the data (virtual integration). Data integration pipelines involve subtasks such as schema matching, table annotation, entity resolution, value normalization, data cleansing, and data fusion. Application domains of data integration include data warehousing, data lakes, and knowledge base consolidation.
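As a rough illustration of how several of these subtasks fit together, the following minimal sketch performs materialized integration over two toy sources. Everything in it is an assumption made for illustration: the records, the attribute names, the hand-written `SCHEMA_MAP`, the use of a normalized email address as the entity key, and the "first non-missing value wins" fusion rule. Real pipelines typically infer mappings and matches rather than hard-coding them.

```python
# Two hypothetical sources describing the same people under different schemas.
source_a = [
    {"name": "Ada Lovelace", "email": "ADA@example.org", "city": "London"},
    {"name": "Alan Turing", "email": "alan@example.org", "city": None},
]
source_b = [
    {"full_name": "Ada  Lovelace", "mail": "ada@example.org ", "town": None, "phone": "555-0100"},
    {"full_name": "Grace Hopper", "mail": "grace@example.org", "town": "New York", "phone": None},
]

# Schema matching: map each source's attribute names onto one target schema.
# The mapping is written by hand here (an assumption of this sketch).
SCHEMA_MAP = {
    "a": {"name": "name", "email": "email", "city": "city"},
    "b": {"full_name": "name", "mail": "email", "town": "city", "phone": "phone"},
}
TARGET = ("name", "email", "city", "phone")


def normalize(value):
    """Value normalization: trim and collapse whitespace; None stays None."""
    if value is None:
        return None
    return " ".join(str(value).split())


def to_target(record, mapping):
    """Rewrite one source record into the target schema."""
    out = dict.fromkeys(TARGET)
    for src_attr, value in record.items():
        out[mapping[src_attr]] = normalize(value)
    if out["email"] is not None:
        out["email"] = out["email"].lower()
    return out


def integrate(sources):
    """Materialized integration: return one fused record per entity."""
    fused = {}
    for source_id, records in sources.items():
        for record in records:
            row = to_target(record, SCHEMA_MAP[source_id])
            # Entity resolution: exact match on the normalized email.
            merged = fused.setdefault(row["email"], dict.fromkeys(TARGET))
            for attr in TARGET:
                # Data fusion: keep the first non-missing value seen.
                if merged[attr] is None:
                    merged[attr] = row[attr]
    return list(fused.values())


integrated = integrate({"a": source_a, "b": source_b})
for row in integrated:
    print(row)
```

The two "Ada Lovelace" records collapse into one entity whose phone number comes from the second source, while the other two people pass through unchanged. A virtual integration would apply the same mapping at query time instead of storing the fused records.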

Papers

Showing 151–200 of 431 papers

Title | Status | Hype
Multi-object Data Integration in the Study of Primary Progressive Aphasia | - | 0
Moment-based parameter inference with error guarantees for stochastic reaction networks | Code | 0
Data Issues in Industrial AI System: A Meta-Review and Research Strategy | - | 0
Cluster Quilting: Spectral Clustering for Patchwork Learning | - | 0
IoT-Based Preventive Mental Health Using Knowledge Graphs and Standards for Better Well-Being | - | 0
WeatherQA: Can Multimodal Language Models Reason about Severe Weather? | Code | 1
A Survey of Pipeline Tools for Data Engineering | Code | 0
Embedding-based Multimodal Learning on Pan-Squamous Cell Carcinomas for Improved Survival Outcomes | - | 0
Multimodal Contextualized Semantic Parsing from Speech | Code | 0
Adaptive Multi-Scale Decomposition Framework for Time Series Forecasting | Code | 2
Gaussian Copula Models for Nonignorable Missing Data Using Auxiliary Marginal Quantiles | Code | 0
Docs2KG: Unified Knowledge Graph Construction from Heterogeneous Documents Assisted by Large Language Models | Code | 3
Combining Experimental and Historical Data for Policy Evaluation | Code | 0
Leveraging Large Language Models for Entity Matching | - | 0
Maximum Temperature Prediction Using Remote Sensing Data Via Convolutional Neural Network | - | 0
An Attention-Based Multi-Context Convolutional Encoder-Decoder Neural Network for Work Zone Traffic Impact Prediction | - | 0
Biodiversity data standards for the organization and dissemination of complex research projects and digital twins: a guide | - | 0
CAVACHON: a hierarchical variational autoencoder to integrate multi-modal single-cell data | Code | 0
A Gap in Time: The Challenge of Processing Heterogeneous IoT Data in Digitalized Buildings | - | 0
Intervention-Aware Forecasting: Breaking Historical Limits from a System Perspective | Code | 3
Address-Specific Sustainable Accommodation Choice Through Real-World Data Integration | - | 0
Application of Artificial Intelligence in Schizophrenia Rehabilitation Management: A Systematic Scoping Review | - | 0
Development of Semantics-Based Distributed Middleware for Heterogeneous Data Integration and its Application for Drought | - | 0
ORBIT: Oak Ridge Base Foundation Model for Earth System Predictability | - | 0
Integrating Heterogeneous Gene Expression Data through Knowledge Graphs for Improving Diabetes Prediction | Code | 0
Interactive Ontology Matching with Cost-Efficient Learning | - | 0
Adaptive Affinity-Based Generalization For MRI Imaging Segmentation Across Resource-Limited Settings | - | 0
Multicriteria Analysis Model in Sustainable Corn Farming Area Planning | - | 0
Detection of bromochloro alkanes in indoor dust using a novel CP-Seeker data integration tool | - | 0
iMD4GC: Incomplete Multimodal Data Integration to Advance Precise Treatment Response Prediction and Survival Analysis for Gastric Cancer | Code | 1
An RML-FNML module for Python user-defined functions in Morph-KGC | Code | 3
Supervised Multiple Kernel Learning approaches for multi-omics data integration | Code | 0
Disambiguate Entity Matching using Large Language Models through Relation Discovery | - | 0
Declarative generation of RDF-star graphs from heterogeneous data | Code | 3
Advancing Explainable Autonomous Vehicle Systems: A Comprehensive Review and Research Roadmap | - | 0
A2CI: A Cloud-based, Service-oriented Geospatial Cyberinfrastructure to Support Atmospheric Research | - | 0
Developing and Deploying Industry Standards for Artificial Intelligence in Education (AIED): Challenges, Strategies, and Future Directions | - | 0
Data augmentation with automated machine learning: approaches and performance comparison with classical data augmentation methods | - | 0
ReMatch: Retrieval Enhanced Schema Matching with LLMs | Code | 0
preon: Fast and accurate entity normalization for drug names and cancer types in precision oncology | Code | 0
CARTE: Pretraining and Transfer for Tabular Learning | Code | 2
Statistical Agnostic Regression: a machine learning method to validate regression models | - | 0
Patient-Centric Knowledge Graphs: A Survey of Current Methods, Challenges, and Applications | - | 0
An Adaptive System Architecture for Multimodal Intelligent Transportation Systems | - | 0
P3LS: Partial Least Squares under Privacy Preservation | - | 0
eipy: An Open-Source Python Package for Multi-modal Data Integration using Heterogeneous Ensembles | Code | 1
Integrate Any Omics: Towards genome-wide data integration for patient stratification | Code | 2
TemporalAugmenter: An Ensemble Recurrent Based Deep Learning Approach for Signal Classification | - | 0
Analyses and Concerns in Precision Medicine: A Statistical Perspective | - | 0
Data Integration Framework for Virtual Reality Enabled Digital Twins | - | 0

No leaderboard results yet.