EasySpider: A No-Code Visual System for Crawling the Web Apr 30, 2023 Data Integration Marketing
Code Code Available 7TableGPT2: A Large Multimodal Model with Tabular Data Integration Nov 4, 2024 Benchmarking Data Integration
Code Code Available 4Exploring the Feasibility of Automated Data Standardization using Large Language Models for Seamless Positioning Aug 22, 2024 Data Integration
Code Code Available 3Docs2KG: Unified Knowledge Graph Construction from Heterogeneous Documents Assisted by Large Language Models Jun 5, 2024 Data Integration graph construction
Code Code Available 3Intervention-Aware Forecasting: Breaking Historical Limits from a System Perspective May 22, 2024 Data Integration Sensitivity
Code Code Available 3An RML-FNML module for Python user-defined functions in Morph-KGC Apr 1, 2024 Data Integration Knowledge Graphs
Code Code Available 3Declarative generation of RDF-star graphs from heterogeneous data Mar 20, 2024 Data Integration Knowledge Graphs
Code Code Available 3Adaptive Multi-Scale Decomposition Framework for Time Series Forecasting Jun 6, 2024 Computational Efficiency Data Integration
Code Code Available 2CARTE: Pretraining and Transfer for Tabular Learning Feb 26, 2024 Data Integration Transfer Learning
Code Code Available 2Integrate Any Omics: Towards genome-wide data integration for patient stratification Jan 15, 2024 Data Integration Diversity
Code Code Available 2Boosting Knowledge Graph Generation from Tabular Data with RML Views May 22, 2023 Data Integration Graph Generation
Code Code Available 2Morph-KGC: Scalable knowledge graph materialization with mapping partitions Aug 25, 2022 Data Integration Knowledge Graphs
Code Code Available 2Graph Neural Networks for Multimodal Single-Cell Data Integration Mar 3, 2022 Data Integration Graph Neural Network
Code Code Available 2scSSL-Bench: Benchmarking Self-Supervised Learning for Single-Cell Data Jun 10, 2025 Benchmarking Data Augmentation
Code Code Available 1KramaBench: A Benchmark for AI Systems on Data-to-Insight Pipelines over Data Lakes Jun 6, 2025 Code Generation Data Integration
Code Code Available 1FusDreamer: Label-efficient Remote Sensing World Model for Multimodal Data Classification Mar 18, 2025 Combinatorial Optimization Contrastive Learning
Code Code Available 1RecKG: Knowledge Graph for Recommender Systems Jan 7, 2025 Attribute Data Integration
Code Code Available 1Column Property Annotation using Large Language Models Jan 1, 2025 Columns Property Annotation Column Type Annotation
Code Code Available 1Towards Unified Molecule-Enhanced Pathology Image Representation Learning via Integrating Spatial Transcriptomics Dec 1, 2024 Data Integration Representation Learning
Code Code Available 1Fine-tuning Large Language Models for Entity Matching Sep 12, 2024 Data Integration Entity Resolution
Code Code Available 1AnyMatch -- Efficient Zero-Shot Entity Matching with a Small Language Model Sep 6, 2024 Attribute AutoML
Code Code Available 1Conformal Trajectory Prediction with Multi-View Data Integration in Cooperative Driving Aug 1, 2024 Conformal Prediction Data Integration
Code Code Available 1MSMA: Multi-agent Trajectory Prediction in Connected and Autonomous Vehicle Environment with Multi-source Data Integration Jul 31, 2024 Autonomous Vehicles Data Integration
Code Code Available 1LLM See, LLM Do: Guiding Data Generation to Target Non-Differentiable Objectives Jul 1, 2024 Data Integration
Code Code Available 1WeatherQA: Can Multimodal Language Models Reason about Severe Weather? Jun 17, 2024 Data Integration
Code Code Available 1iMD4GC: Incomplete Multimodal Data Integration to Advance Precise Treatment Response Prediction and Survival Analysis for Gastric Cancer Apr 1, 2024 Data Integration Survival Analysis
Code Code Available 1eipy: An Open-Source Python Package for Multi-modal Data Integration using Heterogeneous Ensembles Jan 17, 2024 Data Integration
Code Code Available 1Cost-Effective In-Context Learning for Entity Resolution: A Design Space Exploration Dec 7, 2023 Data Integration Entity Resolution
Code Code Available 1Transformer-based Entity Legal Form Classification Oct 19, 2023 Classification Data Integration
Code Code Available 1Entity Matching using Large Language Models Oct 17, 2023 Data Integration Entity Resolution
Code Code Available 1Know2BIO: A Comprehensive Dual-View Benchmark for Evolving Biomedical Knowledge Graphs Oct 5, 2023 Data Integration Entity Alignment
Code Code Available 1MapperGPT: Large Language Models for Linking and Mapping Entities Oct 5, 2023 Anatomy Data Integration
Code Code Available 1Towards Lightweight Data Integration using Multi-workflow Provenance and Data Observability Aug 17, 2023 CPU Data Integration
Code Code Available 1Is your data alignable? Principled and interpretable alignability testing and integration of single-cell data Aug 3, 2023 Data Integration Imputation
Code Code Available 1Column Type Annotation using ChatGPT Jun 1, 2023 Column Type Annotation Data Integration
Code Code Available 1Using ChatGPT for Entity Matching May 5, 2023 Data Integration Entity Resolution
Code Code Available 1Unicorn: A Unified Multi-tasking Model for Supporting Matching Tasks in Data Integration May 1, 2023 Data Integration Entity Resolution
Code Code Available 1Web-Scale Academic Name Disambiguation: the WhoIsWho Benchmark, Leaderboard, and Toolkit Feb 23, 2023 Data Integration
Code Code Available 1Unsupervised Entity Alignment for Temporal Knowledge Graphs Feb 1, 2023 Data Integration Decoder
Code Code Available 1WDC Products: A Multi-Dimensional Entity Matching Benchmark Jan 23, 2023 Contrastive Learning Data Integration
Code Code Available 1Integrating Multimodal Data for Joint Generative Modeling of Complex Dynamics Dec 15, 2022 Data Integration Time Series
Code Code Available 1Sudowoodo: Contrastive Self-supervised Learning for Multi-purpose Data Integration and Preparation Jul 8, 2022 Blocking Contrastive Learning
Code Code Available 1Domain Adaptation for Deep Entity Resolution: A Design Space Exploration Jun 1, 2022 Data Integration Domain Adaptation
Code Code Available 1Cross-Domain Data Integration for Named Entity Disambiguation in Biomedical Text Oct 15, 2021 Data Integration Entity Disambiguation
Code Code Available 1Data Integration for Toxic Comment Classification: Making More Than 40 Datasets Easily Accessible in One Unified Format Aug 1, 2021 Data Integration Toxic Comment Classification
Code Code Available 1Contrastive Mixture of Posteriors for Counterfactual Inference, Data Integration and Fairness Jun 15, 2021 counterfactual Counterfactual Inference
Code Code Available 1Dual-Objective Fine-Tuning of BERT for Entity Matching Jun 1, 2021 Data Integration Entity Resolution
Code Code Available 1A Variational Information Bottleneck Approach to Multi-Omics Data Integration Feb 5, 2021 Data Integration
Code Code Available 1COMO: A Pipeline for Multi-Omics Data Integration in Metabolic Modeling and Drug Discovery Nov 4, 2020 Data Integration Drug Discovery
Code Code Available 1GripNet: Graph Information Propagation on Supergraph for Heterogeneous Graphs Oct 29, 2020 Data Integration Graph Representation Learning
Code Code Available 1