| DIG: A Turnkey Library for Diving into Graph Deep Learning Research | Mar 23, 2021 | Benchmarking, Deep Learning | —Unverified | 0 |
| DiLiGenT102: A Photometric Stereo Benchmark Dataset With Controlled Shape and Material Variation | Jan 1, 2022 | Benchmarking | —Unverified | 0 |
| DIMCIM: A Quantitative Evaluation Framework for Default-mode Diversity and Generalization in Text-to-Image Generative Models | Jun 5, 2025 | Benchmarking, Diversity | —Unverified | 0 |
| DiPCo -- Dinner Party Corpus | Sep 30, 2019 | Benchmarking | —Unverified | 0 |
| DiPlomat: A Dialogue Dataset for Situated Pragmatic Reasoning | Jun 15, 2023 | Benchmarking, Conversational Question Answering | —Unverified | 0 |
| Disability prediction in multiple sclerosis using performance outcome measures and demographic data | Apr 8, 2022 | Benchmarking, BIG-bench Machine Learning | —Unverified | 0 |
| Disambiguation in Conversational Question Answering in the Era of LLM: A Survey | May 18, 2025 | Benchmarking, Conversational Question Answering | —Unverified | 0 |
| DISC: a Dataset for Integrated Sensing and Communication in mmWave Systems | Jun 15, 2023 | Activity Recognition, Benchmarking | —Unverified | 0 |
| DISCOMAN: Dataset of Indoor SCenes for Odometry, Mapping And Navigation | Sep 26, 2019 | Benchmarking, Panoptic Segmentation | —Unverified | 0 |
| Discosuite - A parser test suite for German discontinuous structures | May 1, 2014 | Benchmarking, Constituency Parsing | —Unverified | 0 |
| Discovering Visual Concept Structure with Sparse and Incomplete Tags | May 30, 2017 | Benchmarking, Clustering | —Unverified | 0 |
| Discriminating modelling approaches for Point in Time Economic Scenario Generation | Aug 19, 2021 | Benchmarking | —Unverified | 0 |
| Discriminative Link Prediction using Local Links, Node Features and Community Structure | Oct 17, 2013 | Benchmarking, Clustering | —Unverified | 0 |
| Disentangling coincident cell events using deep transfer learning and compressive sensing | Jul 17, 2025 | Benchmarking, Compressive Sensing | —Unverified | 0 |
| DISL: Fueling Research with A Large Dataset of Solidity Smart Contracts | Mar 25, 2024 | Benchmarking | —Unverified | 0 |
| DiS-ReX: A Multilingual Dataset for Distantly Supervised Relation Extraction | Sep 17, 2021 | Benchmarking, Relation | —Unverified | 0 |
| Distortion-adaptive Salient Object Detection in 360° Omnidirectional Images | Sep 11, 2019 | Benchmarking, object-detection | —Unverified | 0 |
| Distributed Evolution Strategies with Multi-Level Learning for Large-Scale Black-Box Optimization | Oct 9, 2023 | Benchmarking | —Unverified | 0 |
| Distributed Software-Defined Network Architecture for Smart Grid Resilience to Denial-of-Service Attacks | Dec 20, 2022 | Benchmarking | —Unverified | 0 |
| Distributed Training Large-Scale Deep Architectures | Aug 10, 2017 | Benchmarking, Deep Learning | —Unverified | 0 |
| Distribution-Based Invariant Deep Networks for Learning Meta-Features | Jun 24, 2020 | Benchmarking, General Classification | —Unverified | 0 |
| Sensitivity analysis and experimental evaluation of PID-like continuous sliding mode control | Aug 13, 2022 | Benchmarking, Sensitivity | —Unverified | 0 |
| Diverse Community Data for Benchmarking Data Privacy Algorithms | Jun 20, 2023 | Benchmarking | —Unverified | 0 |
| DLBricks: Composable Benchmark Generation to Reduce Deep Learning Benchmarking Effort on CPUs (Extended) | Nov 18, 2019 | Benchmarking, CPU | —Unverified | 0 |
| DLUE: Benchmarking Document Language Understanding | May 16, 2023 | Benchmarking, Document Classification | —Unverified | 0 |
| DNR Bench: Benchmarking Over-Reasoning in Reasoning LLMs | Mar 20, 2025 | Benchmarking, Hallucination | —Unverified | 0 |
| A Sober Look at the Robustness of CLIPs to Spurious Features | Mar 18, 2024 | Benchmarking | —Unverified | 0 |
| Does AI for science need another ImageNet Or totally different benchmarks? A case study of machine learning force fields | Aug 11, 2023 | Benchmarking | —Unverified | 0 |
| Does imputation matter? Benchmark for predictive models | Jul 6, 2020 | Benchmarking, BIG-bench Machine Learning | —Unverified | 0 |
| Domain Adaptation for Arabic Machine Translation: The Case of Financial Texts | Sep 22, 2023 | Articles, Benchmarking | —Unverified | 0 |
| Domain Aligned CLIP for Few-shot Classification | Nov 15, 2023 | Benchmarking, Classification | —Unverified | 0 |
| Domain Generalization in Computational Pathology: Survey and Guidelines | Oct 30, 2023 | Benchmarking, Diagnostic | —Unverified | 0 |
| Don't stack layers in graph neural networks, wire them randomly | Jan 1, 2021 | Attribute, Benchmarking | —Unverified | 0 |
| Downsampling and geometric feature methods for EEG classification tasks with CNNs | Oct 10, 2020 | Benchmarking, EEG | —Unverified | 0 |
| On the Convergence of Differentially Private Federated Learning on Non-Lipschitz Objectives, and with Normalized Client Updates | Jun 13, 2021 | Benchmarking, Federated Learning | —Unverified | 0 |
| DPO: A Differential and Pointwise Control Approach to Reinforcement Learning | Apr 24, 2024 | Benchmarking, reinforcement-learning | —Unverified | 0 |
| DRAC: Diabetic Retinopathy Analysis Challenge with Ultra-Wide Optical Coherence Tomography Angiography Images | Apr 5, 2023 | Benchmarking, Data Augmentation | —Unverified | 0 |
| Drift in a Popular Metal Oxide Sensor Dataset Reveals Limitations for Gas Classification Benchmarks | Aug 19, 2021 | Benchmarking, Classification | —Unverified | 0 |
| DRIV100: In-The-Wild Multi-Domain Dataset and Evaluation for Real-World Domain Adaptation of Semantic Segmentation | Jan 30, 2021 | Benchmarking, Domain Adaptation | —Unverified | 0 |
| DSLOB: A Synthetic Limit Order Book Dataset for Benchmarking Forecasting Algorithms under Distributional Shift | Nov 17, 2022 | Benchmarking, Time Series | —Unverified | 0 |
| Dual Encoder-Decoder based Generative Adversarial Networks for Disentangled Facial Representation Learning | Sep 19, 2019 | Benchmarking, Decoder | —Unverified | 0 |
| Dual Task Framework for Improving Persona-grounded Dialogue Dataset | Feb 11, 2022 | Benchmarking | —Unverified | 0 |
| DyFEn: Agent-Based Fee Setting in Payment Channel Networks | Oct 15, 2022 | Benchmarking, Deep Reinforcement Learning | —Unverified | 0 |
| Dyna-bAbI: unlocking bAbI's potential with dynamic synthetic benchmarking | Nov 30, 2021 | Benchmarking, Natural Language Understanding | —Unverified | 0 |
| Dyna-bAbI: unlocking bAbI’s potential with dynamic synthetic benchmarking | Jul 1, 2022 | Benchmarking, Natural Language Understanding | —Unverified | 0 |
| Dynabench: Rethinking Benchmarking in NLP | Apr 7, 2021 | Benchmarking | —Unverified | 0 |
| Dynaboard: An Evaluation-As-A-Service Platform for Holistic Next-Generation Benchmarking | May 21, 2021 | Benchmarking | —Unverified | 0 |
| Dynamic benchmarking framework for LLM-based conversational data capture | Feb 4, 2025 | Benchmarking | —Unverified | 0 |
| Dynamic Benchmarking of Masked Language Models on Temporal Concept Drift with Multiple Views | Feb 23, 2023 | Benchmarking | —Unverified | 0 |
| Dynamic Benchmarking of Reasoning Capabilities in Code Large Language Models Under Data Contamination | Mar 6, 2025 | Benchmarking | —Unverified | 0 |