| DNR Bench: Benchmarking Over-Reasoning in Reasoning LLMs | Mar 20, 2025 | Benchmarking, Hallucination | —Unverified | 0 |
| A Sober Look at the Robustness of CLIPs to Spurious Features | Mar 18, 2024 | Benchmarking | —Unverified | 0 |
| Does AI for science need another ImageNet Or totally different benchmarks? A case study of machine learning force fields | Aug 11, 2023 | Benchmarking | —Unverified | 0 |
| Does imputation matter? Benchmark for predictive models | Jul 6, 2020 | Benchmarking, BIG-bench Machine Learning | —Unverified | 0 |
| Domain Adaptation for Arabic Machine Translation: The Case of Financial Texts | Sep 22, 2023 | Articles, Benchmarking | —Unverified | 0 |
| Domain Aligned CLIP for Few-shot Classification | Nov 15, 2023 | Benchmarking, Classification | —Unverified | 0 |
| Domain Generalization in Computational Pathology: Survey and Guidelines | Oct 30, 2023 | Benchmarking, Diagnostic | —Unverified | 0 |
| Don't stack layers in graph neural networks, wire them randomly | Jan 1, 2021 | Attribute, Benchmarking | —Unverified | 0 |
| Downsampling and geometric feature methods for EEG classification tasks with CNNs | Oct 10, 2020 | Benchmarking, EEG | —Unverified | 0 |
| On the Convergence of Differentially Private Federated Learning on Non-Lipschitz Objectives, and with Normalized Client Updates | Jun 13, 2021 | Benchmarking, Federated Learning | —Unverified | 0 |
| DPO: A Differential and Pointwise Control Approach to Reinforcement Learning | Apr 24, 2024 | Benchmarking, reinforcement-learning | —Unverified | 0 |
| DRAC: Diabetic Retinopathy Analysis Challenge with Ultra-Wide Optical Coherence Tomography Angiography Images | Apr 5, 2023 | Benchmarking, Data Augmentation | —Unverified | 0 |
| Drift in a Popular Metal Oxide Sensor Dataset Reveals Limitations for Gas Classification Benchmarks | Aug 19, 2021 | Benchmarking, Classification | —Unverified | 0 |
| DRIV100: In-The-Wild Multi-Domain Dataset and Evaluation for Real-World Domain Adaptation of Semantic Segmentation | Jan 30, 2021 | Benchmarking, Domain Adaptation | —Unverified | 0 |
| DSLOB: A Synthetic Limit Order Book Dataset for Benchmarking Forecasting Algorithms under Distributional Shift | Nov 17, 2022 | Benchmarking, Time Series | —Unverified | 0 |
| Dual Encoder-Decoder based Generative Adversarial Networks for Disentangled Facial Representation Learning | Sep 19, 2019 | Benchmarking, Decoder | —Unverified | 0 |
| Dual Task Framework for Improving Persona-grounded Dialogue Dataset | Feb 11, 2022 | Benchmarking | —Unverified | 0 |
| DyFEn: Agent-Based Fee Setting in Payment Channel Networks | Oct 15, 2022 | Benchmarking, Deep Reinforcement Learning | —Unverified | 0 |
| Dyna-bAbI: unlocking bAbI's potential with dynamic synthetic benchmarking | Nov 30, 2021 | Benchmarking, Natural Language Understanding | —Unverified | 0 |
| Dyna-bAbI: unlocking bAbI’s potential with dynamic synthetic benchmarking | Jul 1, 2022 | Benchmarking, Natural Language Understanding | —Unverified | 0 |
| Dynabench: Rethinking Benchmarking in NLP | Apr 7, 2021 | Benchmarking | —Unverified | 0 |
| Dynaboard: An Evaluation-As-A-Service Platform for Holistic Next-Generation Benchmarking | May 21, 2021 | Benchmarking | —Unverified | 0 |
| Dynamic benchmarking framework for LLM-based conversational data capture | Feb 4, 2025 | Benchmarking | —Unverified | 0 |
| Dynamic Benchmarking of Masked Language Models on Temporal Concept Drift with Multiple Views | Feb 23, 2023 | Benchmarking | —Unverified | 0 |
| Dynamic Benchmarking of Reasoning Capabilities in Code Large Language Models Under Data Contamination | Mar 6, 2025 | Benchmarking | —Unverified | 0 |