| Benchmarking BioRelEx for Entity Tagging and Relation Extraction | May 31, 2020 | BenchmarkingRelation | —Unverified | 0 |
| A Deep Q-Learning Method for Downlink Power Allocation in Multi-Cell Networks | Apr 30, 2019 | BenchmarkingDeep Reinforcement Learning | —Unverified | 0 |
| DiPCo -- Dinner Party Corpus | Sep 30, 2019 | Benchmarking | —Unverified | 0 |
| Benchmarking Biopharmaceuticals Retrieval-Augmented Generation Evaluation | Apr 15, 2025 | BenchmarkingQuestion Answering | —Unverified | 0 |
| Benchmarking Biomedical Nested NER and Relation Extraction Models | Oct 16, 2021 | BenchmarkingNER | —Unverified | 0 |
| Deep Patent Landscaping Model Using Transformer and Graph Embedding | Mar 14, 2019 | BenchmarkingGraph Embedding | —Unverified | 0 |
| A New Approach for Image Authentication Framework for Media Forensics Purpose | Oct 3, 2021 | AstronomyBenchmarking | —Unverified | 0 |
| Benchmarking Bias in Large Language Models during Role-Playing | Nov 1, 2024 | BenchmarkingFairness | —Unverified | 0 |
| Abnormality-Driven Representation Learning for Radiology Imaging | Nov 25, 2024 | BenchmarkingContrastive Learning | —Unverified | 0 |
| DIMCIM: A Quantitative Evaluation Framework for Default-mode Diversity and Generalization in Text-to-Image Generative Models | Jun 5, 2025 | BenchmarkingDiversity | —Unverified | 0 |
| DiPlomat: A Dialogue Dataset for Situated Pragmatic Reasoning | Jun 15, 2023 | BenchmarkingConversational Question Answering | —Unverified | 0 |
| An Evolutionary Algorithm For the Vehicle Routing Problem with Drones with Interceptions | Sep 21, 2024 | BenchmarkingScheduling | —Unverified | 0 |
| clem:todd: A Framework for the Systematic Benchmarking of LLM-Based Task-Oriented Dialogue System Realisations | May 8, 2025 | BenchmarkingTask-Oriented Dialogue Systems | —Unverified | 0 |
| An evaluation framework for comparing causal inference models | Aug 31, 2022 | BenchmarkingCausal Inference | —Unverified | 0 |
| Benchmarking Bayesian Causal Discovery Methods for Downstream Treatment Effect Estimation | Jul 11, 2023 | BenchmarkingCausal Discovery | —Unverified | 0 |
| DIG: A Turnkey Library for Diving into Graph Deep Learning Research | Mar 23, 2021 | BenchmarkingDeep Learning | —Unverified | 0 |
| Benchmarking Azerbaijani Neural Machine Translation | Jul 29, 2022 | BenchmarkingDomain Generalization | —Unverified | 0 |
| Classification of the Fashion-MNIST Dataset on a Quantum Computer | Mar 4, 2024 | BenchmarkingQuantum Machine Learning | —Unverified | 0 |
| Benchmarking Critical Questions Generation: A Challenging Reasoning Task for Large Language Models | May 16, 2025 | Benchmarking | —Unverified | 0 |
| Benchmarking AutoML Frameworks for Disease Prediction Using Medical Claims | Jul 22, 2021 | AutoMLBenchmarking | —Unverified | 0 |
| Class-agnostic Object Detection | Nov 28, 2020 | BenchmarkingClass-agnostic Object Detection | —Unverified | 0 |
| CLASH: Evaluating Language Models on Judging High-Stakes Dilemmas from Multiple Perspectives | Apr 15, 2025 | Benchmarking | —Unverified | 0 |
| A deep convolutional neural network model for rapid prediction of fluvial flood inundation | Jun 20, 2020 | BenchmarkingComputational Efficiency | —Unverified | 0 |
| Diffusion-Driven Domain Adaptation for Generating 3D Molecules | Apr 1, 2024 | BenchmarkingDecoder | —Unverified | 0 |
| DiLiGenT102: A Photometric Stereo Benchmark Dataset With Controlled Shape and Material Variation | Jan 1, 2022 | Benchmarking | —Unverified | 0 |
| Disability prediction in multiple sclerosis using performance outcome measures and demographic data | Apr 8, 2022 | BenchmarkingBIG-bench Machine Learning | —Unverified | 0 |
| Discriminative Link Prediction using Local Links, Node Features and Community Structure | Oct 17, 2013 | BenchmarkingClustering | —Unverified | 0 |
| CLAMS: A Cluster Ambiguity Measure for Estimating Perceptual Variability in Visual Clustering | Aug 1, 2023 | BenchmarkingClustering | —Unverified | 0 |
| Benchmarking a wide range of optimisers for solving the Fermi-Hubbard model using the variational quantum eigensolver | Nov 20, 2024 | Benchmarking | —Unverified | 0 |
| Classification and Retrieval of Digital Pathology Scans: A New Dataset | May 22, 2017 | BenchmarkingGeneral Classification | —Unverified | 0 |
| A biologically-inspired multi-modal evaluation of molecular generative machine learning | Aug 20, 2022 | BenchmarkingDrug Discovery | —Unverified | 0 |
| Classifying neuromorphic data using a deep learning framework for image classification | Jul 2, 2018 | BenchmarkingDeep Learning | —Unverified | 0 |
| DIF: A Framework for Benchmarking and Verifying Implicit Bias in LLMs | May 15, 2025 | BenchmarkingFairness | —Unverified | 0 |
| Benchmarking Automatic Speech Recognition coupled LLM Modules for Medical Diagnostics | Feb 18, 2025 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| DI-BENCH: Benchmarking Large Language Models on Dependency Inference with Testable Repositories at Scale | Jan 23, 2025 | Benchmarking | —Unverified | 0 |
| Diff5T: Benchmarking Human Brain Diffusion MRI with an Extensive 5.0 Tesla K-Space and Spatial Dataset | Dec 9, 2024 | BenchmarkingDiffusion MRI | —Unverified | 0 |
| CityLearn v2: Energy-flexible, resilient, occupant-centric, and carbon-aware management of grid-interactive communities | May 2, 2024 | BenchmarkingManagement | —Unverified | 0 |
| Benchmarking Bayesian Deep Learning on Diabetic Retinopathy Detection Tasks | Nov 23, 2022 | BenchmarkingDeep Learning | —Unverified | 0 |
| Addressing the Real-world Class Imbalance Problem in Dermatology | Oct 9, 2020 | BenchmarkingFew-Shot Learning | —Unverified | 0 |
| CISOL: An Open and Extensible Dataset for Table Structure Recognition in the Construction Industry | Jan 26, 2025 | BenchmarkingObject Detection | —Unverified | 0 |
| Benchmarking Automated Review Response Generation for the Hospitality Domain | Dec 1, 2020 | BenchmarkingDomain Adaptation | —Unverified | 0 |
| Benchmarking bias: Expanding clinical AI model card to incorporate bias reporting of social and non-social factors | Nov 21, 2023 | Benchmarking | —Unverified | 0 |
| Dialogue Games for Benchmarking Language Understanding: Motivation, Taxonomy, Strategy | Apr 14, 2023 | Benchmarking | —Unverified | 0 |
| CLIRudit: Cross-Lingual Information Retrieval of Scientific Documents | Apr 22, 2025 | BenchmarkingCross-Lingual Information Retrieval | —Unverified | 0 |
| DiffBody: Human Body Restoration by Imagining with Generative Diffusion Prior | Apr 4, 2024 | BenchmarkingImage Restoration | —Unverified | 0 |
| CLLMate: A Multimodal Benchmark for Weather and Climate Events Forecasting | Sep 27, 2024 | ArticlesBenchmarking | —Unverified | 0 |
| Benchmarking Automated Machine Learning Methods for Price Forecasting Applications | Apr 28, 2023 | AutoMLBenchmarking | —Unverified | 0 |
| CIMLA: Interpretable AI for inference of differential causal networks | Apr 25, 2023 | Benchmarking | —Unverified | 0 |
| CloudifierNet -- Deep Vision Models for Artificial Image Processing | Nov 4, 2019 | BenchmarkingCode Generation | —Unverified | 0 |
| CIFAR-10-Warehouse: Broad and More Realistic Testbeds in Model Generalization Analysis | Oct 6, 2023 | BenchmarkingDomain Generalization | —Unverified | 0 |