| Challenges in Benchmarking Stream Learning Algorithms with Real-world Data | Apr 30, 2020 | Benchmarking | —Unverified | 0 |
| Challenges and Pitfalls of Machine Learning Evaluation and Benchmarking | Apr 29, 2019 | BenchmarkingBIG-bench Machine Learning | —Unverified | 0 |
| Benchmarking Evaluation Metrics for Code-Switching Automatic Speech Recognition | Nov 22, 2022 | Automatic Speech RecognitionAutomatic Speech Recognition (ASR) | —Unverified | 0 |
| Benchmarking and Learning Multi-Dimensional Quality Evaluator for Text-to-3D Generation | Dec 15, 2024 | 3D GenerationBenchmarking | —Unverified | 0 |
| CXPMRG-Bench: Pre-training and Benchmarking for X-ray Medical Report Generation on CheXpert Plus Dataset | Oct 1, 2024 | BenchmarkingContrastive Learning | —Unverified | 0 |
| Challenges and perspectives in computational deconvolution of genomics data | Nov 21, 2022 | Benchmarking | —Unverified | 0 |
| CzechLynx: A Dataset for Individual Identification and Pose Estimation of the Eurasian Lynx | Jun 5, 2025 | 2D Pose EstimationBenchmarking | —Unverified | 0 |
| Benchmarking and In-depth Performance Study of Large Language Models on Habana Gaudi Processors | Sep 29, 2023 | BenchmarkingComputational Efficiency | —Unverified | 0 |
| AN ELIXIR FOR BLOCKCHAIN SCALABILITY WITH CHANNEL BASED CLUSTERED SHARDING | Dec 20, 2023 | Benchmarking | —Unverified | 0 |
| DACOS-A Manually Annotated Dataset of Code Smells | Mar 15, 2023 | Benchmarking | —Unverified | 0 |
| DACSA: A large-scale Dataset for Automatic summarization of Catalan and Spanish newspaper Articles | Jul 1, 2022 | Abstractive Text SummarizationArticles | —Unverified | 0 |
| DailyQA: A Benchmark to Evaluate Web Retrieval Augmented LLMs Based on Capturing Real-World Changes | May 22, 2025 | BenchmarkingRAG | —Unverified | 0 |
| Challenges and Advancements in Modeling Shock Fronts with Physics-Informed Neural Networks: A Review and Benchmarking Study | Mar 14, 2025 | Benchmarking | —Unverified | 0 |
| Danish Airs and Grounds: A Dataset for Aerial-to-Street-Level Place Recognition and Localization | Feb 3, 2022 | 3D ReconstructionBenchmarking | —Unverified | 0 |
| DarkBench: Benchmarking Dark Patterns in Large Language Models | Mar 13, 2025 | Benchmarking | —Unverified | 0 |
| DASB -- Discrete Audio and Speech Benchmark | Jun 20, 2024 | BenchmarkingEmotion Recognition | —Unverified | 0 |
| Data Analysis in the Era of Generative AI | Sep 27, 2024 | Benchmarking | —Unverified | 0 |
| Data and its (dis)contents: A survey of dataset development and use in machine learning research | Dec 9, 2020 | BenchmarkingBIG-bench Machine Learning | —Unverified | 0 |
| Data Augmentation for Continual RL via Adversarial Gradient Episodic Memory | Aug 24, 2024 | BenchmarkingData Augmentation | —Unverified | 0 |
| Data Augmentation for Traffic Classification | Jan 19, 2024 | BenchmarkingClassification | —Unverified | 0 |
| Data Collection of Real-Life Knowledge Work in Context: The RLKWiC Dataset | Apr 16, 2024 | BenchmarkingManagement | —Unverified | 0 |
| Data-driven Approach for Static Hedging of Exchange Traded Options | Feb 1, 2023 | BenchmarkingInterpretable Machine Learning | —Unverified | 0 |
| Challenge Results Are Not Reproducible | Jul 14, 2023 | BenchmarkingImage Segmentation | —Unverified | 0 |
| Data-driven inventory management for new products: An adjusted Dyna-Q approach with transfer learning | Jan 14, 2025 | BenchmarkingManagement | —Unverified | 0 |
| A Dataset Similarity Evaluation Framework for Wireless Communications and Sensing | Dec 7, 2024 | BenchmarkingDimensionality Reduction | —Unverified | 0 |
| Data-driven surrogate modelling and benchmarking for process equipment | Mar 13, 2020 | Active LearningBenchmarking | —Unverified | 0 |
| Data-Driven Target Localization: Benchmarking Gradient Descent Using the Cramer-Rao Bound | Jan 20, 2024 | Benchmarking | —Unverified | 0 |
| Benchmarking Federated Machine Unlearning methods for Tabular Data | Apr 1, 2025 | BenchmarkingComputational Efficiency | —Unverified | 0 |
| ChakmaNMT: A Low-resource Machine Translation On Chakma Language | Oct 14, 2024 | BenchmarkingMachine Translation | —Unverified | 0 |
| Chain of LoRA: Efficient Fine-tuning of Language Models via Residual Learning | Jan 8, 2024 | BenchmarkingCoLA | —Unverified | 0 |
| Audio Turing Test: Benchmarking the Human-likeness of Large Language Model-based Text-to-Speech Systems in Chinese | May 16, 2025 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| End-to-End Neural Ranking for eCommerce Product Search: an application of task models and textual embeddings | Jun 19, 2018 | Benchmarking | —Unverified | 0 |
| C-FedRAG: A Confidential Federated Retrieval-Augmented Generation System | Dec 17, 2024 | BenchmarkingRAG | —Unverified | 0 |
| CETBench: A Novel Dataset constructed via Transformations over Programs for Benchmarking LLMs for Code-Equivalence Checking | Jun 4, 2025 | BenchmarkingCode Generation | —Unverified | 0 |
| Benchmarking and Improving Generator-Validator Consistency of Language Models | Oct 3, 2023 | BenchmarkingInstruction Following | —Unverified | 0 |
| Certifying almost all quantum states with few single-qubit measurements | Apr 10, 2024 | AllBenchmarking | —Unverified | 0 |
| A Platform for Event Extraction in Hindi | May 1, 2020 | ArticlesBenchmarking | —Unverified | 0 |
| DB3V: A Dialect Dominated Dataset of Bird Vocalisation for Cross-corpus Bird Species Recognition | Jun 11, 2024 | BenchmarkingCross-corpus | —Unverified | 0 |
| DBsurf: A Discrepancy Based Method for Discrete Stochastic Gradient Estimation | Sep 7, 2023 | BenchmarkingNeural Architecture Search | —Unverified | 0 |
| Certified Adversarial Defenses Meet Out-of-Distribution Corruptions: Benchmarking Robustness and Simple Baselines | Dec 1, 2021 | Adversarial RobustnessBenchmarking | —Unverified | 0 |
| An efficient and perceptually motivated auditory neural encoding and decoding algorithm for spiking neural networks | Sep 3, 2019 | Benchmarkingspeech-recognition | —Unverified | 0 |
| DDR-ID: Dual Deep Reconstruction Networks Based Image Decomposition for Anomaly Detection | Jul 18, 2020 | Adversarial AttackAdversarial Attack Detection | —Unverified | 0 |
| CellCycleGAN: Spatiotemporal Microscopy Image Synthesis of Cell Populations using Statistical Shape Models and Conditional GANs | Oct 22, 2020 | BenchmarkingCell Segmentation | —Unverified | 0 |
| DeAR: Debiasing Vision-Language Models with Additive Residuals | Mar 18, 2023 | AttributeBenchmarking | —Unverified | 0 |
| CDTB: A Color and Depth Visual Object Tracking Dataset and Benchmark | Jul 1, 2019 | BenchmarkingObject Tracking | —Unverified | 0 |
| DECASTE: Unveiling Caste Stereotypes in Large Language Models through Multi-Dimensional Bias Analysis | May 20, 2025 | BenchmarkingFairness | —Unverified | 0 |
| An efficiency analysis of Spanish airports | Nov 8, 2023 | Benchmarking | —Unverified | 0 |
| Decentralized Federated Learning on the Edge over Wireless Mesh Networks | Nov 2, 2023 | BenchmarkingFederated Learning | —Unverified | 0 |
| 1-D Convlutional Neural Networks for the Analysis of Pupil Size Variations in Scotopic Conditions | Feb 6, 2020 | BenchmarkingBinary Classification | —Unverified | 0 |
| Energy-Conscious LLM Decoding: Impact of Text Generation Strategies on GPU Energy Consumption | Feb 17, 2025 | BenchmarkingCode Summarization | —Unverified | 0 |