| Chain of LoRA: Efficient Fine-tuning of Language Models via Residual Learning | Jan 8, 2024 | BenchmarkingCoLA | —Unverified | 0 |
| Benchmarking Ethical and Safety Risks of Healthcare LLMs in China-Toward Systemic Governance under Healthy China 2030 | May 12, 2025 | BenchmarkingEthics | —Unverified | 0 |
| Audio Turing Test: Benchmarking the Human-likeness of Large Language Model-based Text-to-Speech Systems in Chinese | May 16, 2025 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| Emerging Approaches for THz Array Imaging: A Tutorial Review and Software Tool | Sep 16, 2023 | BenchmarkingImage Super-Resolution | —Unverified | 0 |
| CXPMRG-Bench: Pre-training and Benchmarking for X-ray Medical Report Generation on CheXpert Plus Dataset | Oct 1, 2024 | BenchmarkingContrastive Learning | —Unverified | 0 |
| C-FedRAG: A Confidential Federated Retrieval-Augmented Generation System | Dec 17, 2024 | BenchmarkingRAG | —Unverified | 0 |
| CzechLynx: A Dataset for Individual Identification and Pose Estimation of the Eurasian Lynx | Jun 5, 2025 | 2D Pose EstimationBenchmarking | —Unverified | 0 |
| CETBench: A Novel Dataset constructed via Transformations over Programs for Benchmarking LLMs for Code-Equivalence Checking | Jun 4, 2025 | BenchmarkingCode Generation | —Unverified | 0 |
| Benchmarking and Improving Generator-Validator Consistency of Language Models | Oct 3, 2023 | BenchmarkingInstruction Following | —Unverified | 0 |
| DACOS-A Manually Annotated Dataset of Code Smells | Mar 15, 2023 | Benchmarking | —Unverified | 0 |
| DACSA: A large-scale Dataset for Automatic summarization of Catalan and Spanish newspaper Articles | Jul 1, 2022 | Abstractive Text SummarizationArticles | —Unverified | 0 |
| DailyQA: A Benchmark to Evaluate Web Retrieval Augmented LLMs Based on Capturing Real-World Changes | May 22, 2025 | BenchmarkingRAG | —Unverified | 0 |
| Certifying almost all quantum states with few single-qubit measurements | Apr 10, 2024 | AllBenchmarking | —Unverified | 0 |
| Danish Airs and Grounds: A Dataset for Aerial-to-Street-Level Place Recognition and Localization | Feb 3, 2022 | 3D ReconstructionBenchmarking | —Unverified | 0 |
| DarkBench: Benchmarking Dark Patterns in Large Language Models | Mar 13, 2025 | Benchmarking | —Unverified | 0 |
| DASB -- Discrete Audio and Speech Benchmark | Jun 20, 2024 | BenchmarkingEmotion Recognition | —Unverified | 0 |
| Data Analysis in the Era of Generative AI | Sep 27, 2024 | Benchmarking | —Unverified | 0 |
| Data and its (dis)contents: A survey of dataset development and use in machine learning research | Dec 9, 2020 | BenchmarkingBIG-bench Machine Learning | —Unverified | 0 |
| Data Augmentation for Continual RL via Adversarial Gradient Episodic Memory | Aug 24, 2024 | BenchmarkingData Augmentation | —Unverified | 0 |
| Data Augmentation for Traffic Classification | Jan 19, 2024 | BenchmarkingClassification | —Unverified | 0 |
| Data Collection of Real-Life Knowledge Work in Context: The RLKWiC Dataset | Apr 16, 2024 | BenchmarkingManagement | —Unverified | 0 |
| Data-driven Approach for Static Hedging of Exchange Traded Options | Feb 1, 2023 | BenchmarkingInterpretable Machine Learning | —Unverified | 0 |
| Certified Adversarial Defenses Meet Out-of-Distribution Corruptions: Benchmarking Robustness and Simple Baselines | Dec 1, 2021 | Adversarial RobustnessBenchmarking | —Unverified | 0 |
| Data-driven inventory management for new products: An adjusted Dyna-Q approach with transfer learning | Jan 14, 2025 | BenchmarkingManagement | —Unverified | 0 |
| An efficient and perceptually motivated auditory neural encoding and decoding algorithm for spiking neural networks | Sep 3, 2019 | Benchmarkingspeech-recognition | —Unverified | 0 |