| Chain of LoRA: Efficient Fine-tuning of Language Models via Residual Learning | Jan 8, 2024 | BenchmarkingCoLA | —Unverified | 0 |
| Audio Turing Test: Benchmarking the Human-likeness of Large Language Model-based Text-to-Speech Systems in Chinese | May 16, 2025 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| Egocentric Human-Object Interaction Detection: A New Benchmark and Method | Jun 17, 2025 | BenchmarkingHuman-Object Interaction Detection | —Unverified | 0 |
| EVOPS Benchmark: Evaluation of Plane Segmentation from RGBD and LiDAR Data | Apr 12, 2022 | BenchmarkingSegmentation | —Unverified | 0 |
| CXPMRG-Bench: Pre-training and Benchmarking for X-ray Medical Report Generation on CheXpert Plus Dataset | Oct 1, 2024 | BenchmarkingContrastive Learning | —Unverified | 0 |
| C-FedRAG: A Confidential Federated Retrieval-Augmented Generation System | Dec 17, 2024 | BenchmarkingRAG | —Unverified | 0 |
| CzechLynx: A Dataset for Individual Identification and Pose Estimation of the Eurasian Lynx | Jun 5, 2025 | 2D Pose EstimationBenchmarking | —Unverified | 0 |
| CETBench: A Novel Dataset constructed via Transformations over Programs for Benchmarking LLMs for Code-Equivalence Checking | Jun 4, 2025 | BenchmarkingCode Generation | —Unverified | 0 |
| Benchmarking and Improving Generator-Validator Consistency of Language Models | Oct 3, 2023 | BenchmarkingInstruction Following | —Unverified | 0 |
| Certifying almost all quantum states with few single-qubit measurements | Apr 10, 2024 | AllBenchmarking | —Unverified | 0 |