| Distribution-Based Invariant Deep Networks for Learning Meta-Features | Jun 24, 2020 | BenchmarkingGeneral Classification | —Unverified | 0 |
| Sensitivity analysis and experimental evaluation of PID-like continuous sliding mode control | Aug 13, 2022 | BenchmarkingSensitivity | —Unverified | 0 |
| Diverse Community Data for Benchmarking Data Privacy Algorithms | Jun 20, 2023 | Benchmarking | —Unverified | 0 |
| DLBricks: Composable Benchmark Generation to Reduce Deep Learning Benchmarking Effort on CPUs (Extended) | Nov 18, 2019 | BenchmarkingCPU | —Unverified | 0 |
| DLUE: Benchmarking Document Language Understanding | May 16, 2023 | BenchmarkingDocument Classification | —Unverified | 0 |
| DNR Bench: Benchmarking Over-Reasoning in Reasoning LLMs | Mar 20, 2025 | BenchmarkingHallucination | —Unverified | 0 |
| A Sober Look at the Robustness of CLIPs to Spurious Features | Mar 18, 2024 | Benchmarking | —Unverified | 0 |
| Does AI for science need another ImageNet Or totally different benchmarks? A case study of machine learning force fields | Aug 11, 2023 | Benchmarking | —Unverified | 0 |
| Does imputation matter? Benchmark for predictive models | Jul 6, 2020 | BenchmarkingBIG-bench Machine Learning | —Unverified | 0 |
| Domain Adaptation for Arabic Machine Translation: The Case of Financial Texts | Sep 22, 2023 | ArticlesBenchmarking | —Unverified | 0 |