| Creating and Leveraging a Synthetic Dataset of Cloud Optical Thickness Measures for Cloud Detection in MSI | Nov 23, 2023 | Benchmarking, Cloud Detection | Code Available | 0 | 5 |
| CREPO: An Open Repository to Benchmark Credal Network Algorithms | May 10, 2021 | Benchmarking | Code Available | 0 | 5 |
| AdamZ: An Enhanced Optimisation Method for Neural Network Training | Nov 22, 2024 | Benchmarking | Code Available | 0 | 5 |
| Improvements & Evaluations on the MLCommons CloudMask Benchmark | Mar 7, 2024 | Benchmarking | Code Available | 0 | 5 |
| Bias Analysis and Mitigation in the Evaluation of Authorship Verification | Jul 1, 2019 | Authorship Verification, Benchmarking | Code Available | 0 | 5 |
| BED: Bi-Encoder-Based Detectors for Out-of-Distribution Detection | Jun 15, 2023 | Benchmarking, Out-of-Distribution Detection | Code Available | 0 | 5 |
| Critical review of conformational B-cell epitope prediction methods | Jan 10, 2023 | Benchmarking, Drug Design | Code Available | 0 | 5 |
| Illusory VQA: Benchmarking and Enhancing Multimodal Models on Visual Illusions | Dec 11, 2024 | Benchmarking, Question Answering | Code Available | 0 | 5 |
| BEARD: Benchmarking the Adversarial Robustness for Dataset Distillation | Nov 14, 2024 | Adversarial Attack, Adversarial Robustness | Code Available | 0 | 5 |
| AMQA: An Adversarial Dataset for Benchmarking Bias of LLMs in Medicine and Healthcare | May 26, 2025 | Benchmarking, Medical Diagnosis | Code Available | 0 | 5 |