| EVOPS Benchmark: Evaluation of Plane Segmentation from RGBD and LiDAR Data | Apr 12, 2022 | BenchmarkingSegmentation | —Unverified | 0 |
| EXACT: Towards a platform for empirically benchmarking Machine Learning model explanation methods | May 20, 2024 | BenchmarkingExplainable artificial intelligence | —Unverified | 0 |
| Explicitly Multi-Modal Benchmarks for Multi-Objective Optimization | Oct 7, 2021 | Benchmarking | —Unverified | 0 |
| CALF: Benchmarking Evaluation of LFQA Using Chinese Examinations | Oct 2, 2024 | BenchmarkingLong Form Question Answering | —Unverified | 0 |
| Benchmarking a foundation LLM on its ability to re-label structure names in accordance with the AAPM TG-263 report | Oct 5, 2023 | Benchmarking | —Unverified | 0 |
| CAFA-evaluator: A Python Tool for Benchmarking Ontological Classification Methods | Oct 10, 2023 | BenchmarkingPrediction | —Unverified | 0 |
| Analyzing Multilingual Competency of LLMs in Multi-Turn Instruction Following: A Case Study of Arabic | Oct 23, 2023 | BenchmarkingInstruction Following | —Unverified | 0 |
| Quantum Similarity Testing with Convolutional Neural Networks | Nov 3, 2022 | Benchmarking | —Unverified | 0 |
| Ev-Layout: A Large-scale Event-based Multi-modal Dataset for Indoor Layout Estimation and Tracking | Mar 11, 2025 | Benchmarking | —Unverified | 0 |
| Byzantine-Robust and Communication-Efficient Distributed Learning via Compressed Momentum Filtering | Sep 13, 2024 | BenchmarkingBinary Classification | —Unverified | 0 |