| A Look at the Evaluation Setup of the M5 Forecasting Competition | Aug 8, 2021 | BenchmarkingDecision Making | —Unverified | 0 |
| From 2D to 3D: Re-thinking Benchmarking of Monocular Depth Prediction | Mar 15, 2022 | 3D geometryBenchmarking | —Unverified | 0 |
| From Blind Solvers to Logical Thinkers: Benchmarking LLMs' Logical Integrity on Faulty Mathematical Problems | Oct 24, 2024 | BenchmarkingCommon Sense Reasoning | —Unverified | 0 |
| Benchmarking Unsupervised Outlier Detection with Realistic Synthetic Data | Apr 15, 2020 | BenchmarkingOutlier Detection | —Unverified | 0 |
| A Comprehensive Survey on Retrieval Methods in Recommender Systems | Jul 11, 2024 | BenchmarkingRecommendation Systems | —Unverified | 0 |
| ALOJA-ML: A Framework for Automating Characterization and Knowledge Discovery in Hadoop Deployments | Nov 6, 2015 | Anomaly DetectionBenchmarking | —Unverified | 0 |
| Benchmarking unsupervised near-duplicate image detection | Jul 3, 2019 | BenchmarkingBinary Classification | —Unverified | 0 |
| Abasy Atlas v2.2: The most comprehensive and up-to-date inventory of meta-curated, historical, bacterial regulatory networks, their completeness and system-level characterization | May 5, 2020 | Benchmarking | —Unverified | 0 |
| FRED: The Florence RGB-Event Drone Dataset | Jun 5, 2025 | BenchmarkingTrajectory Forecasting | —Unverified | 0 |
| Benchmarking Unsupervised Anomaly Detection and Localization | May 30, 2022 | Anomaly DetectionBenchmarking | —Unverified | 0 |
| Benchmarking Unified Face Attack Detection via Hierarchical Prompt Tuning | May 19, 2025 | Benchmarking | —Unverified | 0 |
| Automating Code Adaptation for MLOps -- A Benchmarking Study on LLMs | May 10, 2024 | BenchmarkingHyperparameter Optimization | —Unverified | 0 |
| Benchmarking Uncertainty Quantification on Biosignal Classification Tasks under Dataset Shift | Dec 16, 2021 | BenchmarkingClassification | —Unverified | 0 |
| Automatic vehicle trajectory data reconstruction at scale | Dec 15, 2022 | Benchmarkingvehicle detection | —Unverified | 0 |
| ALOJA: A Framework for Benchmarking and Predictive Analytics in Big Data Deployments | Nov 6, 2015 | Anomaly DetectionBenchmarking | —Unverified | 0 |
| Benchmarking Ultra-Low-Power μNPUs | Mar 28, 2025 | Benchmarking | —Unverified | 0 |
| Automatic Target Recognition on Synthetic Aperture Radar Imagery: A Survey | Jul 4, 2020 | BenchmarkingSurvey | —Unverified | 0 |
| Benchmarking Ultra-High-Definition Image Super-Resolution | Jan 1, 2021 | 4k8k | —Unverified | 0 |
| Almost Equivariance via Lie Algebra Convolutions | Oct 19, 2023 | Benchmarking | —Unverified | 0 |
| Benchmarking performance, explainability, and evaluation strategies of vision-language models for surgery: Challenges and opportunities | May 16, 2025 | Benchmarking | —Unverified | 0 |
| Benchmarking Twitter Sentiment Analysis Tools | May 1, 2014 | BenchmarkingDecision Making | —Unverified | 0 |
| MultiTrust: A Comprehensive Benchmark Towards Trustworthy Multimodal Large Language Models | Jun 11, 2024 | BenchmarkingFairness | —Unverified | 0 |
| Automatic segmenting teeth in X-ray images: Trends, a novel data set, benchmarking and future perspectives | Feb 9, 2018 | BenchmarkingImage Segmentation | —Unverified | 0 |
| Benchmarking Transformers-based models on French Spoken Language Understanding tasks | Jul 19, 2022 | BenchmarkingSpoken Language Understanding | —Unverified | 0 |
| Scaling laws in global corporations as a benchmarking approach to assess environmental performance | Jun 7, 2022 | BenchmarkingOpen-Ended Question Answering | —Unverified | 0 |