| Benchmarking with MIMIC-IV, an irregular, spare clinical time series dataset | Jan 27, 2024 | BenchmarkingTime Series | —Unverified | 0 |
| A Video is Worth 10,000 Words: Training and Benchmarking with Diverse Captions for Better Long Video Retrieval | Nov 30, 2023 | BenchmarkingRetrieval | —Unverified | 0 |
| Alpha Excel Benchmark | May 7, 2025 | Benchmarking | —Unverified | 0 |
| Benchmarking Waitlist Mortality Prediction in Heart Transplantation Through Time-to-Event Modeling using New Longitudinal UNOS Dataset | Jul 9, 2025 | BenchmarkingDecision Making | —Unverified | 0 |
| Benchmarking VLMs' Reasoning About Persuasive Atypical Images | Sep 16, 2024 | BenchmarkingObject Recognition | —Unverified | 0 |
| A Bayesian Committee Machine Potential for Oxygen-containing Organic Compounds | Mar 2, 2024 | BenchmarkingPosition | —Unverified | 0 |
| Benchmarking Visual-Inertial Deep Multimodal Fusion for Relative Pose Regression and Odometry-aided Absolute Pose Regression | Aug 1, 2022 | Benchmarkingregression | —Unverified | 0 |
| AutoWS-Bench-101: Benchmarking Automated Weak Supervision with 100 Labels | Aug 30, 2022 | Benchmarking | —Unverified | 0 |
| Benchmarking Vision Language Models on German Factual Data | Apr 15, 2025 | Benchmarking | —Unverified | 0 |
| Auto-tuning TensorFlow Threading Model for CPU Backend | Dec 4, 2018 | BenchmarkingCPU | —Unverified | 0 |