| Benchmarking YOLOv8 for Optimal Crack Detection in Civil Infrastructure | Jan 12, 2025 | BenchmarkingHyperparameter Optimization | —Unverified | 0 |
| AV-Reasoner: Improving and Benchmarking Clue-Grounded Audio-Visual Counting for MLLMs | Jun 5, 2025 | BenchmarkingVideo Understanding | —Unverified | 0 |
| Benchmarking XAI Explanations with Human-Aligned Evaluations | Nov 4, 2024 | Benchmarking | —Unverified | 0 |
| A critical look at the current train/test split in machine learning | Jun 8, 2021 | Active LearningBenchmarking | —Unverified | 0 |
| Forecasting NIFTY 50 benchmark Index using Seasonal ARIMA time series models | Jan 9, 2020 | BenchmarkingTime Series | —Unverified | 0 |
| FORLAPS: An Innovative Data-Driven Reinforcement Learning Approach for Prescriptive Process Monitoring | Jan 17, 2025 | BenchmarkingData Augmentation | —Unverified | 0 |
| Found in Translation: Measuring Multilingual LLM Consistency as Simple as Translate then Evaluate | May 28, 2025 | Benchmarking | —Unverified | 0 |
| Benchmarking with MIMIC-IV, an irregular, spare clinical time series dataset | Jan 27, 2024 | BenchmarkingTime Series | —Unverified | 0 |
| A Video is Worth 10,000 Words: Training and Benchmarking with Diverse Captions for Better Long Video Retrieval | Nov 30, 2023 | BenchmarkingRetrieval | —Unverified | 0 |
| Alpha Excel Benchmark | May 7, 2025 | Benchmarking | —Unverified | 0 |
| Benchmarking Waitlist Mortality Prediction in Heart Transplantation Through Time-to-Event Modeling using New Longitudinal UNOS Dataset | Jul 9, 2025 | BenchmarkingDecision Making | —Unverified | 0 |
| Benchmarking VLMs' Reasoning About Persuasive Atypical Images | Sep 16, 2024 | BenchmarkingObject Recognition | —Unverified | 0 |
| A Bayesian Committee Machine Potential for Oxygen-containing Organic Compounds | Mar 2, 2024 | BenchmarkingPosition | —Unverified | 0 |
| Benchmarking Visual-Inertial Deep Multimodal Fusion for Relative Pose Regression and Odometry-aided Absolute Pose Regression | Aug 1, 2022 | Benchmarkingregression | —Unverified | 0 |
| AutoWS-Bench-101: Benchmarking Automated Weak Supervision with 100 Labels | Aug 30, 2022 | Benchmarking | —Unverified | 0 |
| Benchmarking Vision Language Models on German Factual Data | Apr 15, 2025 | Benchmarking | —Unverified | 0 |
| Auto-tuning TensorFlow Threading Model for CPU Backend | Dec 4, 2018 | BenchmarkingCPU | —Unverified | 0 |
| ForamViT-GAN: Exploring New Paradigms in Deep Learning for Micropaleontological Image Analysis | Apr 9, 2023 | BenchmarkingDeep Learning | —Unverified | 0 |
| Benchmarking Vision Language Models for Cultural Understanding | Jul 15, 2024 | BenchmarkingQuestion Answering | —Unverified | 0 |
| ALP: Action-Aware Embodied Learning for Perception | Jun 16, 2023 | Benchmarkingobject-detection | —Unverified | 0 |
| Autoregressive Stochastic Clock Jitter Compensation in Analog-to-Digital Converters | May 8, 2025 | Benchmarking | —Unverified | 0 |
| A critical analysis of metrics used for measuring progress in artificial intelligence | Aug 6, 2020 | Benchmarking | —Unverified | 0 |
| Benchmarking Vision Foundation Models for Input Monitoring in Autonomous Driving | Jan 14, 2025 | Autonomous DrivingBenchmarking | —Unverified | 0 |
| Benchmarking Vision-Based Object Tracking for USVs in Complex Maritime Environments | Dec 10, 2024 | Benchmarkingobject-detection | —Unverified | 0 |
| Benchmarking Video Frame Interpolation | Mar 25, 2024 | BenchmarkingComputational Efficiency | —Unverified | 0 |