| Fluorescent Neuronal Cells v2: Multi-Task, Multi-Format Annotations for Deep Learning in Microscopy | Jul 26, 2023 | Benchmarkingobject-detection | —Unverified | 0 |
| FMBench: Benchmarking Fairness in Multimodal Large Language Models on Medical Tasks | Oct 1, 2024 | BenchmarkingFairness | —Unverified | 0 |
| uto\!L: Autonomous Evaluation of LLMs for Truth Maintenance and Reasoning Tasks | Oct 11, 2024 | BenchmarkingLanguage Modeling | —Unverified | 0 |
| ForamViT-GAN: Exploring New Paradigms in Deep Learning for Micropaleontological Image Analysis | Apr 9, 2023 | BenchmarkingDeep Learning | —Unverified | 0 |
| Forecasting Lithium-Ion Battery Longevity with Limited Data Availability: Benchmarking Different Machine Learning Algorithms | Dec 10, 2023 | Battery cycle life predictionBenchmarking | —Unverified | 0 |
| Forecasting NIFTY 50 benchmark Index using Seasonal ARIMA time series models | Jan 9, 2020 | BenchmarkingTime Series | —Unverified | 0 |
| FOR-instance: a UAV laser scanning benchmark dataset for semantic and instance segmentation of individual trees | Sep 3, 2023 | BenchmarkingInstance Segmentation | —Unverified | 0 |
| FORLAPS: An Innovative Data-Driven Reinforcement Learning Approach for Prescriptive Process Monitoring | Jan 17, 2025 | BenchmarkingData Augmentation | —Unverified | 0 |
| Formal Covariate Benchmarking to Bound Omitted Variable Bias | Jun 18, 2023 | BenchmarkingSensitivity | —Unverified | 0 |
| FormFactory: An Interactive Benchmarking Suite for Multimodal Form-Filling Agents | Jun 2, 2025 | BenchmarkingForm | —Unverified | 0 |
| Foundation Models for Remote Sensing: An Analysis of MLLMs for Object Localization | Apr 14, 2025 | BenchmarkingEarth Observation | —Unverified | 0 |
| Foundations for learning from noisy quantum experiments | Apr 28, 2022 | Benchmarking | —Unverified | 0 |
| Found in Translation: Measuring Multilingual LLM Consistency as Simple as Translate then Evaluate | May 28, 2025 | Benchmarking | —Unverified | 0 |
| FoundTS: Comprehensive and Unified Benchmarking of Foundation Models for Time Series Forecasting | Oct 15, 2024 | Benchmarkingenergy management | —Unverified | 0 |
| Framework and Benchmarks for Combinatorial and Mixed-variable Bayesian Optimization | Jun 16, 2023 | Bayesian OptimizationBenchmarking | —Unverified | 0 |
| FRED: The Florence RGB-Event Drone Dataset | Jun 5, 2025 | BenchmarkingTrajectory Forecasting | —Unverified | 0 |
| Free Performance Gain from Mixing Multiple Partially Labeled Samples in Multi-label Image Classification | May 24, 2024 | BenchmarkingData Augmentation | —Unverified | 0 |
| From 2D to 3D: Re-thinking Benchmarking of Monocular Depth Prediction | Mar 15, 2022 | 3D geometryBenchmarking | —Unverified | 0 |
| From Audio Encoders to Piano Judges: Benchmarking Performance Understanding for Solo Piano | Jul 5, 2024 | AttributeBenchmarking | —Unverified | 0 |
| From Blind Solvers to Logical Thinkers: Benchmarking LLMs' Logical Integrity on Faulty Mathematical Problems | Oct 24, 2024 | BenchmarkingCommon Sense Reasoning | —Unverified | 0 |
| From Code to Play: Benchmarking Program Search for Games Using Large Language Models | Dec 5, 2024 | Atari GamesBenchmarking | —Unverified | 0 |
| From Environmental Sound Representation to Robustness of 2D CNN Models Against Adversarial Attacks | Apr 14, 2022 | Adversarial AttackAdversarial Robustness | —Unverified | 0 |
| From Generalist to Specialist: Improving Large Language Models for Medical Physics Using ARCoT | May 17, 2024 | BenchmarkingMultiple-choice | —Unverified | 0 |
| From Generation to Detection: A Multimodal Multi-Task Dataset for Benchmarking Health Misinformation | May 24, 2025 | ArticlesBenchmarking | —Unverified | 0 |
| From Grounding to Planning: Benchmarking Bottlenecks in Web Agents | Sep 3, 2024 | Benchmarking | —Unverified | 0 |