| fseval: A Benchmarking Framework for Feature Selection and Feature Ranking Algorithms | Nov 23, 2022 | Automated Feature Engineering, Benchmarking | Code Available | 1 | 5 |
| Should we be going MAD? A Look at Multi-Agent Debate Strategies for LLMs | Nov 29, 2023 | Benchmarking | Code Available | 1 | 5 |
| Are Vision Language Models Ready for Clinical Diagnosis? A 3D Medical Benchmark for Tumor-centric Visual Question Answering | May 25, 2025 | Anatomy, Benchmarking | Code Available | 1 | 5 |
| FragXsiteDTI: Revealing Responsible Segments in Drug-Target Interaction with Transformer-Driven Interpretation | Nov 4, 2023 | Benchmarking, Drug Discovery | Code Available | 1 | 5 |
| 3D AffordanceNet: A Benchmark for Visual Object Affordance Understanding | Mar 30, 2021 | Affordance Detection, Benchmarking | Code Available | 1 | 5 |
| Foundation Model of Electronic Medical Records for Adaptive Risk Estimation | Feb 10, 2025 | Benchmarking | Code Available | 1 | 5 |
| FreeMan: Towards Benchmarking 3D Human Pose Estimation under Real-World Conditions | Sep 10, 2023 | 3D Human Pose Estimation, 3D Pose Estimation | Code Available | 1 | 5 |
| FTNet: Feature Transverse Network for Thermal Image Semantic Segmentation | Oct 26, 2021 | Benchmarking, Scene Segmentation | Code Available | 1 | 5 |
| GastroVision: A Multi-class Endoscopy Image Dataset for Computer Aided Gastrointestinal Disease Detection | Jul 16, 2023 | Benchmarking | Code Available | 1 | 5 |
| Benchmarking emergency department triage prediction models with machine learning and large public electronic health records | Nov 22, 2021 | Benchmarking | Code Available | 1 | 5 |