| Automated deep learning segmentation of high-resolution 7 T postmortem MRI for quantitative analysis of structure-pathology correlations in neurodegenerative diseases | Mar 21, 2023 | Anatomy, Benchmarking | Code Available | 0 | 5 |
| IceBench: A Benchmark for Deep Learning based Sea Ice Type Classification | Mar 22, 2025 | Benchmarking, Classification | Code Available | 0 | 5 |
| Integrating Large Language Models and Knowledge Graphs for Extraction and Validation of Textual Test Data | Aug 3, 2024 | Benchmarking, Knowledge Graphs | Code Available | 0 | 5 |
| IdeaBench: Benchmarking Large Language Models for Research Idea Generation | Oct 31, 2024 | Benchmarking, scientific discovery | Code Available | 0 | 5 |
| AutoJudger: An Agent-Driven Framework for Efficient Benchmarking of MLLMs | May 27, 2025 | Benchmarking, Question Selection | Code Available | 0 | 5 |
| HypoTermQA: Hypothetical Terms Dataset for Benchmarking Hallucination Tendency of LLMs | Feb 25, 2024 | Benchmarking, Chatbot | Code Available | 0 | 5 |
| Identifying and Benchmarking Natural Out-of-Context Prediction Problems | Oct 25, 2021 | Benchmarking | Code Available | 0 | 5 |
| Impact of ImageNet Model Selection on Domain Adaptation | Feb 6, 2020 | Benchmarking, Domain Adaptation | Code Available | 0 | 5 |
| Benchmarking the Linear Algebra Awareness of TensorFlow and PyTorch | Feb 20, 2022 | Benchmarking | Code Available | 0 | 5 |
| Hyperbolic Benchmarking Unveils Network Topology-Feature Relationship in GNN Performance | Jun 4, 2024 | Benchmarking, Drug Discovery | Code Available | 0 | 5 |
| AutoBench-V: Can Large Vision-Language Models Benchmark Themselves? | Oct 28, 2024 | Benchmarking, Question Answering | Code Available | 0 | 5 |
| Benchmarking the Hooke-Jeeves Method, MTS-LS1, and BSrr on the Large-scale BBOB Function Set | Apr 28, 2022 | Benchmarking | Code Available | 0 | 5 |
| ALDI++: Automatic and parameter-less discord and outlier detection for building energy load profiles | Mar 13, 2022 | Benchmarking, BIG-bench Machine Learning | Code Available | 0 | 5 |
| Hyperopt-Sklearn: Automatic Hyperparameter Configuration for Scikit-Learn | Jan 1, 2014 | AutoML, Benchmarking | Code Available | 0 | 5 |
| Benchmarking the Hill-Valley Evolutionary Algorithm for the GECCO 2018 Competition on Niching Methods Multimodal Optimization | Jun 30, 2018 | Benchmarking | Code Available | 0 | 5 |
| Hybrid Machine Learning Models of Classifying Residential Requests for Smart Dispatching | Dec 22, 2019 | Benchmarking, BIG-bench Machine Learning | Code Available | 0 | 5 |
| Hybrid Random Features | Oct 8, 2021 | Benchmarking | Code Available | 0 | 5 |
| HuSc3D: Human Sculpture dataset for 3D object reconstruction | Jun 9, 2025 | 3D Object Reconstruction, 3D Reconstruction | Code Available | 0 | 5 |
| Hyperparameter-Free Losses for Model-Based Monocular Reconstruction | Aug 16, 2019 | 3D Reconstruction, Benchmarking | Code Available | 0 | 5 |
| Benchmarking the Fairness of Image Upsampling Methods | Jan 24, 2024 | Benchmarking, Diversity | Code Available | 0 | 5 |
| AuthNet: A Deep Learning based Authentication Mechanism using Temporal Facial Feature Movements | Dec 4, 2020 | Benchmarking, Lip password classification | Code Available | 0 | 5 |
| Authentic Emotion Mapping: Benchmarking Facial Expressions in Real News | Apr 21, 2024 | Benchmarking, Emotion Recognition | Code Available | 0 | 5 |
| Alchemy: A Quantum Chemistry Dataset for Benchmarking AI Models | Jun 22, 2019 | Benchmarking, BIG-bench Machine Learning | Code Available | 0 | 5 |
| HSSBench: Benchmarking Humanities and Social Sciences Ability for Multimodal Large Language Models | Jun 4, 2025 | Benchmarking, General Knowledge | Code Available | 0 | 5 |
| HRNET: AI on Edge for mask detection and social distancing | Nov 30, 2021 | Benchmarking, Edge-computing | Code Available | 0 | 5 |