| HERMES: Holographic Equivariant neuRal network model for Mutational Effect and Stability prediction | Jul 9, 2024 | Benchmarking | Code Available | 0 |
| HATE-ITA: New Baselines for Hate Speech Detection in Italian | Jul 1, 2022 | Benchmarking, Hate Speech Detection | Code Available | 0 |
| Benchmarking YOLOv5 and YOLOv7 models with DeepSORT for droplet tracking applications | Jan 19, 2023 | Benchmarking, GPU | Code Available | 0 |
| Benchmarking White Blood Cell Classification Under Domain Shift | Mar 3, 2023 | Benchmarking, Classification | Code Available | 0 |
| MAYA: Addressing Inconsistencies in Generative Password Guessing through a Unified Benchmark | Apr 23, 2025 | Benchmarking | Code Available | 0 |
| Robust Benchmarking for Machine Learning of Clinical Entity Extraction | Jul 31, 2020 | Benchmarking, BIG-bench Machine Learning | Code Available | 0 |
| MCA-Bench: A Multimodal Benchmark for Evaluating CAPTCHA Robustness Against VLM-based Attacks | Jun 6, 2025 | Benchmarking | Code Available | 0 |
| Benchmarking Vision-Language Contrastive Methods for Medical Representation Learning | Jun 11, 2024 | Benchmarking, Contrastive Learning | Code Available | 0 |
| A Wild Bootstrap for Degenerate Kernel Tests | Aug 23, 2014 | Benchmarking, Time Series | Code Available | 0 |
| Harnessing Orthogonality to Train Low-Rank Neural Networks | Jan 16, 2024 | Benchmarking | Code Available | 0 |
| Aux-Drop: Handling Haphazard Inputs in Online Learning Using Auxiliary Dropouts | Mar 9, 2023 | Benchmarking | Code Available | 0 |
| Causally Testing Gender Bias in LLMs: A Case Study on Occupational Bias | Dec 20, 2022 | Benchmarking | Code Available | 0 |
| Benchmarking Unsupervised Strategies for Anomaly Detection in Multivariate Time Series | Jun 25, 2025 | Anomaly Detection, Benchmarking | Code Available | 0 |
| Harmonization Benchmarking Tool for Neuroimaging Datasets | Nov 15, 2022 | Benchmarking, Diffusion MRI | Code Available | 0 |
| Adaptive Shrinkage Estimation For Personalized Deep Kernel Regression In Modeling Brain Trajectories | Apr 10, 2025 | Additive models, Benchmarking | Code Available | 0 |
| Benchmarking Unsupervised Online IDS for Masquerade Attacks in CAN | Jun 19, 2024 | Benchmarking, Intrusion Detection | Code Available | 0 |
| The iToBoS dataset: skin region images extracted from 3D total body photographs for lesion detection | Jan 30, 2025 | Benchmarking, Diagnostic | Code Available | 0 |
| Benchmarking Ultra-High-Definition Image Reflection Removal | Aug 1, 2023 | Benchmarking, Image Restoration | Code Available | 0 |
| Understanding the Role of LLMs in Multimodal Evaluation Benchmarks | Oct 16, 2024 | Benchmarking, Large Language Model | Code Available | 0 |
| VocalBench: Benchmarking the Vocal Conversational Abilities for Speech Interaction Models | May 21, 2025 | Benchmarking | Code Available | 0 |
| Measuring what Really Matters: Optimizing Neural Networks for TinyML | Apr 21, 2021 | Benchmarking | Code Available | 0 |
| Benchmarking Traditional Machine Learning and Deep Learning Models for Fault Detection in Power Transformers | May 7, 2025 | Benchmarking, Fault Detection | Code Available | 0 |
| Benchmarking TPU, GPU, and CPU Platforms for Deep Learning | Jul 24, 2019 | Benchmarking, CPU | Code Available | 0 |
| RoLargeSum: A Large Dialect-Aware Romanian News Dataset for Summary, Headline, and Keyword Generation | Dec 15, 2024 | Articles, Benchmarking | Code Available | 0 |
| Hardware Aware Neural Network Architectures using FbNet | Jun 17, 2019 | Benchmarking, Neural Architecture Search | Code Available | 0 |