| CleanPatrick: A Benchmark for Image Data Cleaning | May 16, 2025 | BenchmarkingLabel Error Detection | CodeCode Available | 0 |
| Detecting critical treatment effect bias in small subgroups | Apr 29, 2024 | BenchmarkingDecision Making | CodeCode Available | 0 |
| AI-generated Image Quality Assessment in Visual Communication | Dec 20, 2024 | BenchmarkingImage Quality Assessment | CodeCode Available | 0 |
| SOSD: A Benchmark for Learned Indexes | Nov 29, 2019 | BenchmarkingManagement | CodeCode Available | 0 |
| OpenML Benchmarking Suites | Aug 11, 2017 | BenchmarkingBIG-bench Machine Learning | CodeCode Available | 0 |
| DEsignBench: Exploring and Benchmarking DALL-E 3 for Imagining Visual Design | Oct 23, 2023 | BenchmarkingImage Generation | CodeCode Available | 0 |
| Design and implementation of intelligent packet filtering in IoT microcontroller-based devices | May 30, 2023 | Benchmarking | CodeCode Available | 0 |
| OpenOOD: Benchmarking Generalized Out-of-Distribution Detection | Oct 13, 2022 | Anomaly DetectionBenchmarking | CodeCode Available | 0 |
| Dermatological Diagnosis Explainability Benchmark for Convolutional Neural Networks | Feb 23, 2023 | BenchmarkingMedical Diagnosis | CodeCode Available | 0 |
| Depth Functions for Partial Orders with a Descriptive Analysis of Machine Learning Algorithms | Apr 19, 2023 | BenchmarkingDescriptive | CodeCode Available | 0 |
| Delving into Instance-Dependent Label Noise in Graph Data: A Comprehensive Study and Benchmark | Jun 14, 2025 | BenchmarkingGraph Learning | CodeCode Available | 0 |
| Towards Efficient and Scalable Training of Differentially Private Deep Learning | Jun 25, 2024 | BenchmarkingDeep Learning | CodeCode Available | 0 |
| Benchmarking Label Noise in Instance Segmentation: Spatial Noise Matters | Jun 16, 2024 | BenchmarkingInstance Segmentation | CodeCode Available | 0 |
| Towards Efficient Benchmarking of Foundation Models in Remote Sensing: A Capabilities Encoding Approach | May 6, 2025 | BenchmarkingEarth Observation | CodeCode Available | 0 |
| Delta-Influence: Unlearning Poisons via Influence Functions | Nov 20, 2024 | AttributeBenchmarking | CodeCode Available | 0 |
| Benchmarking Keyword Spotting Efficiency on Neuromorphic Hardware | Dec 4, 2018 | BenchmarkingCPU | CodeCode Available | 0 |
| Defense-friendly Images in Adversarial Attacks: Dataset and Metrics for Perturbation Difficulty | Nov 5, 2020 | Adversarial AttackBenchmarking | CodeCode Available | 0 |
| DefAn: Definitive Answer Dataset for LLMs Hallucination Evaluation | Jun 13, 2024 | BenchmarkingHallucination | CodeCode Available | 0 |
| Deep Reinforcement Learning for General Video Game AI | Jun 6, 2018 | Atari GamesBenchmarking | CodeCode Available | 0 |
| DeepPatent2: A Large-Scale Benchmarking Corpus for Technical Drawing Understanding | Nov 7, 2023 | 3D ReconstructionBenchmarking | CodeCode Available | 0 |
| Operation-Level Performance Benchmarking of Graph Neural Networks for Scientific Applications | Jul 20, 2022 | Benchmarking | CodeCode Available | 0 |
| DeepOBS: A Deep Learning Optimizer Benchmark Suite | Mar 13, 2019 | BenchmarkingDeep Learning | CodeCode Available | 0 |
| VarBench: Robust Language Model Benchmarking Through Dynamic Variable Perturbation | Jun 25, 2024 | ARCBenchmarking | CodeCode Available | 0 |
| OptIForest: Optimal Isolation Forest for Anomaly Detection | Jun 22, 2023 | Anomaly DetectionBenchmarking | CodeCode Available | 0 |
| Towards Emotionally Consistent Text-Based Speech Editing: Introducing EmoCorrector and The ECD-TSE Dataset | May 24, 2025 | BenchmarkingRAG | CodeCode Available | 0 |