| TotalVibeSegmentator: Full Body MRI Segmentation for the NAKO and UK Biobank | May 31, 2024 | Epidemiology, Holdout Set | Code Available | 2 |
| Distribution-Free, Risk-Controlling Prediction Sets | Jan 7, 2021 | BIG-bench Machine Learning, Classification | Code Available | 2 |
| Liver Tumor Screening and Diagnosis in CT with Pixel-Lesion-Patient Network | Jul 17, 2023 | Computed Tomography (CT), Holdout Set | Code Available | 1 |
| Understanding Transformers via N-gram Statistics | Jun 30, 2024 | Holdout Set | Code Available | 1 |
| Template-Based Automatic Search of Compact Semantic Segmentation Architectures | Apr 4, 2019 | General Classification, Holdout Set | Code Available | 1 |
| xView3-SAR: Detecting Dark Fishing Activity Using Synthetic Aperture Radar Imagery | Jun 2, 2022 | Decision Making Under Uncertainty, Holdout Set | Code Available | 1 |
| Challenges in Bayesian Adaptive Data Analysis | Apr 8, 2016 | Holdout Set | — (Unverified) | 0 |
| Benchmark Inflation: Revealing LLM Performance Gaps Using Retro-Holdouts | Oct 11, 2024 | Holdout Set, Misconceptions | — (Unverified) | 0 |
| A Meta-Analysis of Overfitting in Machine Learning | Dec 1, 2019 | BIG-bench Machine Learning, Holdout Set | — (Unverified) | 0 |
| Diversified Ensembling: An Experiment in Crowdsourced Machine Learning | Feb 16, 2024 | Fairness, Holdout Set | — (Unverified) | 0 |