Holdout Set

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–25 of 35 papers

Title	Date	Tasks	Status	Hype
Distribution-Free, Risk-Controlling Prediction Sets	Jan 7, 2021	BIG-bench Machine LearningClassification	CodeCode Available	2
TotalVibeSegmentator: Full Body MRI Segmentation for the NAKO and UK Biobank	May 31, 2024	EpidemiologyHoldout Set	CodeCode Available	2
Liver Tumor Screening and Diagnosis in CT with Pixel-Lesion-Patient Network	Jul 17, 2023	Computed Tomography (CT)Holdout Set	CodeCode Available	1
Understanding Transformers via N-gram Statistics	Jun 30, 2024	Holdout Set	CodeCode Available	1
xView3-SAR: Detecting Dark Fishing Activity Using Synthetic Aperture Radar Imagery	Jun 2, 2022	Decision Making Under UncertaintyHoldout Set	CodeCode Available	1
Template-Based Automatic Search of Compact Semantic Segmentation Architectures	Apr 4, 2019	General ClassificationHoldout Set	CodeCode Available	1
Diversified Ensembling: An Experiment in Crowdsourced Machine Learning	Feb 16, 2024	FairnessHoldout Set	—Unverified	0
Machine Learning for Quantifier Selection in cvc5	Aug 26, 2024	Holdout Set	—Unverified	0
Malaria Likelihood Prediction By Effectively Surveying Households Using Deep Reinforcement Learning	Nov 25, 2017	Deep Reinforcement LearningHoldout Set	—Unverified	0
Morphological Change Forecasting for Prostate Glands using Feature-based Registration and Kernel Density Extrapolation	Jan 16, 2021	Density EstimationHoldout Set	—Unverified	0
Navigating Towards Fairness with Data Selection	Dec 15, 2024	FairnessHoldout Set	—Unverified	0
Outcome-based Reinforcement Learning to Predict the Future	May 23, 2025	Holdout SetMath	—Unverified	0
Adaptive Statistical Learning with Bayesian Differential Privacy	Nov 2, 2019	Holdout Set	—Unverified	0
A Meta-Analysis of Overfitting in Machine Learning	Dec 1, 2019	BIG-bench Machine LearningHoldout Set	—Unverified	0
Application of Deep Neural Networks to assess corporate Credit Rating	Mar 4, 2020	BIG-bench Machine LearningHoldout Set	—Unverified	0
Benchmark Inflation: Revealing LLM Performance Gaps Using Retro-Holdouts	Oct 11, 2024	Holdout SetMisconceptions	—Unverified	0
Challenges in Bayesian Adaptive Data Analysis	Apr 8, 2016	Holdout Set	—Unverified	0
Predicting Individual Responses to Vasoactive Medications in Children with Septic Shock	Jan 15, 2019	Holdout Setregression	—Unverified	0
STAND: Data-Efficient and Self-Aware Precondition Induction for Interactive Task Learning	Sep 11, 2024	Active LearningHoldout Set	—Unverified	0
The Benefits and Risks of Transductive Approaches for AI Fairness	Jun 17, 2024	FairnessHoldout Set	—Unverified	0
The DCR Delusion: Measuring the Privacy Risk of Synthetic Data	May 2, 2025	Holdout Set	—Unverified	0
The Generic Holdout: Preventing False-Discoveries in Adaptive Data Science	Sep 14, 2018	Holdout Set	—Unverified	0
Using Poisson Binomial GLMs to Reveal Voter Preferences	Feb 4, 2018	Holdout Set	—Unverified	0
Who Wins the Game of Thrones? How Sentiments Improve the Prediction of Candidate Choice	Feb 29, 2020	BenchmarkingHoldout Set	—Unverified	0
Testing for Overfitting	May 9, 2023	Holdout Setvalid	CodeCode Available	0

Show:10 25 50

← PrevPage 1 of 2Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	BloodAxe, 1st place xView3 prize challenge	Aggregate xView3 Score	0.62	—	Unverified
2	selim_sef, 2nd place xView3 prize challenge	Aggregate xView3 Score	0.6	—	Unverified
3	Tumen, 3rd place xView3 prize challenge	Aggregate xView3 Score	0.58	—	Unverified
4	Skylight at AI2, 4th place xView3 prize challenge	Aggregate xView3 Score	0.58	—	Unverified
5	Kohei, 5th place xView3 prize challenge	Aggregate xView3 Score	0.57	—	Unverified