Holdout Set

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 1–25 of 35 papers

Title	Date	Tasks	Status	Hype	Score
Distribution-Free, Risk-Controlling Prediction Sets	Jan 7, 2021	BIG-bench Machine LearningClassification	CodeCode Available	2	5
TotalVibeSegmentator: Full Body MRI Segmentation for the NAKO and UK Biobank	May 31, 2024	EpidemiologyHoldout Set	CodeCode Available	2	5
Liver Tumor Screening and Diagnosis in CT with Pixel-Lesion-Patient Network	Jul 17, 2023	Computed Tomography (CT)Holdout Set	CodeCode Available	1	5
Understanding Transformers via N-gram Statistics	Jun 30, 2024	Holdout Set	CodeCode Available	1	5
xView3-SAR: Detecting Dark Fishing Activity Using Synthetic Aperture Radar Imagery	Jun 2, 2022	Decision Making Under UncertaintyHoldout Set	CodeCode Available	1	5
Template-Based Automatic Search of Compact Semantic Segmentation Architectures	Apr 4, 2019	General ClassificationHoldout Set	CodeCode Available	1	5
Generalization of Reinforcement Learners with Working and Episodic Memory	Oct 29, 2019	Deep Reinforcement LearningHoldout Set	CodeCode Available	0	5
A shared latent space matrix factorisation method for recommending new trial evidence for systematic review updates	Feb 27, 2018	Holdout Set	CodeCode Available	0	5
Comprehensive dataset of user-submitted articles with ideological and extreme bias from Reddit	Aug 12, 2024	ArticlesHoldout Set	CodeCode Available	0	5
Generalization in Adaptive Data Analysis and Holdout Reuse	Jun 8, 2015	Holdout Set	CodeCode Available	0	5
Holdouts set for safe predictive model updating	Feb 13, 2022	Holdout Setmodel	CodeCode Available	0	5
Parametric Scaling Law of Tuning Bias in Conformal Prediction	Feb 5, 2025	Conformal PredictionHoldout Set	CodeCode Available	0	5
Persistent Homology Captures the Generalization of Neural Networks Without A Validation Set	May 31, 2021	Holdout Set	CodeCode Available	0	5
RATT: Leveraging Unlabeled Data to Guarantee Generalization	May 1, 2021	Generalization BoundsHoldout Set	CodeCode Available	0	5
Testing for Overfitting	May 9, 2023	Holdout Setvalid	CodeCode Available	0	5
Uncovering convolutional neural network decisions for diagnosing multiple sclerosis on conventional MRI using layer-wise relevance propagation	Apr 18, 2019	Decision MakingDiagnostic	CodeCode Available	0	5
Who's the (Multi-)Fairest of Them All: Rethinking Interpolation-Based Data Augmentation Through the Lens of Multicalibration	Dec 13, 2024	AllData Augmentation	CodeCode Available	0	5
Outcome-based Reinforcement Learning to Predict the Future	May 23, 2025	Holdout SetMath	—Unverified	0	0
Who Wins the Game of Thrones? How Sentiments Improve the Prediction of Candidate Choice	Feb 29, 2020	BenchmarkingHoldout Set	—Unverified	0	0
A Meta-Analysis of Overfitting in Machine Learning	Dec 1, 2019	BIG-bench Machine LearningHoldout Set	—Unverified	0	0
Predicting Individual Responses to Vasoactive Medications in Children with Septic Shock	Jan 15, 2019	Holdout Setregression	—Unverified	0	0
Using Poisson Binomial GLMs to Reveal Voter Preferences	Feb 4, 2018	Holdout Set	—Unverified	0	0
STAND: Data-Efficient and Self-Aware Precondition Induction for Interactive Task Learning	Sep 11, 2024	Active LearningHoldout Set	—Unverified	0	0
Benchmark Inflation: Revealing LLM Performance Gaps Using Retro-Holdouts	Oct 11, 2024	Holdout SetMisconceptions	—Unverified	0	0
Adaptive Statistical Learning with Bayesian Differential Privacy	Nov 2, 2019	Holdout Set	—Unverified	0	0

Show:10 25 50

← PrevPage 1 of 2Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	BloodAxe, 1st place xView3 prize challenge	Aggregate xView3 Score	0.62	—	Unverified
2	selim_sef, 2nd place xView3 prize challenge	Aggregate xView3 Score	0.6	—	Unverified
3	Tumen, 3rd place xView3 prize challenge	Aggregate xView3 Score	0.58	—	Unverified
4	Skylight at AI2, 4th place xView3 prize challenge	Aggregate xView3 Score	0.58	—	Unverified
5	Kohei, 5th place xView3 prize challenge	Aggregate xView3 Score	0.57	—	Unverified