SOTAVerified|Agents Browse Leaderboard About

Holdout Set

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 21–30 of 35 papers

Title	Date	Tasks	Status	Hype	Score
Predicting Individual Responses to Vasoactive Medications in Children with Septic Shock	Jan 15, 2019	Holdout Setregression	—Unverified	0	0
Using Poisson Binomial GLMs to Reveal Voter Preferences	Feb 4, 2018	Holdout Set	—Unverified	0	0
STAND: Data-Efficient and Self-Aware Precondition Induction for Interactive Task Learning	Sep 11, 2024	Active LearningHoldout Set	—Unverified	0	0
Benchmark Inflation: Revealing LLM Performance Gaps Using Retro-Holdouts	Oct 11, 2024	Holdout SetMisconceptions	—Unverified	0	0
Adaptive Statistical Learning with Bayesian Differential Privacy	Nov 2, 2019	Holdout Set	—Unverified	0	0
The Benefits and Risks of Transductive Approaches for AI Fairness	Jun 17, 2024	FairnessHoldout Set	—Unverified	0	0
The DCR Delusion: Measuring the Privacy Risk of Synthetic Data	May 2, 2025	Holdout Set	—Unverified	0	0
Diversified Ensembling: An Experiment in Crowdsourced Machine Learning	Feb 16, 2024	FairnessHoldout Set	—Unverified	0	0
The Generic Holdout: Preventing False-Discoveries in Adaptive Data Science	Sep 14, 2018	Holdout Set	—Unverified	0	0
Challenges in Bayesian Adaptive Data Analysis	Apr 8, 2016	Holdout Set	—Unverified	0	0

Show:10 25 50

← PrevPage 3 of 4Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	BloodAxe, 1st place xView3 prize challenge	Aggregate xView3 Score	0.62	—	Unverified
2	selim_sef, 2nd place xView3 prize challenge	Aggregate xView3 Score	0.6	—	Unverified
3	Tumen, 3rd place xView3 prize challenge	Aggregate xView3 Score	0.58	—	Unverified
4	Skylight at AI2, 4th place xView3 prize challenge	Aggregate xView3 Score	0.58	—	Unverified
5	Kohei, 5th place xView3 prize challenge	Aggregate xView3 Score	0.57	—	Unverified