SOTAVerified

Decision Making

Papers

Showing 26912700 of 12311 papers

TitleStatusHype
Test-Time Fairness and Robustness in Large Language Models0
Optimal policy design for decision problems under social influence0
Capacity Credit Evaluation of Generalized Energy Storage Considering Strategic Capacity Withholding and Decision-Dependent Uncertainty0
Beyond ELBOs: A Large-Scale Evaluation of Variational Methods for SamplingCode1
Enhancing Tabular Data Optimization with a Flexible Graph-based Reinforced Exploration Strategy0
World Models with Hints of Large Language Models for Goal Achieving0
Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8BCode5
Trustworthy and Practical AI for Healthcare: A Guided Deferral System with Large Language Models0
Long-Term Fairness Inquiries and Pursuits in Machine Learning: A Survey of Notions, Methods, and Challenges0
Satisficing Exploration in Bandit Optimization0
Show:102550
← PrevPage 270 of 1232Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified