SOTAVerified

Decision Making

Papers

Showing 211-220 of 12311 papers

Title | Status | Hype
Context is Key: A Benchmark for Forecasting with Essential Textual Information | Code | 2
Aligning Superhuman AI with Human Behavior: Chess as a Model System | Code | 2
A Comprehensive Guide to Explainable AI: From Classical Models to LLMs | Code | 2
Position: Foundation Agents as the Paradigm Shift for Decision Making | Code | 2
Cross-Prediction-Powered Inference | Code | 2
ChartGemma: Visual Instruction-tuning for Chart Reasoning in the Wild | Code | 2
PRMBench: A Fine-grained and Challenging Benchmark for Process-Level Reward Models | Code | 2
ProAgent: From Robotic Process Automation to Agentic Process Automation | Code | 2
DrivingSphere: Building a High-fidelity 4D World for Closed-loop Simulation | Code | 2
V-Max: A Reinforcement Learning Framework for Autonomous Driving | Code | 2

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | SRLA | Average Remaining Cycles | 6.4 | - | Unverified