SOTAVerified

Decision Making

Papers

Showing 591600 of 12311 papers

TitleStatusHype
What Makes an Evaluation Useful? Common Pitfalls and Best Practices0
OpenDriveVLA: Towards End-to-end Autonomous Driving with Large Vision Language Action ModelCode4
Exploring Explainable Multi-player MCTS-minimax Hybrids in Board Game Using Process Mining0
Towards Trustworthy GUI Agents: A SurveyCode0
Reinforcement Learning-based Token Pruning in Vision Transformers: A Markov Game ApproachCode0
Iterative VCG-based Mechanism Fosters Cooperation in Multi-Regional Network Design0
Towards Interpretable Counterfactual Generation via Multimodal Autoregression0
A Training-free LLM Framework with Interaction between Contextually Related Subtasks in Solving Complex Tasks0
Towards Personalized Conversational Sales Agents : Contextual User Profiling for Strategic Action0
GroundHog: Revolutionizing GLDAS Groundwater Storage Downscaling for Enhanced Recharge Estimation in Bangladesh0
Show:102550
← PrevPage 60 of 1232Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified