SOTAVerified

Decision Making

Papers

Showing 2531-2540 of 12311 papers

Title | Status | Hype
View From Above: A Framework for Evaluating Distribution Shifts in Model Behavior | Code | 0
EconNLI: Evaluating Large Language Models on Economics Reasoning | Code | 0
DynaThink: Fast or Slow? A Dynamic Decision-Making Framework for Large Language Models | | 0
Let Hybrid A* Path Planner Obey Traffic Rules: A Deep Reinforcement Learning-Based Planning Framework | | 0
Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion | Code | 9
OSL-ActionSpotting: A Unified Library for Action Spotting in Sports Videos | | 0
Improve ROI with Causal Learning and Conformal Prediction | | 0
Improving Trip Mode Choice Modeling Using Ensemble Synthesizer (ENSY) | | 0
Enhancing Travel Decision-Making: A Contrastive Learning Approach for Personalized Review Rankings in Accommodations | | 0
MUSE-Net: Missingness-aware mUlti-branching Self-attention Encoder for Irregular Longitudinal Electronic Health Records | | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | SRLA | Average Remaining Cycles | 6.4 | | Unverified