SOTAVerified

Decision Making

Papers

Showing 6170 of 12311 papers

TitleStatusHype
Beyond A*: Better Planning with Transformers via Search Dynamics BootstrappingCode3
UniST: A Prompt-Empowered Universal Model for Urban Spatio-Temporal PredictionCode3
SPO: Sequential Monte Carlo Policy OptimisationCode3
V-IRL: Grounding Virtual Intelligence in Real LifeCode3
PokeLLMon: A Human-Parity Agent for Pokemon Battles with Large Language ModelsCode3
Evaluating Language Model Agency through NegotiationsCode3
LIBERO: Benchmarking Knowledge Transfer for Lifelong Robot LearningCode3
Hierarchical Prompting Assists Large Language Model on Web NavigationCode3
Planning with Diffusion for Flexible Behavior SynthesisCode3
Attention is not not ExplanationCode3
Show:102550
← PrevPage 7 of 1232Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified