SOTAVerified

Decision Making

Papers

Showing 781790 of 12311 papers

TitleStatusHype
ALMA: Hierarchical Learning for Composite Multi-Agent TasksCode1
Benchmarks for Deep Off-Policy EvaluationCode1
BLADE: Benchmarking Language Model Agents for Data-Driven ScienceCode1
Autonomous Driving using Residual Sensor Fusion and Deep Reinforcement LearningCode1
LLM-SAP: Large Language Models Situational Awareness Based PlanningCode1
Large-scale moral machine experiment on large language modelsCode1
Active Inference and Behavior Trees for Reactive Action Planning and Execution in RoboticsCode1
Autonomous Exploration Under Uncertainty via Deep Reinforcement Learning on GraphsCode1
Active Fire Detection in Landsat-8 Imagery: a Large-Scale Dataset and a Deep-Learning StudyCode1
A User's Guide to Calibrating Robotics SimulatorsCode1
Show:102550
← PrevPage 79 of 1232Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified