SOTAVerified

Decision Making

Papers

Showing 1120 of 12311 papers

TitleStatusHype
Maia-2: A Unified Model for Human-AI Alignment in ChessCode5
Accessing GPT-4 level Mathematical Olympiad Solutions via Monte Carlo Tree Self-refine with LLaMa-3 8BCode5
GraphCast: Learning skillful medium-range global weather forecastingCode5
Multi-Agent Reinforcement Learning for Autonomous Driving: A SurveyCode5
Deep Lake: a Lakehouse for Deep LearningCode5
Large Language Model based Multi-Agents: A Survey of Progress and ChallengesCode5
LM Transparency Tool: Interactive Tool for Analyzing Transformer Language ModelsCode5
Differentiable Tree Search NetworkCode5
Neural Fields in Robotics: A SurveyCode5
AutoWebGLM: A Large Language Model-based Web Navigating AgentCode4
Show:102550
← PrevPage 2 of 1232Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified