SOTAVerified

Decision Making

Papers

Showing 3140 of 12311 papers

TitleStatusHype
Mastering Diverse Domains through World ModelsCode4
Agent Q: Advanced Reasoning and Learning for Autonomous AI AgentsCode4
Is Sora a World Simulator? A Comprehensive Survey on General World Models and BeyondCode4
Cognitive Architectures for Language AgentsCode4
AgentBench: Evaluating LLMs as AgentsCode4
pgmpy: A Python Toolkit for Bayesian NetworksCode4
Behavior Generation with Latent ActionsCode3
FlashDepth: Real-time Streaming Video Depth Estimation at 2K ResolutionCode3
Auto-RAG: Autonomous Retrieval-Augmented Generation for Large Language ModelsCode3
MDAgents: An Adaptive Collaboration of LLMs for Medical Decision-MakingCode3
Show:102550
← PrevPage 4 of 1232Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified