SOTAVerified

Decision Making

Papers

Showing 21–30 of 12311 papers

Title	Date	Tasks	Status	Hype	Score
A Survey on Large Language Model-Based Game Agents	Apr 2, 2024	Decision Making, Language Modeling	Code Available	4	5
OmniDrive: A Holistic Vision-Language Dataset for Autonomous Driving with Counterfactual Reasoning	May 2, 2024	Autonomous Driving, counterfactual	Code Available	4	5
Mastering Diverse Domains through World Models	Jan 10, 2023	Atari Games 100k, Decision Making	Code Available	4	5
Cognitive Architectures for Language Agents	Sep 5, 2023	Decision Making	Code Available	4	5
Constitutional AI: Harmlessness from AI Feedback	Dec 15, 2022	Decision Making	Code Available	4	5
pgmpy: A Python Toolkit for Bayesian Networks	Apr 17, 2023	Causal Discovery, Causal Identification	Code Available	4	5
AutoWebGLM: A Large Language Model-based Web Navigating Agent	Apr 4, 2024	Decision Making, Language Modeling	Code Available	4	5
Is Sora a World Simulator? A Comprehensive Survey on General World Models and Beyond	May 6, 2024	Autonomous Driving, Decision Making	Code Available	4	5
Agent Q: Advanced Reasoning and Learning for Autonomous AI Agents	Aug 13, 2024	Decision Making	Code Available	4	5
AgentBench: Evaluating LLMs as Agents	Aug 7, 2023	Decision Making, Instruction Following	Code Available	4	5


Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	SRLA	Average Remaining Cycles	6.4	—	Unverified