SOTAVerified|Agents Browse Leaderboard About Blog

Decision Making

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 31–40 of 12311 papers

Title	Date	Tasks	Status	Hype	Score
Constitutional AI: Harmlessness from AI Feedback	Dec 15, 2022	Decision Making	CodeCode Available	4	5
AgentBench: Evaluating LLMs as Agents	Aug 7, 2023	Decision MakingInstruction Following	CodeCode Available	4	5
Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning	Mar 20, 2025	Decision MakingLanguage Modeling	CodeCode Available	4	5
Cognitive Architectures for Language Agents	Sep 5, 2023	Decision Making	CodeCode Available	4	5
Mastering Diverse Domains through World Models	Jan 10, 2023	Atari Games 100kDecision Making	CodeCode Available	4	5
pgmpy: A Python Toolkit for Bayesian Networks	Apr 17, 2023	Causal DiscoveryCausal Identification	CodeCode Available	4	5
Behavior Generation with Latent Actions	Mar 5, 2024	Autonomous DrivingDecision Making	CodeCode Available	3	5
FlashDepth: Real-time Streaming Video Depth Estimation at 2K Resolution	Apr 9, 2025	2kDecision Making	CodeCode Available	3	5
Auto-RAG: Autonomous Retrieval-Augmented Generation for Large Language Models	Nov 29, 2024	Decision MakingRAG	CodeCode Available	3	5
ACEGEN: Reinforcement learning of generative chemical agents for drug discovery	May 7, 2024	BenchmarkingDecision Making	CodeCode Available	3	5

Show:10 25 50

← PrevPage 4 of 1232Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	SRLA	Average Remaining Cycles	6.4	—	Unverified