SOTAVerified|Agents Browse Leaderboard About

Decision Making

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 61–70 of 12311 papers

Title	Date	Tasks	Status	Hype
ACEGEN: Reinforcement learning of generative chemical agents for drug discovery	May 7, 2024	BenchmarkingDecision Making	CodeCode Available	3
Evaluating Language Model Agency through Negotiations	Jan 9, 2024	Decision MakingLanguage Modeling	CodeCode Available	3
Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making	Oct 9, 2024	BenchmarkingDecision Making	CodeCode Available	3
Embodied CoT Distillation From LLM To Off-the-shelf Agents	Dec 16, 2024	Decision MakingIn-Context Learning	CodeCode Available	3
A Survey on the Optimization of Large Language Model-based Agents	Mar 16, 2025	Decision MakingLanguage Modeling	CodeCode Available	3
A Smart Multimodal Healthcare Copilot with Powerful LLM Reasoning	Jun 3, 2025	Decision MakingDiagnostic	CodeCode Available	3
Enhancing Decision Analysis with a Large Language Model: pyDecision a Comprehensive Library of MCDA Methods in Python	Apr 9, 2024	Decision MakingLanguage Modeling	CodeCode Available	3
Evolve Cost-aware Acquisition Functions Using Large Language Models	Apr 25, 2024	Bayesian OptimizationDecision Making	CodeCode Available	3
FlashDepth: Real-time Streaming Video Depth Estimation at 2K Resolution	Apr 9, 2025	2kDecision Making	CodeCode Available	3
V-IRL: Grounding Virtual Intelligence in Real Life	Feb 5, 2024	Decision Making	CodeCode Available	3

Show:10 25 50

← PrevPage 7 of 1232Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	SRLA	Average Remaining Cycles	6.4	—	Unverified