SOTAVerified

Decision Making

Papers

Showing 61–70 of 12311 papers

Title	Date	Tasks	Status	Hype	Score
Evaluating Language Model Agency through Negotiations	Jan 9, 2024	Decision Making, Language Modeling	Code Available	3	5
Embodied CoT Distillation From LLM To Off-the-shelf Agents	Dec 16, 2024	Decision Making, In-Context Learning	Code Available	3	5
Embodied Agent Interface: Benchmarking LLMs for Embodied Decision Making	Oct 9, 2024	Benchmarking, Decision Making	Code Available	3	5
Enhancing Decision Analysis with a Large Language Model: pyDecision a Comprehensive Library of MCDA Methods in Python	Apr 9, 2024	Decision Making, Language Modeling	Code Available	3	5
A Survey on the Optimization of Large Language Model-based Agents	Mar 16, 2025	Decision Making, Language Modeling	Code Available	3	5
A Demonstration of Adaptive Collaboration of Large Language Models for Medical Decision-Making	Oct 31, 2024	Decision Making, Diagnostic	Code Available	3	5
Auto-RAG: Autonomous Retrieval-Augmented Generation for Large Language Models	Nov 29, 2024	Decision Making, RAG	Code Available	3	5
Evolve Cost-aware Acquisition Functions Using Large Language Models	Apr 25, 2024	Bayesian Optimization, Decision Making	Code Available	3	5
FlashDepth: Real-time Streaming Video Depth Estimation at 2K Resolution	Apr 9, 2025	2k, Decision Making	Code Available	3	5
V-IRL: Grounding Virtual Intelligence in Real Life	Feb 5, 2024	Decision Making	Code Available	3	5

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	SRLA	Average Remaining Cycles	6.4	—	Unverified