SOTAVerified|Agents Browse Leaderboard About

Decision Making

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 41–50 of 12311 papers

Title	Date	Tasks	Status	Hype
Will LLMs be Professional at Fund Investment? DeepFund: A Live Arena Perspective	Mar 24, 2025	Decision Making	CodeCode Available	3
A Survey on the Optimization of Large Language Model-based Agents	Mar 16, 2025	Decision MakingLanguage Modeling	CodeCode Available	3
Parallelized Planning-Acting for Efficient LLM-based Multi-Agent Systems	Mar 5, 2025	Decision MakingLanguage Modeling	CodeCode Available	3
Automated Hypothesis Validation with Agentic Sequential Falsifications	Feb 14, 2025	Decision MakingHallucination	CodeCode Available	3
Rethinking Early Stopping: Refine, Then Calibrate	Jan 31, 2025	Decision Making	CodeCode Available	3
MineStudio: A Streamlined Package for Minecraft AI Agent Development	Dec 24, 2024	AI AgentDecision Making	CodeCode Available	3
Embodied CoT Distillation From LLM To Off-the-shelf Agents	Dec 16, 2024	Decision MakingIn-Context Learning	CodeCode Available	3
AuctionNet: A Novel Benchmark for Decision-Making in Large-Scale Games	Dec 14, 2024	Decision Making	CodeCode Available	3
Auto-RAG: Autonomous Retrieval-Augmented Generation for Large Language Models	Nov 29, 2024	Decision MakingRAG	CodeCode Available	3
Game-theoretic LLM: Agent Workflow for Negotiation Games	Nov 8, 2024	Decision Making	CodeCode Available	3

Show:10 25 50

← PrevPage 5 of 1232Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	SRLA	Average Remaining Cycles	6.4	—	Unverified