SOTAVerified

Decision Making

Papers

Showing 6170 of 12311 papers

TitleStatusHype
Evaluating Language Model Agency through NegotiationsCode3
Embodied CoT Distillation From LLM To Off-the-shelf AgentsCode3
Embodied Agent Interface: Benchmarking LLMs for Embodied Decision MakingCode3
Enhancing Decision Analysis with a Large Language Model: pyDecision a Comprehensive Library of MCDA Methods in PythonCode3
A Survey on the Optimization of Large Language Model-based AgentsCode3
A Demonstration of Adaptive Collaboration of Large Language Models for Medical Decision-MakingCode3
Auto-RAG: Autonomous Retrieval-Augmented Generation for Large Language ModelsCode3
Evolve Cost-aware Acquisition Functions Using Large Language ModelsCode3
FlashDepth: Real-time Streaming Video Depth Estimation at 2K ResolutionCode3
V-IRL: Grounding Virtual Intelligence in Real LifeCode3
Show:102550
← PrevPage 7 of 1232Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified