SOTAVerified

Decision Making

Papers

Showing 6170 of 12311 papers

TitleStatusHype
ACEGEN: Reinforcement learning of generative chemical agents for drug discoveryCode3
Evaluating Language Model Agency through NegotiationsCode3
Embodied Agent Interface: Benchmarking LLMs for Embodied Decision MakingCode3
Embodied CoT Distillation From LLM To Off-the-shelf AgentsCode3
A Survey on the Optimization of Large Language Model-based AgentsCode3
A Smart Multimodal Healthcare Copilot with Powerful LLM ReasoningCode3
Enhancing Decision Analysis with a Large Language Model: pyDecision a Comprehensive Library of MCDA Methods in PythonCode3
Evolve Cost-aware Acquisition Functions Using Large Language ModelsCode3
FlashDepth: Real-time Streaming Video Depth Estimation at 2K ResolutionCode3
V-IRL: Grounding Virtual Intelligence in Real LifeCode3
Show:102550
← PrevPage 7 of 1232Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified