SOTAVerified

Decision Making

Papers

Showing 51–60 of 12311 papers

Title	Date	Tasks	Status	Hype
FlashDepth: Real-time Streaming Video Depth Estimation at 2K Resolution	Apr 9, 2025	2k, Decision Making	Code Available	3
Automated Hypothesis Validation with Agentic Sequential Falsifications	Feb 14, 2025	Decision Making, Hallucination	Code Available	3
AuctionNet: A Novel Benchmark for Decision-Making in Large-Scale Games	Dec 14, 2024	Decision Making	Code Available	3
Evaluating Language Model Agency through Negotiations	Jan 9, 2024	Decision Making, Language Modeling	Code Available	3
Automatic Gradient Estimation for Calibrating Crowd Models with Discrete Decision Making	Apr 6, 2024	Decision Making	Code Available	3
Attention is not not Explanation	Aug 13, 2019	Decision Making, Diagnostic	Code Available	3
Evolve Cost-aware Acquisition Functions Using Large Language Models	Apr 25, 2024	Bayesian Optimization, Decision Making	Code Available	3
Game-theoretic LLM: Agent Workflow for Negotiation Games	Nov 8, 2024	Decision Making	Code Available	3
Enhancing Decision Analysis with a Large Language Model: pyDecision a Comprehensive Library of MCDA Methods in Python	Apr 9, 2024	Decision Making, Language Modeling	Code Available	3
Embodied CoT Distillation From LLM To Off-the-shelf Agents	Dec 16, 2024	Decision Making, In-Context Learning	Code Available	3

Page 6 of 1232

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	SRLA	Average Remaining Cycles	6.4	—	Unverified