SOTAVerified|Agents Browse Leaderboard About

Decision Making

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 931–940 of 12311 papers

Title	Date	Tasks	Status	Hype
MCTS-KBQA: Monte Carlo Tree Search for Knowledge Base Question Answering	Feb 19, 2025	Decision MakingKnowledge Base Question Answering	—Unverified	0
Fraud-R1 : A Multi-Round Benchmark for Assessing the Robustness of LLM Against Augmented Fraud and Phishing Inducements	Feb 18, 2025	Decision MakingFraud Detection	CodeCode Available	1
LLM Trading: Analysis of LLM Agent Behavior in Experimental Asset Markets	Feb 18, 2025	Decision Making	—Unverified	0
Should I Trust You? Detecting Deception in Negotiations using Counterfactual RL	Feb 18, 2025	counterfactualDeception Detection	—Unverified	0
MindLLM: A Subject-Agnostic and Versatile Model for fMRI-to-Text Decoding	Feb 18, 2025	Decision Making	—Unverified	0
AEIA-MN: Evaluating the Robustness of Multimodal LLM-Powered Mobile Agents Against Active Environmental Injection Attacks	Feb 18, 2025	Decision Making	—Unverified	0
Value Gradient Sampler: Sampling as Sequential Decision Making	Feb 18, 2025	Anomaly DetectionDecision Making	CodeCode Available	0
Adaptive Tool Use in Large Language Models with Meta-Cognition Trigger	Feb 18, 2025	Decision Making	—Unverified	0
Capturing Human Cognitive Styles with Language: Towards an Experimental Evaluation Paradigm	Feb 18, 2025	Decision Making	—Unverified	0
Adjust for Trust: Mitigating Trust-Induced Inappropriate Reliance on AI Assistance	Feb 18, 2025	Decision Making	—Unverified	0

Show:10 25 50

← PrevPage 94 of 1232Next →

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	SRLA	Average Remaining Cycles	6.4	—	Unverified