SOTAVerified

Decision Making

Papers

Showing 201225 of 12311 papers

TitleStatusHype
Silence is Not Consensus: Disrupting Agreement Bias in Multi-Agent LLMs via Catfish Agent for Clinical Decision Making0
FM-Planner: Foundation Model Guided Path Planning for Autonomous Drone NavigationCode1
A Framework for Adversarial Analysis of Decision Support Systems Prior to Deployment0
AI-Supported Platform for System Monitoring and Decision-Making in Nuclear Waste Management with Large Language Models0
Subtle Risks, Critical Failures: A Framework for Diagnosing Physical Safety of LLMs for Embodied Decision Making0
Divide and Conquer: Grounding LLMs as Efficient Decision-Making Agents via Offline Hierarchical Reinforcement LearningCode2
SwarmThinkers: Learning Physically Consistent Atomic KMC Transitions at Scale0
An Explainable Diagnostic Framework for Neurodegenerative Dementias via Reinforcement-Optimized LLM Reasoning0
Beyond Segmentation: Confidence-Aware and Debiased Estimation of Ratio-based Biomarkers0
Deriving Strategic Market Insights with Large Language Models: A Benchmark for Forward Counterfactual Generation0
Attention! You Vision Language Model Could Be Maliciously Manipulated0
Explanation User Interfaces: A Systematic Literature Review0
Amplifying Human Creativity and Problem Solving with AI Through Generative Collective Intelligence0
Towards Large Reasoning Models for Agriculture0
Structured Reinforcement Learning for Combinatorial Decision-MakingCode1
Learning to Explain: Prototype-Based Surrogate Models for LLM Classification0
A Necessary Step toward Faithfulness: Measuring and Improving Consistency in Free-Text Explanations0
CardioCoT: Hierarchical Reasoning for Multimodal Survival Analysis0
DeCoDe: Defer-and-Complement Decision-Making via Decoupled Concept Bottleneck Models0
OptiMindTune: A Multi-Agent Framework for Intelligent Hyperparameter OptimizationCode0
Effort-aware Fairness: Incorporating a Philosophy-informed, Human-centered Notion of Effort into Algorithmic Fairness Metrics0
Cognitive Biases at Play? Insights from a Bayesian Game Framework0
Retrieval Augmented Decision-Making: A Requirements-Driven, Multi-Criteria Framework for Structured Decision Support0
Pedagogy-R1: Pedagogically-Aligned Reasoning Model with Balanced Educational Benchmark0
DDO: Dual-Decision Optimization via Multi-Agent Collaboration for LLM-Based Medical Consultation0
Show:102550
← PrevPage 9 of 493Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified