SOTAVerified

Decision Making

Papers

Showing 201250 of 12311 papers

TitleStatusHype
Learning optimal treatment strategies for intraoperative hypotension using deep reinforcement learning0
FM-Planner: Foundation Model Guided Path Planning for Autonomous Drone NavigationCode1
AI-Supported Platform for System Monitoring and Decision-Making in Nuclear Waste Management with Large Language Models0
A Framework for Adversarial Analysis of Decision Support Systems Prior to Deployment0
Subtle Risks, Critical Failures: A Framework for Diagnosing Physical Safety of LLMs for Embodied Decision Making0
Divide and Conquer: Grounding LLMs as Efficient Decision-Making Agents via Offline Hierarchical Reinforcement LearningCode2
SwarmThinkers: Learning Physically Consistent Atomic KMC Transitions at Scale0
Deriving Strategic Market Insights with Large Language Models: A Benchmark for Forward Counterfactual Generation0
An Explainable Diagnostic Framework for Neurodegenerative Dementias via Reinforcement-Optimized LLM Reasoning0
Attention! You Vision Language Model Could Be Maliciously Manipulated0
Beyond Segmentation: Confidence-Aware and Debiased Estimation of Ratio-based Biomarkers0
Explanation User Interfaces: A Systematic Literature Review0
Amplifying Human Creativity and Problem Solving with AI Through Generative Collective Intelligence0
Towards Large Reasoning Models for Agriculture0
Structured Reinforcement Learning for Combinatorial Decision-MakingCode1
Learning to Explain: Prototype-Based Surrogate Models for LLM Classification0
A Necessary Step toward Faithfulness: Measuring and Improving Consistency in Free-Text Explanations0
CardioCoT: Hierarchical Reasoning for Multimodal Survival Analysis0
Effort-aware Fairness: Incorporating a Philosophy-informed, Human-centered Notion of Effort into Algorithmic Fairness Metrics0
OptiMindTune: A Multi-Agent Framework for Intelligent Hyperparameter OptimizationCode0
DeCoDe: Defer-and-Complement Decision-Making via Decoupled Concept Bottleneck Models0
Cognitive Biases at Play? Insights from a Bayesian Game Framework0
Retrieval Augmented Decision-Making: A Requirements-Driven, Multi-Criteria Framework for Structured Decision Support0
Pedagogy-R1: Pedagogically-Aligned Reasoning Model with Balanced Educational Benchmark0
Finite-Time Global Optimality Convergence in Deep Neural Actor-Critic Methods for Decentralized Multi-Agent Reinforcement Learning0
DDO: Dual-Decision Optimization via Multi-Agent Collaboration for LLM-Based Medical Consultation0
Marginal Fairness: Fair Decision-Making under Risk Measures0
EdgeAgentX: A Novel Framework for Agentic AI at the Edge in Military Communication Networks0
Misaligning Reasoning with Answers -- A Framework for Assessing LLM CoT Robustness0
Development of Interactive Nomograms for Predicting Short-Term Survival in ICU Patients with Aplastic Anemia0
KL-regularization Itself is Differentially Private in Bandits and RLHF0
EVM-Fusion: An Explainable Vision Mamba Architecture with Neural Algorithmic Fusion0
Learning Representational Disparities0
Towards Uncertainty Aware Task Delegation and Human-AI Collaborative Decision-Making0
VIBE: Video-to-Text Information Bottleneck Evaluation for TL;DRCode0
An Outlook on the Opportunities and Challenges of Multi-Agent AI Systems0
Personalizing Student-Agent Interactions Using Log-Contextualized Retrieval Augmented Generation (RAG)0
Semantic-Aware Interpretable Multimodal Music Auto-TaggingCode0
Multi-Objective Optimization Algorithms for Energy Management Systems in Microgrids: A Control Strategy Based on a PHIL System0
SOLVE: Synergy of Language-Vision and End-to-End Networks for Autonomous Driving0
Sequential Monte Carlo for Policy Optimization in Continuous POMDPs0
Fuzzy Information Evolution with Three-Way Decision in Social Network Group Decision-Making0
Unlocking Smarter Device Control: Foresighted Planning with a World Model-Driven Code Execution Approach0
SoccerChat: Integrating Multimodal Data for Enhanced Soccer Game UnderstandingCode0
No Black Boxes: Interpretable and Interactable Predictive Healthcare with Knowledge-Enhanced Agentic Causal Discovery0
AppealCase: A Dataset and Benchmark for Civil Case Appeal ScenariosCode0
ViQAgent: Zero-Shot Video Question Answering via Agent with Open-Vocabulary Grounding ValidationCode0
Explaining Puzzle Solutions in Natural Language: An Exploratory Study on 6x6 Sudoku0
Agentic Feature Augmentation: Unifying Selection and Generation with Teaming, Planning, and Memories0
Finding separatrices of dynamical flows with Deep Koopman Eigenfunctions0
Show:102550
← PrevPage 5 of 247Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified