SOTAVerified

Decision Making

Papers

Showing 151200 of 12311 papers

TitleStatusHype
Medical World Model: Generative Simulation of Tumor Evolution for Treatment Planning0
From Turbulence to Tranquility: AI-Driven Low-Altitude Network0
Interpretable reinforcement learning for heat pump control through asymmetric differentiable decision trees0
Sparse Imagination for Efficient Visual World Model Planning0
A Graph-Retrieval-Augmented Generation Framework Enhances Decision-Making in the Circular Economy0
Higher-Order Responsibility0
ARIA: Training Language Agents with Intention-Driven Reward Aggregation0
An application of machine learning to the motion response prediction of floating assets0
Speculative Reward Model Boosts Decision Making Ability of LLMs Cost-EffectivelyCode0
World Models for Cognitive Agents: Transforming Edge Intelligence in Future Networks0
MedOrch: Medical Diagnosis with Tool-Augmented Reasoning Agents for Flexible Extensibility0
A Reinforcement Learning-Based Telematic Routing Protocol for the Internet of Underwater Things0
Who Gets the Kidney? Human-AI Alignment, Indecision, and Moral Values0
Performative Risk Control: Calibrating Models for Reliable Deployment under Performativity0
ROAD: Responsibility-Oriented Reward Design for Reinforcement Learning in Autonomous Driving0
Random Rule Forest (RRF): Interpretable Ensembles of LLM-Generated Questions for Predicting Startup Success0
Effects of Theory of Mind and Prosocial Beliefs on Steering Human-Aligned Behaviors of LLMs in Ultimatum GamesCode0
Causal-aware Large Language Models: Enhancing Decision-Making Through Learning, Adapting and ActingCode1
Multi-criteria Rank-based Aggregation for Explainable AICode0
Object Centric Concept Bottlenecks0
Literature Review Of Multi-Agent Debate For Problem-Solving0
DATD3: Depthwise Attention Twin Delayed Deep Deterministic Policy Gradient For Model Free Reinforcement Learning Under Output Feedback Control0
Stable Thompson Sampling: Valid Inference via Variance Inflation0
K^2VAE: A Koopman-Kalman Enhanced Variational AutoEncoder for Probabilistic Time Series ForecastingCode1
Active Layer-Contrastive Decoding Reduces Hallucination in Large Language Model Generation0
CDR-Agent: Intelligent Selection and Execution of Clinical Decision Rules Using Large Language Model AgentsCode0
Bounded-Abstention Pairwise Learning to Rank0
TRAP: Targeted Redirecting of Agentic Preferences0
DiCoFlex: Model-agnostic diverse counterfactuals with flexible control0
Cognitive Guardrails for Open-World Decision Making in Autonomous Drone Swarms0
A Unified Framework for Human AI Collaboration in Security Operations Centers with Trusted Autonomy0
Going from a Representative Agent to Counterfactuals in Combinatorial Choice0
From Connectivity to Autonomy: The Dawn of Self-Evolving Communication Systems0
Second Opinion Matters: Towards Adaptive Clinical AI via the Consensus of Expert Model Ensemble0
Understanding the Information Propagation Effects of Communication Topologies in LLM-based Multi-Agent SystemsCode0
Be.FM: Open Foundation Models for Human Behavior0
DIP-R1: Deep Inspection and Perception with RL Looking Through and Understanding Complex Scenes0
On the Interplay of Privacy, Persuasion and Quantization0
Design and testing of an agent chatbot supporting decision making with public transport data0
Finite-Sample Convergence Bounds for Trust Region Policy Optimization in Mean-Field Games0
HiLDe: Intentional Code Generation via Human-in-the-Loop Decoding0
VIGNETTE: Socially Grounded Bias Evaluation for Vision-Language ModelsCode0
A Large Language Model-Enabled Control Architecture for Dynamic Resource Capability Exploration in Multi-Agent Manufacturing Systems0
AZT1D: A Real-World Dataset for Type 1 Diabetes0
DriveRX: A Vision-Language Reasoning Model for Cross-Task Autonomous Driving0
Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO0
Constructing a bridge between functioning of oscillatory neuronal networks and quantum-like cognition along with quantum-inspired computation and AI0
E2E Process Automation Leveraging Generative AI and IDP-Based Automation Agent: A Case Study on Corporate Expense Processing0
Learning optimal treatment strategies for intraoperative hypotension using deep reinforcement learning0
Silence is Not Consensus: Disrupting Agreement Bias in Multi-Agent LLMs via Catfish Agent for Clinical Decision Making0
Show:102550
← PrevPage 4 of 247Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified