SOTAVerified

Decision Making

Papers

Showing 12511300 of 12311 papers

TitleStatusHype
Finding signatures of low-dimensional geometric landscapes in high-dimensional cell fate transitionsCode0
VS-Bench: Evaluating VLMs for Strategic Reasoning and Decision-Making in Multi-Agent Environments0
Designing Algorithmic Delegates: The Role of Indistinguishability in Human-AI Handoff0
Improving Performance of Spike-based Deep Q-Learning using Ternary Neurons0
Joint Modeling for Learning Decision-Making Dynamics in Behavioral Experiments0
Medical World Model: Generative Simulation of Tumor Evolution for Treatment Planning0
Sparse Imagination for Efficient Visual World Model Planning0
Interpretable reinforcement learning for heat pump control through asymmetric differentiable decision trees0
From Turbulence to Tranquility: AI-Driven Low-Altitude Network0
Higher-Order Responsibility0
A Graph-Retrieval-Augmented Generation Framework Enhances Decision-Making in the Circular Economy0
An application of machine learning to the motion response prediction of floating assets0
World Models for Cognitive Agents: Transforming Edge Intelligence in Future Networks0
Speculative Reward Model Boosts Decision Making Ability of LLMs Cost-EffectivelyCode0
ARIA: Training Language Agents with Intention-Driven Reward Aggregation0
Random Rule Forest (RRF): Interpretable Ensembles of LLM-Generated Questions for Predicting Startup Success0
Who Gets the Kidney? Human-AI Alignment, Indecision, and Moral Values0
Object Centric Concept Bottlenecks0
Multi-criteria Rank-based Aggregation for Explainable AICode0
MedOrch: Medical Diagnosis with Tool-Augmented Reasoning Agents for Flexible Extensibility0
ROAD: Responsibility-Oriented Reward Design for Reinforcement Learning in Autonomous Driving0
Effects of Theory of Mind and Prosocial Beliefs on Steering Human-Aligned Behaviors of LLMs in Ultimatum GamesCode0
A Reinforcement Learning-Based Telematic Routing Protocol for the Internet of Underwater Things0
Performative Risk Control: Calibrating Models for Reliable Deployment under Performativity0
DIP-R1: Deep Inspection and Perception with RL Looking Through and Understanding Complex Scenes0
Literature Review Of Multi-Agent Debate For Problem-Solving0
CDR-Agent: Intelligent Selection and Execution of Clinical Decision Rules Using Large Language Model AgentsCode0
Understanding the Information Propagation Effects of Communication Topologies in LLM-based Multi-Agent SystemsCode0
Bounded-Abstention Pairwise Learning to Rank0
Cognitive Guardrails for Open-World Decision Making in Autonomous Drone Swarms0
Going from a Representative Agent to Counterfactuals in Combinatorial Choice0
DATD3: Depthwise Attention Twin Delayed Deep Deterministic Policy Gradient For Model Free Reinforcement Learning Under Output Feedback Control0
Second Opinion Matters: Towards Adaptive Clinical AI via the Consensus of Expert Model Ensemble0
Be.FM: Open Foundation Models for Human Behavior0
A Unified Framework for Human AI Collaboration in Security Operations Centers with Trusted Autonomy0
DiCoFlex: Model-agnostic diverse counterfactuals with flexible control0
Active Layer-Contrastive Decoding Reduces Hallucination in Large Language Model Generation0
Stable Thompson Sampling: Valid Inference via Variance Inflation0
TRAP: Targeted Redirecting of Agentic Preferences0
From Connectivity to Autonomy: The Dawn of Self-Evolving Communication Systems0
Finite-Sample Convergence Bounds for Trust Region Policy Optimization in Mean-Field Games0
On the Interplay of Privacy, Persuasion and Quantization0
VIGNETTE: Socially Grounded Bias Evaluation for Vision-Language ModelsCode0
Design and testing of an agent chatbot supporting decision making with public transport data0
A Large Language Model-Enabled Control Architecture for Dynamic Resource Capability Exploration in Multi-Agent Manufacturing Systems0
HiLDe: Intentional Code Generation via Human-in-the-Loop Decoding0
E2E Process Automation Leveraging Generative AI and IDP-Based Automation Agent: A Case Study on Corporate Expense Processing0
AI-Supported Platform for System Monitoring and Decision-Making in Nuclear Waste Management with Large Language Models0
What Data Enables Optimal Decisions? An Exact Characterization for Linear Optimization0
Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO0
Show:102550
← PrevPage 26 of 247Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified