SOTAVerified

Decision Making

Papers

Showing 176200 of 12311 papers

TitleStatusHype
CDR-Agent: Intelligent Selection and Execution of Clinical Decision Rules Using Large Language Model AgentsCode0
Bounded-Abstention Pairwise Learning to Rank0
TRAP: Targeted Redirecting of Agentic Preferences0
DiCoFlex: Model-agnostic diverse counterfactuals with flexible control0
Cognitive Guardrails for Open-World Decision Making in Autonomous Drone Swarms0
A Unified Framework for Human AI Collaboration in Security Operations Centers with Trusted Autonomy0
Going from a Representative Agent to Counterfactuals in Combinatorial Choice0
From Connectivity to Autonomy: The Dawn of Self-Evolving Communication Systems0
Second Opinion Matters: Towards Adaptive Clinical AI via the Consensus of Expert Model Ensemble0
Understanding the Information Propagation Effects of Communication Topologies in LLM-based Multi-Agent SystemsCode0
Be.FM: Open Foundation Models for Human Behavior0
DIP-R1: Deep Inspection and Perception with RL Looking Through and Understanding Complex Scenes0
On the Interplay of Privacy, Persuasion and Quantization0
Design and testing of an agent chatbot supporting decision making with public transport data0
Finite-Sample Convergence Bounds for Trust Region Policy Optimization in Mean-Field Games0
HiLDe: Intentional Code Generation via Human-in-the-Loop Decoding0
VIGNETTE: Socially Grounded Bias Evaluation for Vision-Language ModelsCode0
A Large Language Model-Enabled Control Architecture for Dynamic Resource Capability Exploration in Multi-Agent Manufacturing Systems0
AZT1D: A Real-World Dataset for Type 1 Diabetes0
DriveRX: A Vision-Language Reasoning Model for Cross-Task Autonomous Driving0
Active-O3: Empowering Multimodal Large Language Models with Active Perception via GRPO0
Constructing a bridge between functioning of oscillatory neuronal networks and quantum-like cognition along with quantum-inspired computation and AI0
E2E Process Automation Leveraging Generative AI and IDP-Based Automation Agent: A Case Study on Corporate Expense Processing0
Learning optimal treatment strategies for intraoperative hypotension using deep reinforcement learning0
Silence is Not Consensus: Disrupting Agreement Bias in Multi-Agent LLMs via Catfish Agent for Clinical Decision Making0
Show:102550
← PrevPage 8 of 493Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified