SOTAVerified

Decision Making

Papers

Showing 19511975 of 12311 papers

TitleStatusHype
Moving Beyond Medical Exam Questions: A Clinician-Annotated Dataset of Real-World Tasks and Ambiguity in Mental HealthcareCode0
Function-coherent gambles0
Demand Forecasting for Electric Vehicle Charging Stations using Multivariate Time-Series Analysis0
Risk-Averse Reinforcement Learning: An Optimal Transport Perspective on Temporal Difference LearningCode0
A Review of Causal Decision Making0
Interaction-Aware Model Predictive Decision-Making for Socially-Compliant Autonomous Driving in Mixed Urban Traffic Scenarios0
Detecting Future-related Contexts of Entity Mentions0
A Comprehensive Survey on the Trustworthiness of Large Language Models in Healthcare0
Graph Attention Convolutional U-NET: A Semantic Segmentation Model for Identifying Flooded Areas0
The Evolving Landscape of LLM- and VLM-Integrated Reinforcement Learning0
Doctor-in-the-Loop: An Explainable, Multi-View Deep Learning Framework for Predicting Pathological Response in Non-Small Cell Lung Cancer0
A Knowledge Distillation-Based Approach to Enhance Transparency of Classifier ModelsCode0
Exploring Embodied Multimodal Large Models: Development, Datasets, and Future Directions0
Med-gte-hybrid: A contextual embedding transformer model for extracting actionable information from clinical texts0
Reinforcement Learning for Ultrasound Image Analysis A Comprehensive Review of Advances and Applications0
Mem2Ego: Empowering Vision-Language Models with Global-to-Ego Memory for Long-Horizon Embodied Navigation0
MedHallu: A Comprehensive Benchmark for Detecting Medical Hallucinations in Large Language Models0
Alignment, Agency and Autonomy in Frontier AI: A Systems Engineering Perspective0
Investigating the Impact of LLM Personality on Cognitive Bias Manifestation in Automated Decision-Making Tasks0
SPRIG: Stackelberg Perception-Reinforcement Learning with Internal Game Dynamics0
Human Misperception of Generative-AI Alignment: A Laboratory Experiment0
Online detection of forecast model inadequacies using forecast errors0
The Impact and Feasibility of Self-Confidence Shaping for AI-Assisted Decision-Making0
Beyond Self-Talk: A Communication-Centric Survey of LLM-Based Multi-Agent Systems0
An Interpretable Machine Learning Approach to Understanding the Relationships between Solar Flares and Source Active Regions0
Show:102550
← PrevPage 79 of 493Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified