SOTAVerified

Decision Making

Papers

Showing 19512000 of 12311 papers

TitleStatusHype
Human-AI Collaboration in Cloud Security: Cognitive Hierarchy-Driven Deep Reinforcement Learning0
Demand Forecasting for Electric Vehicle Charging Stations using Multivariate Time-Series Analysis0
A Review of Causal Decision Making0
Moving Beyond Medical Exam Questions: A Clinician-Annotated Dataset of Real-World Tasks and Ambiguity in Mental HealthcareCode0
Function-coherent gambles0
Exploring Embodied Multimodal Large Models: Development, Datasets, and Future Directions0
The Evolving Landscape of LLM- and VLM-Integrated Reinforcement Learning0
A Comprehensive Survey on the Trustworthiness of Large Language Models in Healthcare0
Graph Attention Convolutional U-NET: A Semantic Segmentation Model for Identifying Flooded Areas0
A Knowledge Distillation-Based Approach to Enhance Transparency of Classifier ModelsCode0
Interaction-Aware Model Predictive Decision-Making for Socially-Compliant Autonomous Driving in Mixed Urban Traffic Scenarios0
Doctor-in-the-Loop: An Explainable, Multi-View Deep Learning Framework for Predicting Pathological Response in Non-Small Cell Lung Cancer0
Detecting Future-related Contexts of Entity Mentions0
Med-gte-hybrid: A contextual embedding transformer model for extracting actionable information from clinical texts0
Human Misperception of Generative-AI Alignment: A Laboratory Experiment0
Alignment, Agency and Autonomy in Frontier AI: A Systems Engineering Perspective0
Beyond Self-Talk: A Communication-Centric Survey of LLM-Based Multi-Agent Systems0
Mem2Ego: Empowering Vision-Language Models with Global-to-Ego Memory for Long-Horizon Embodied Navigation0
Reinforcement Learning for Ultrasound Image Analysis A Comprehensive Review of Advances and Applications0
Investigating the Impact of LLM Personality on Cognitive Bias Manifestation in Automated Decision-Making Tasks0
MedHallu: A Comprehensive Benchmark for Detecting Medical Hallucinations in Large Language Models0
The Impact and Feasibility of Self-Confidence Shaping for AI-Assisted Decision-Making0
Online detection of forecast model inadequacies using forecast errors0
An Interpretable Machine Learning Approach to Understanding the Relationships between Solar Flares and Source Active Regions0
SPRIG: Stackelberg Perception-Reinforcement Learning with Internal Game Dynamics0
Fighter Jet Navigation and Combat using Deep Reinforcement Learning with Explainable AICode0
LLM-Enhanced Dialogue Management for Full-Duplex Spoken Dialogue Systems0
Why Safeguarded Ships Run Aground? Aligned Large Language Models' Safety Mechanisms Tend to Be Anchored in The Template Region0
RGAR: Recurrence Generation-augmented Retrieval for Factual-aware Medical Question Answering0
MCTS-KBQA: Monte Carlo Tree Search for Knowledge Base Question Answering0
Human-Artificial Interaction in the Age of Agentic AI: A System-Theoretical Approach0
Black Sheep in the Herd: Playing with Spuriously Correlated Attributes for Vision-Language Recognition0
Playing Hex and Counter Wargames using Reinforcement Learning and Recurrent Neural NetworksCode0
AgentCF++: Memory-enhanced LLM-based Agents for Popularity-aware Cross-domain RecommendationsCode0
AEIA-MN: Evaluating the Robustness of Multimodal LLM-Powered Mobile Agents Against Active Environmental Injection Attacks0
Towards Robust and Secure Embodied AI: A Survey on Vulnerabilities and Attacks0
Adjust for Trust: Mitigating Trust-Induced Inappropriate Reliance on AI Assistance0
Value Gradient Sampler: Sampling as Sequential Decision MakingCode0
Capturing Human Cognitive Styles with Language: Towards an Experimental Evaluation Paradigm0
Should I Trust You? Detecting Deception in Negotiations using Counterfactual RL0
Adaptive Tool Use in Large Language Models with Meta-Cognition Trigger0
MindLLM: A Subject-Agnostic and Versatile Model for fMRI-to-Text Decoding0
AI-Assisted Decision Making with Human Learning0
Conditional Max-Sum for Asynchronous Multiagent Decision Making0
LLM Trading: Analysis of LLM Agent Behavior in Experimental Asset Markets0
One for All: A General Framework of LLMs-based Multi-Criteria Decision Making on Human Expert Level0
QoS based resource management for concurrent operation using MCTS0
Human-centered explanation does not fit all: The interplay of sociotechnical, cognitive, and individual factors in the effect AI explanations in algorithmic decision-making0
ExaGPT: Example-Based Machine-Generated Text Detection for Human Interpretability0
DELMAN: Dynamic Defense Against Large Language Model Jailbreaking with Model Editing0
Show:102550
← PrevPage 40 of 247Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified