SOTAVerified

Decision Making

Papers

Showing 651-700 of 12311 papers

Title | Status | Hype
Are AI Agents interacting with Online Ads? | - | 0
Accelerating Antibiotic Discovery with Large Language Models and Knowledge Graphs | - | 0
Feature selection strategies for optimized heart disease diagnosis using ML and DL models | - | 0
NeuroSep-CP-LCB: A Deep Learning-based Contextual Multi-armed Bandit Algorithm with Uncertainty Quantification for Early Sepsis Prediction | Code | 0
Limits of trust in medical AI | - | 0
Depth Matters: Multimodal RGB-D Perception for Robust Autonomous Agents | Code | 0
Advancing Mobile GUI Agents: A Verifier-Driven Approach to Practical Deployment | - | 0
Uncertainty Quantification and Confidence Calibration in Large Language Models: A Survey | - | 0
Truthful Elicitation of Imprecise Forecasts | - | 0
Speeding up design and making to reduce time-to-project and time-to-market: an AI-Enhanced approach in engineering education | - | 0
Deferring Concept Bottleneck Models: Learning to Defer Interventions to Inaccurate Experts | - | 0
JARVIS-VLA: Post-Training Large-Scale Vision Language Models to Play Visual Games with Keyboards and Mouse | - | 0
Fin-R1: A Large Language Model for Financial Reasoning through Reinforcement Learning | Code | 4
Information maximization for a broad variety of multi-armed bandit games | - | 0
Large Language Models for Water Distribution Systems Modeling and Decision-Making | - | 0
AIJIM: A Scalable Model for Real-Time AI in Environmental Journalism | - | 0
Bias Evaluation and Mitigation in Retrieval-Augmented Medical Question-Answering Systems | - | 0
Reinforcement Learning Environment with LLM-Controlled Adversary in D&D 5th Edition Combat | - | 0
Empowering Medical Multi-Agents with Clinical Consultation Flow for Dynamic Diagnosis | - | 0
Learning with Expert Abstractions for Efficient Multi-Task Continuous Control | Code | 0
When Pigs Get Sick: Multi-Agent AI for Swine Disease Detection | - | 0
World Models in Artificial Intelligence: Sensing, Learning, and Reasoning Like a Child | - | 0
Multi-Agent Actor-Critic with Harmonic Annealing Pruning for Dynamic Spectrum Access Systems | - | 0
VIPER: Visual Perception and Explainable Reasoning for Sequential Decision-Making | - | 0
Diffusion-Based Forecasting for Uncertainty-Aware Model Predictive Control | - | 0
Dynamic Accumulated Attention Map for Interpreting Evolution of Decision-Making in Vision Transformer | Code | 0
Binary AddiVortes: (Bayesian) Additive Voronoi Tessellations for Binary Classification with an application to Predicting Home Mortgage Application Outcomes | - | 0
Controlling Peak Sharpness in Multimodal Biomolecular Systems via the Chemical Fokker-Planck Equation | - | 0
These Magic Moments: Differentiable Uncertainty Quantification of Radiance Field Models | - | 0
VisEscape: A Benchmark for Evaluating Exploration-driven Decision-making in Virtual Escape Rooms | Code | 1
Empowering LLMs in Decision Games through Algorithmic Data Synthesis | - | 0
MoK-RAG: Mixture of Knowledge Paths Enhanced Retrieval-Augmented Generation for Embodied AI Environments | - | 0
Predicting Human Choice Between Textually Described Lotteries | - | 0
A Parallel Hybrid Action Space Reinforcement Learning Model for Real-world Adaptive Traffic Signal Control | Code | 0
RAD: Retrieval-Augmented Decision-Making of Meta-Actions with Vision-Language Models in Autonomous Driving | - | 0
Stochastic Trajectory Prediction under Unstructured Constraints | - | 0
ADAPT: An Autonomous Forklift for Construction Site Operation | - | 0
A Modular Edge Device Network for Surgery Digitalization | - | 0
Synchronous vs Asynchronous Reinforcement Learning in a Real World Robot | - | 0
Uncovering Utility Functions from Observed Outcomes | - | 0
Robust Decision-Making Via Free Energy Minimization | - | 0
From Autonomous Agents to Integrated Systems, A New Paradigm: Orchestrated Distributed Intelligence | - | 0
Statistical Inference for Weighted Sample Average Approximation in Contextual Stochastic Optimization | - | 0
Optimal compound downselection to promote diversity and parallel chemistry | - | 0
Local-Global Learning of Interpretable Control Policies: The Interface between MPC and Reinforcement Learning | - | 0
MAP: Evaluation and Multi-Agent Enhancement of Large Language Models for Inpatient Pathways | - | 0
A Comprehensive Survey on Multi-Agent Cooperative Decision-Making: Scenarios, Approaches, Challenges and Perspectives | - | 0
Leveraging the Dynamics of Leadership in Group Recommendation Systems | - | 0
A Circular Construction Product Ontology for End-of-Life Decision-Making | - | 0
Identifying Cooperative Personalities in Multi-agent Contexts through Personality Steering with Representation Engineering | - | 0

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | SRLA | Average Remaining Cycles | 6.4 | - | Unverified