SOTAVerified

Decision Making

Papers

Showing 5175 of 12311 papers

TitleStatusHype
Towards AI Search Paradigm0
TransDreamerV3: Implanting Transformer In DreamerV3Code0
UProp: Investigating the Uncertainty Propagation of LLMs in Multi-Step Agentic Decision-MakingCode0
Social Group Bias in AI Finance0
Formal Control for Uncertain Systems via Contract-Based Probabilistic Surrogates (Extended Version)0
Cash or Comfort? How LLMs Value Your InconvenienceCode0
Learning in Random Utility Models Via Online Decision Problems0
CF-Seg: Counterfactuals meet Segmentation0
BIDA: A Bi-level Interaction Decision-making Algorithm for Autonomous Vehicles in Dynamic Traffic Scenarios0
The Role of Explanation Styles and Perceived Accuracy on Decision Making in Predictive Process Monitoring0
Large Language Models are Near-Optimal Decision-Makers with a Non-Human Learning BehaviorCode1
Make Your AUV Adaptive: An Environment-Aware Reinforcement Learning Framework For Underwater Tasks0
An Empirical Study of Bugs in Data Visualization Libraries0
Adaptive Action Duration with Contextual Bandits for Deep Reinforcement Learning in Dynamic EnvironmentsCode0
Preparing for the Intelligence Explosion0
Hypothesis Testing for Quantifying LLM-Human Misalignment in Multiple Choice Settings0
Fragile Preferences: A Deep Dive Into Order Effects in Large Language Models0
Enclosing Prototypical Variational Autoencoder for Explainable Out-of-Distribution Detection0
Situational-Constrained Sequential Resources Allocation via Reinforcement Learning0
Bayesian Hybrid Machine Learning of Gallstone Risk0
ADRD: LLM-Driven Autonomous Driving Based on Rule-based Decision Systems0
Mxplainer: Explain and Learn Insights by Imitating Mahjong AgentsCode0
Automated Decision-Making on Networks with LLMs through Knowledge-Guided Evolution0
Toward Safety-First Human-Like Decision Making for Autonomous Vehicles in Time-Varying Traffic Flow0
Towards Reliable WMH Segmentation under Domain Shift: An Application Study using Maximum Entropy Regularization to Improve Uncertainty Estimation0
Show:102550
← PrevPage 3 of 493Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1SRLAAverage Remaining Cycles6.4Unverified