SOTAVerified

Decision Making Under Uncertainty

Papers

Showing 125 of 263 papers

TitleStatusHype
Multi-Armed Bandits With Machine Learning-Generated Surrogate Rewards0
Toward Explainable Offline RL: Analyzing Representations in Intrinsically Motivated Decision Transformers0
The Gittins Index: A Design Principle for Decision-Making Under Uncertainty0
Q-ARDNS-Multi: A Multi-Agent Quantum Reinforcement Learning Framework with Meta-Cognitive Adaptation for Complex 3D Environments0
Cognitive Guardrails for Open-World Decision Making in Autonomous Drone Swarms0
Embodied AI with Foundation Models for Mobile Service Robots: A Systematic Review0
Synthetic Time Series Forecasting with Transformer Architectures: Extensive Simulation BenchmarksCode0
Comparing Exploration-Exploitation Strategies of LLMs and Humans: Insights from Standard Multi-armed Bandit TasksCode1
rfPG: Robust Finite-Memory Policy Gradients for Hidden-Model POMDPs0
A Multi-Agent Reinforcement Learning Approach for Cooperative Air-Ground-Human Crowdsensing in Emergency Rescue0
Recovering Event Probabilities from Large Language Model Embeddings via Axiomatic Constraints0
ICNN-enhanced 2SP: Leveraging input convex neural networks for solving two-stage stochastic programmingCode0
Primal-dual algorithm for contextual stochastic combinatorial optimization0
Learning Symbolic Persistent Macro-Actions for POMDP Solving Over Time0
LLM-Guided Probabilistic Program Induction for POMDP Model Estimation0
Measures of Variability for Risk-averse Policy Gradient0
Wasserstein Distributionally Robust Regret Optimization0
Towards More Efficient, Robust, Instance-adaptive, and Generalizable Sequential Decision making0
Efficient Portfolio Selection through Preference Aggregation with Quicksort and the Bradley--Terry Model0
Counterfactual Inference under Thompson Sampling0
A friendly introduction to triangular transportCode1
Active Inference for Energy Control and Planning in Smart Buildings and Communities0
NeuroSep-CP-LCB: A Deep Learning-based Contextual Multi-armed Bandit Algorithm with Uncertainty Quantification for Early Sepsis PredictionCode0
Truthful Elicitation of Imprecise Forecasts0
Local-Global Learning of Interpretable Control Policies: The Interface between MPC and Reinforcement Learning0
Show:102550
← PrevPage 1 of 11Next →

No leaderboard results yet.