SOTAVerified

Decision Making Under Uncertainty

Papers

Showing 150 of 263 papers

TitleStatusHype
Multi-Armed Bandits With Machine Learning-Generated Surrogate Rewards0
Toward Explainable Offline RL: Analyzing Representations in Intrinsically Motivated Decision Transformers0
The Gittins Index: A Design Principle for Decision-Making Under Uncertainty0
Q-ARDNS-Multi: A Multi-Agent Quantum Reinforcement Learning Framework with Meta-Cognitive Adaptation for Complex 3D Environments0
Cognitive Guardrails for Open-World Decision Making in Autonomous Drone Swarms0
Embodied AI with Foundation Models for Mobile Service Robots: A Systematic Review0
Synthetic Time Series Forecasting with Transformer Architectures: Extensive Simulation BenchmarksCode0
Comparing Exploration-Exploitation Strategies of LLMs and Humans: Insights from Standard Multi-armed Bandit TasksCode1
rfPG: Robust Finite-Memory Policy Gradients for Hidden-Model POMDPs0
A Multi-Agent Reinforcement Learning Approach for Cooperative Air-Ground-Human Crowdsensing in Emergency Rescue0
Recovering Event Probabilities from Large Language Model Embeddings via Axiomatic Constraints0
ICNN-enhanced 2SP: Leveraging input convex neural networks for solving two-stage stochastic programmingCode0
Primal-dual algorithm for contextual stochastic combinatorial optimization0
Learning Symbolic Persistent Macro-Actions for POMDP Solving Over Time0
LLM-Guided Probabilistic Program Induction for POMDP Model Estimation0
Measures of Variability for Risk-averse Policy Gradient0
Wasserstein Distributionally Robust Regret Optimization0
Towards More Efficient, Robust, Instance-adaptive, and Generalizable Sequential Decision making0
Efficient Portfolio Selection through Preference Aggregation with Quicksort and the Bradley--Terry Model0
Counterfactual Inference under Thompson Sampling0
A friendly introduction to triangular transportCode1
Active Inference for Energy Control and Planning in Smart Buildings and Communities0
NeuroSep-CP-LCB: A Deep Learning-based Contextual Multi-armed Bandit Algorithm with Uncertainty Quantification for Early Sepsis PredictionCode0
Truthful Elicitation of Imprecise Forecasts0
Statistical Inference for Weighted Sample Average Approximation in Contextual Stochastic Optimization0
Local-Global Learning of Interpretable Control Policies: The Interface between MPC and Reinforcement Learning0
A Weighted Predict-and-Optimize Framework for Power System Operation Considering Varying Impacts of Uncertainty0
Hierarchical Neuro-Symbolic Decision Transformer0
Quantum-Inspired Reinforcement Learning in the Presence of Epistemic Ambivalence0
Map Space Belief Prediction for Manipulation-Enhanced Mapping0
Does Knowledge About Perceptual Uncertainty Help an Agent in Automated Driving?0
Learning Fair Policies for Infectious Diseases Mitigation using Path Integral Control0
Words or Numbers? How Framing Uncertainties Affects Risk Assessment and Decision-Making0
Online Clustering of Dueling Bandits0
Anytime Incremental ρPOMDP Planning in Continuous Spaces0
A Model-free Biomimetics Algorithm for Deterministic Partially Observable Markov Decision Process0
Online MDP with Transition Prototypes: A Robust Adaptive Approach0
Selective Reviews of Bandit Problems in AI via a Statistical View0
Decision Making under the Exponential Family: Distributionally Robust Optimisation with Bayesian Ambiguity Sets0
Explainable Finite-Memory Policies for Partially Observable Markov Decision Processes0
Fair Secretaries with Unfair Predictions0
Hierarchical Upper Confidence Bounds for Constrained Online Learning0
System-Level Analysis of Module Uncertainty Quantification in the Autonomy Pipeline0
Burning RED: Unlocking Subtask-Driven Reinforcement Learning and Risk-Awareness in Average-Reward Markov Decision Processes0
Crafting desirable climate trajectories with RL explored socio-environmental simulationsCode0
EVOLvE: Evaluating and Optimizing LLMs For Exploration0
Functional Clustering of Discount Functions for Behavioral Investor Profiling0
End-to-End Conformal Calibration for Optimization Under UncertaintyCode1
Mitigating optimistic bias in entropic risk estimation and optimization with an application to insurance0
Rao-Blackwellized POMDP Planning0
Show:102550
← PrevPage 1 of 6Next →

No leaderboard results yet.