SOTAVerified

Decision Making Under Uncertainty

Papers

Showing 150 of 263 papers

TitleStatusHype
Comparing Exploration-Exploitation Strategies of LLMs and Humans: Insights from Standard Multi-armed Bandit TasksCode1
A friendly introduction to triangular transportCode1
End-to-End Conformal Calibration for Optimization Under UncertaintyCode1
Adaptive Two-Stage Cloud Resource Scaling via Hierarchical Multi-Indicator Forecasting and Bayesian Decision-MakingCode1
Explaining Predictive Uncertainty with Information Theoretic Shapley ValuesCode1
Uncertainty Quantification for Image-based Traffic Prediction across CitiesCode1
Collective Intelligence in Human-AI Teams A Bayesian Theory of Mind ApproachCode1
Bayesian Optimization with Conformal Prediction SetsCode1
jsdp: a Java Stochastic DP LibraryCode1
xView3-SAR: Detecting Dark Fishing Activity Using Synthetic Aperture Radar ImageryCode1
Neur2SP: Neural Two-Stage Stochastic ProgrammingCode1
Deep Reinforcement Learning for Time Allocation and Directional Transmission in Joint Radar-CommunicationCode1
Emulation of physical processes with EmukitCode1
Counterfactual Explanations in Sequential Decision Making Under UncertaintyCode1
Unifying Cardiovascular Modelling with Deep Reinforcement Learning for Uncertainty Aware Control of Sepsis TreatmentCode1
An empirical evaluation of active inference in multi-armed banditsCode1
Encoding the latent posterior of Bayesian Neural Networks for uncertainty quantificationCode1
MAGIC: Learning Macro-Actions for Online POMDP PlanningCode1
Bayesian Optimization of Risk MeasuresCode1
Dynamic Multi-Robot Task Allocation under Uncertainty and Temporal ConstraintsCode1
Curating a COVID-19 data repository and forecasting county-level death counts in the United StatesCode1
Approximate Inference in Discrete Distributions with Monte Carlo Tree Search and Value FunctionsCode1
Reinforcement Learning for Temporal Logic Control Synthesis with Probabilistic Satisfaction GuaranteesCode1
Certified Reinforcement Learning with Logic GuidanceCode1
Logically-Constrained Reinforcement LearningCode1
Multi-Armed Bandits With Machine Learning-Generated Surrogate Rewards0
Toward Explainable Offline RL: Analyzing Representations in Intrinsically Motivated Decision Transformers0
The Gittins Index: A Design Principle for Decision-Making Under Uncertainty0
Q-ARDNS-Multi: A Multi-Agent Quantum Reinforcement Learning Framework with Meta-Cognitive Adaptation for Complex 3D Environments0
Cognitive Guardrails for Open-World Decision Making in Autonomous Drone Swarms0
Synthetic Time Series Forecasting with Transformer Architectures: Extensive Simulation BenchmarksCode0
Embodied AI with Foundation Models for Mobile Service Robots: A Systematic Review0
rfPG: Robust Finite-Memory Policy Gradients for Hidden-Model POMDPs0
A Multi-Agent Reinforcement Learning Approach for Cooperative Air-Ground-Human Crowdsensing in Emergency Rescue0
Recovering Event Probabilities from Large Language Model Embeddings via Axiomatic Constraints0
ICNN-enhanced 2SP: Leveraging input convex neural networks for solving two-stage stochastic programmingCode0
Primal-dual algorithm for contextual stochastic combinatorial optimization0
Learning Symbolic Persistent Macro-Actions for POMDP Solving Over Time0
LLM-Guided Probabilistic Program Induction for POMDP Model Estimation0
Wasserstein Distributionally Robust Regret Optimization0
Measures of Variability for Risk-averse Policy Gradient0
Towards More Efficient, Robust, Instance-adaptive, and Generalizable Sequential Decision making0
Efficient Portfolio Selection through Preference Aggregation with Quicksort and the Bradley--Terry Model0
Counterfactual Inference under Thompson Sampling0
Active Inference for Energy Control and Planning in Smart Buildings and Communities0
Truthful Elicitation of Imprecise Forecasts0
NeuroSep-CP-LCB: A Deep Learning-based Contextual Multi-armed Bandit Algorithm with Uncertainty Quantification for Early Sepsis PredictionCode0
Local-Global Learning of Interpretable Control Policies: The Interface between MPC and Reinforcement Learning0
Statistical Inference for Weighted Sample Average Approximation in Contextual Stochastic Optimization0
A Weighted Predict-and-Optimize Framework for Power System Operation Considering Varying Impacts of Uncertainty0
Show:102550
← PrevPage 1 of 6Next →

No leaderboard results yet.