SOTAVerified

Decision Making Under Uncertainty

Papers

Showing 101150 of 263 papers

TitleStatusHype
Learning Successor Features with Distributed Hebbian Temporal Memory0
Learning Symbolic Persistent Macro-Actions for POMDP Solving Over Time0
Leaving the Nest: Going Beyond Local Loss Functions for Predict-Then-Optimize0
Legion: Best-First Concolic Testing0
Linear Stochastic Bandits over a Bit-Constrained Channel0
LLM-Guided Probabilistic Program Induction for POMDP Model Estimation0
Local-Global Learning of Interpretable Control Policies: The Interface between MPC and Reinforcement Learning0
Macro-Action-Based Deep Multi-Agent Reinforcement Learning0
Macro-Action-Based Multi-Agent/Robot Deep Reinforcement Learning under Partial Observability0
Map Space Belief Prediction for Manipulation-Enhanced Mapping0
Measures of Variability for Risk-averse Policy Gradient0
Modeling Boundedly Rational Agents with Latent Inference Budgets0
Modelling Dynamic Interactions between Relevance Dimensions0
Multi-Armed Bandits With Machine Learning-Generated Surrogate Rewards0
Multicriteria Group Decision-Making Under Uncertainty Using Interval Data and Cloud Models0
Natural Language Generation enhances human decision-making with uncertain information0
No DBA? No regret! Multi-armed bandits for index tuning of analytical and HTAP workloads with provable guarantees0
(Non-)Commutative Aggregation0
Non-stationary Delayed Combinatorial Semi-Bandit with Causally Related Rewards0
Novel Exploration Techniques (NETs) for Malaria Policy Interventions0
Observation-Augmented Contextual Multi-Armed Bandits for Robotic Search and Exploration0
On Algorithmic Decision Procedures in Emergency Response Systems in Smart and Connected Communities0
On a notion of independence proposed by Teddy Seidenfeld0
On Hashing-Based Approaches to Approximate DNF-Counting0
Online Clustering of Dueling Bandits0
Online MDP with Transition Prototypes: A Robust Adaptive Approach0
Online Planning Algorithms for POMDPs0
Online POMDP Planning with Anytime Deterministic Optimality Guarantees0
On the Expressivity of Multidimensional Markov Reward0
Optimal Immunization Policy Using Dynamic Programming0
Optimal Sensing via Multi-armed Bandit Relaxations in Mixed Observability Domains0
Optimization under Uncertainty in the Era of Big Data and Deep Learning: When Machine Learning Meets Mathematical Programming0
Parallelizing Contextual Bandits0
Partial Law Invariance and Risk Measures0
Partially Observable Stochastic Games with Neural Perception Mechanisms0
Playing against Nature: causal discovery for decision making under uncertainty0
Point-Based Value Iteration for POMDPs with Neural Perception Mechanisms0
Mean-Variance Efficient Reinforcement Learning with Applications to Dynamic Financial Investment0
A Posteriori Probabilistic Bounds of Convex Scenario Programs with Validation Tests0
Practical Bandits: An Industry Perspective0
Learning-to-defer for sequential medical decision-making under uncertainty0
Reference Points, Risk-Taking Behavior, and Competitive Outcomes in Sequential Settings0
Primal-dual algorithm for contextual stochastic combinatorial optimization0
Probabilistic Demand Forecasting with Graph Neural Networks0
Probabilistic Loss and its Online Characterization for Simplified Decision Making Under Uncertainty0
Probability Tools for Sequential Random Projection0
Proofs for the New Definitions in Financial Markets0
Q-ARDNS-Multi: A Multi-Agent Quantum Reinforcement Learning Framework with Meta-Cognitive Adaptation for Complex 3D Environments0
Quantum Circuit Components for Cognitive Decision-Making0
Quantum-Inspired Reinforcement Learning in the Presence of Epistemic Ambivalence0
Show:102550
← PrevPage 3 of 6Next →

No leaderboard results yet.