SOTAVerified

Sequential Decision Making

Papers

Showing 601650 of 1210 papers

TitleStatusHype
Blessing from Human-AI Interaction: Super Reinforcement Learning in Confounded Environments0
Modeling driver's evasive behavior during safety-critical lane changes:Two-dimensional time-to-collision and deep reinforcement learning0
Optimistic MLE -- A Generic Model-based Algorithm for Partially Observable Sequential Decision Making0
Reinforcement Learning with Non-Exponential Discounting0
On Efficient Online Imitation Learning via Classification0
Deep Reinforcement Learning for Adaptive Mesh Refinement0
Non-monotonic Resource Utilization in the Bandits with Knapsacks ProblemCode0
Graph Neural Networks for Multi-Robot Active Information Acquisition0
SCALES: From Fairness Principles to Constrained Decision-MakingCode0
Batch Bayesian optimisation via density-ratio estimation with guaranteesCode0
Thompson Sampling with Virtual Helping Agents0
A Survey on Large-Population Systems and Scalable Multi-Agent Reinforcement Learning0
Sequential Information Design: Learning to Persuade in the Dark0
MetaTrader: An Reinforcement Learning Approach Integrating Diverse Policies for Portfolio Optimization0
Federated Online Clustering of BanditsCode0
JARVIS: A Neuro-Symbolic Commonsense Reasoning Framework for Conversational Embodied Agents0
Entropy Regularization for Population Estimation0
Sampling Through the Lens of Sequential Decision Making0
Streaming Adaptive Submodular Maximization0
Understanding the stochastic dynamics of sequential decision-making processes: A path-integral analysis of multi-armed bandits0
Deep VULMAN: A Deep Reinforcement Learning-Enabled Cyber Vulnerability Management Framework0
A Bayesian Approach to Learning Bandit Structure in Markov Decision Processes0
Sample-efficient Safe Learning for Online Nonlinear Control with Control Barrier Functions0
Branch Ranking for Efficient Mixed-Integer Programming via Offline Ranking-based Policy Learning0
Partial-Monotone Adaptive Submodular Maximization0
Adaptive Asynchronous Control Using Meta-learned Neural Ordinary Differential Equations0
High dimensional stochastic linear contextual bandit with missing covariates0
Learn Continuously, Act Discretely: Hybrid Action-Space Reinforcement Learning For Optimal Execution0
Strategising template-guided needle placement for MR-targeted prostate biopsy0
Delayed Feedback in Generalised Linear Bandits Revisited0
Online Learning with Off-Policy Feedback0
Guaranteed Discovery of Control-Endogenous Latent States with Multi-Step Inverse Models0
Hindsight Learning for MDPs with Exogenous InputsCode0
Contextual Bandits with Large Action Spaces: Made PracticalCode0
Scaling up ML-based Black-box Planning with Partial STRIPS Models0
Improving saliency models' predictions of the next fixation with humans' intrinsic cost of gaze shifts0
Transformer Neural Processes: Uncertainty-Aware Meta Learning Via Sequence ModelingCode1
Learning Optimal Solutions via an LSTM-Optimization Framework0
Reinforcement Learning Approaches for the Orienteering Problem with Stochastic and Dynamic Release Dates0
Reinforcement Learning Based Dynamic Model Combination for Time Series Forecasting0
Utility Theory for Sequential Decision Making0
Estimating Link Flows in Road Networks with Synthetic Trajectory Data Generation: Reinforcement Learning-based Approaches0
A Survey on Model-based Reinforcement Learning0
Federated Learning with Uncertainty via Distilled Predictive Distributions0
Interactively Learning Preference Constraints in Linear BanditsCode0
Adaptive Rollout Length for Model-Based RL Using Model-Free Deep RL0
Convergence and sample complexity of natural policy gradient primal-dual methods for constrained MDPs0
Deciding What to Model: Value-Equivalent Sampling for Reinforcement Learning0
Interpolating Between Softmax Policy Gradient and Neural Replicator Dynamics with Capped Implicit Exploration0
Between Rate-Distortion Theory & Value Equivalence in Model-Based Reinforcement Learning0
Show:102550
← PrevPage 13 of 25Next →

No leaderboard results yet.