SOTAVerified

Sequential Decision Making

Papers

Showing 551–600 of 1210 papers

Title | Status | Hype
Beyond Adaptive Submodularity: Approximation Guarantees of Greedy Policy with Adaptive Submodularity Ratio | | 0
Effective Reward Specification in Deep Reinforcement Learning | | 0
Between Rate-Distortion Theory & Value Equivalence in Model-Based Reinforcement Learning | | 0
A Near-Optimal Best-of-Both-Worlds Algorithm for Online Learning with Feedback Graphs | | 0
Adaptive Rollout Length for Model-Based RL Using Model-Free Deep RL | | 0
Effective Dimension in Bandit Problems under Censorship | | 0
BeTAIL: Behavior Transformer Adversarial Imitation Learning from Human Racing Gameplay | | 0
EARL-BO: Reinforcement Learning for Multi-Step Lookahead, High-Dimensional Bayesian Optimization | | 0
Beta DVBF: Learning State-Space Models for Control from High Dimensional Observations | | 0
An Arm-Wise Randomization Approach to Combinatorial Linear Semi-Bandits | | 0
Dynamics Generalization via Information Bottleneck in Deep Reinforcement Learning | | 0
Dynamic Decision Making for Graphical Models Applied to Oil Exploration | | 0
An Anytime Algorithm for Task and Motion MDPs | | 0
Adaptive Robust Online Portfolio Selection | | 0
Dynamic Bi-Objective Routing of Multiple Vehicles | | 0
Be Considerate: Objectives, Side Effects, and Deciding How to Act | | 0
Bayesian optimization explains human active search | | 0
An Analysis of Frame-skipping in Reinforcement Learning | | 0
DriveGPT: Scaling Autoregressive Behavior Models for Driving | | 0
Doubly Robust Policy Evaluation and Optimization | | 0
Bayesian learning of the optimal action-value function in a Markov decision process | | 0
Doubly Robust Off-policy Value Evaluation for Reinforcement Learning | | 0
Bayesian Inverse Transition Learning for Offline Settings | | 0
Analysis of Thompson Sampling for Partially Observable Contextual Multi-Armed Bandits | | 0
Adaptive Learning Rate for Follow-the-Regularized-Leader: Competitive Analysis and Best-of-Both-Worlds | | 0
A Contextual Bandit Approach for Stream-Based Active Learning | | 0
DOPL: Direct Online Preference Learning for Restless Bandits with Preference Feedback | | 0
Don't Watch Me: A Spatio-Temporal Trojan Attack on Deep-Reinforcement-Learning-Augment Autonomous Driving | | 0
Bayesian Graph Traversal | | 0
Do LLMs Play Dice? Exploring Probability Distribution Sampling in Large Language Models for Behavioral Simulation | | 0
Federated Learning with Uncertainty via Distilled Predictive Distributions | | 0
Divide-and-Conquer Monte Carlo Tree Search | | 0
Divide-and-Conquer Monte Carlo Tree Search For Goal-Directed Planning | | 0
Bayesian Exploration Networks | | 0
Distributional Robustness and Regularization in Reinforcement Learning | | 0
Distributed Optimization via Kernelized Multi-armed Bandits | | 0
Bayesian decision-making under misspecified priors with applications to meta-learning | | 0
A naive aggregation algorithm for improving generalization in a class of learning problems | | 0
Adaptive Sampling for Discovery | | 0
Distributed Online Learning in Social Recommender Systems | | 0
Distributed Multi-Objective Dynamic Offloading Scheduling for Air-Ground Cooperative MEC | | 0
Learning "What-if" Explanations for Sequential Decision-Making | | 0
Distributed Learning: Sequential Decision Making in Resource-Constrained Environments | | 0
Distributed Deep Reinforcement Learning: A Survey and A Multi-Player Multi-Agent Learning Toolbox | | 0
Batched Nonparametric Bandits via k-Nearest Neighbor UCB | | 0
An advantage based policy transfer algorithm for reinforcement learning with measures of transferability | | 0
Discovering an Aid Policy to Minimize Student Evasion Using Offline Reinforcement Learning | | 0
Direct and indirect reinforcement learning | | 0
Batched Neural Bandits | | 0
Adaptive Frontier Exploration on Graphs with Applications to Network-Based Disease Testing | | 0
