SOTAVerified

Sequential Decision Making

Papers

Showing 10011025 of 1210 papers

TitleStatusHype
Thompson Sampling for Contextual Bandit Problems with Auxiliary Safety Constraints0
Thompson Sampling via Local UncertaintyCode0
Policy Learning for Malaria ControlCode0
Adaptive Exploration in Linear Contextual Bandit0
Deep Q-Network for Angry BirdsCode0
MABWiser: A Parallelizable Contextual Multi-Armed Bandit Library for PythonCode0
The Choice Function Framework for Online Policy Improvement0
Reinforcement Learning for Multi-Objective Optimization of Online Decisions in High-Dimensional Systems0
Generalizing Reinforcement Learning to Unseen Actions0
Collaborative Inter-agent Knowledge Distillation for Reinforcement Learning0
Learning Functionally Decomposed Hierarchies for Continuous Navigation Tasks0
PROVABLY BENEFITS OF DEEP HIERARCHICAL RL0
Selective Network Discovery via Deep Reinforcement Learning on Embedded Spaces0
Back to the Future -- Sequential Alignment of Text RepresentationsCode0
Classification with Costly Features as a Sequential Decision-Making ProblemCode0
An Arm-Wise Randomization Approach to Combinatorial Linear Semi-Bandits0
Prediction, Consistency, Curvature: Representation Learning for Locally-Linear ControlCode0
Can A User Anticipate What Her Followers Want?0
Interactive Machine Comprehension with Information Seeking AgentsCode0
Reinforcement Learning in Healthcare: A Survey0
Exploring Offline Policy Evaluation for the Continuous-Armed Bandit Problem0
Online Planning for Decentralized Stochastic Control with Partial History Sharing0
Bridging Commonsense Reasoning and Probabilistic Planning via a Probabilistic Action Language0
Reward Learning for Efficient Reinforcement Learning in Extractive Document SummarisationCode0
Bandit Convex Optimization in Non-stationary Environments0
Show:102550
← PrevPage 41 of 49Next →

No leaderboard results yet.