SOTAVerified

Sequential Decision Making

Papers

Showing 11011150 of 1210 papers

TitleStatusHype
UCBoost: A Boosting Approach to Tame Complexity and Optimality for Stochastic Bandits0
Policy Gradient With Value Function Approximation For Collective Multiagent Planning0
Hindsight is Only 50/50: Unsuitability of MDP based Approximate POMDP Solvers for Multi-resolution Information Gathering0
Accelerating E-Commerce Search Engine Ranking by Contextual Factor Selection0
Hierarchical Imitation and Reinforcement Learning0
Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson SamplingCode0
Novel Approaches to Accelerating the Convergence Rate of Markov Decision Process for Search Result Diversification0
Structured Control Nets for Deep Reinforcement LearningCode0
An Anytime Algorithm for Task and Motion MDPs0
MPC-Inspired Neural Network Policies for Sequential Decision Making0
Decomposition Methods with Deep Corrections for Reinforcement LearningCode0
Understanding Human Behaviors in Crowds by Imitating the Decision-Making Process0
Testing Optimality of Sequential Decision-Making0
Learning Structural Weight Uncertainty for Sequential Decision-MakingCode0
Deep Reinforcement Learning for Unsupervised Video Summarization with Diversity-Representativeness RewardCode0
Multi-shot Pedestrian Re-identification via Sequential Decision Making0
Learning Multi-Level Hierarchies with HindsightCode1
Q-LDA: Uncovering Latent Patterns in Text-based Sequential Decision Processes0
SkipNet: Learning Dynamic Routing in Convolutional NetworksCode1
Classification with Costly Features using Deep Reinforcement LearningCode0
Loss Functions for Multiset Prediction0
Servant of Many Masters: Shifting priorities in Pareto-optimal sequential decision-making0
How Should a Robot Assess Risk? Towards an Axiomatic Theory of Risk in Robotics0
Hierarchical State Abstractions for Decision-Making Problems with Computational Constraints0
Asymmetric Actor Critic for Image-Based Robot Learning0
Detecting Adversarial Attacks on Neural Network Policies with Visual ForesightCode0
A2-RL: Aesthetics Aware Reinforcement Learning for Image CroppingCode0
Optimal Learning for Sequential Decision Making for Expensive Cost Functions with Stochastic Binary Feedbacks0
Safety-Aware Algorithms for Adversarial Contextual Bandit0
Non-Stationary Bandits with Habituation and Recovery Dynamics0
Learning for Multi-robot Cooperation in Partially Observable Stochastic Environments with Macro-actions0
Learning model-based planning from scratchCode0
Correlational Dueling Bandits with Application to Clinical Treatment in Large Decision Spaces0
Tableaux for Policy Synthesis for MDPs with PCTL* Constraints0
The Theory is Predictive, but is it Complete? An Application to Human Perception of Randomness0
Unlocking the Potential of Simulators: Design with RL in Mind0
A method for the online construction of the set of states of a Markov Decision Process using Answer Set Programming0
Boltzmann Exploration Done Right0
Thinking Fast and Slow with Deep Learning and Tree SearchCode1
Learning to Mix n-Step Returns: Generalizing lambda-Returns for Deep Reinforcement Learning0
Answer Set Programming for Non-Stationary Markov Decision Processes0
On Improving Deep Reinforcement Learning for POMDPsCode0
Using Reinforcement Learning for Demand Response of Domestic Hot Water Buffers: a Real-Life Demonstration0
Minimizing Maximum Regret in Commitment Constrained Sequential Decision Making0
Deep Robust Kalman Filter0
Deeply AggreVaTeD: Differentiable Imitation Learning for Sequential Prediction0
Active Learning for Accurate Estimation of Linear Models0
Tight Bounds for Bandit Combinatorial Optimization0
Learning to Repeat: Fine Grained Action Repetition for Deep Reinforcement Learning0
Deep Reinforcement Learning for Visual Object Tracking in Videos0
Show:102550
← PrevPage 23 of 25Next →

No leaderboard results yet.