Decorrelated Double Q-learning Jun 12, 2020 continuous-control Continuous Control
— Unverified 0Robustness to Adversarial Attacks in Learning-Enabled Controllers Jun 11, 2020 continuous-control Continuous Control
— Unverified 0Zeroth-Order Supervised Policy Improvement Jun 11, 2020 continuous-control Continuous Control
— Unverified 0What Matters In On-Policy Reinforcement Learning? A Large-Scale Empirical Study Jun 10, 2020 Attribute continuous-control
Code Code Available 1AR-DAE: Towards Unbiased Neural Entropy Gradient Estimation Jun 9, 2020 continuous-control Continuous Control
Code Code Available 1Variational Model-based Policy Optimization Jun 9, 2020 continuous-control Continuous Control
— Unverified 0Primal Wasserstein Imitation Learning Jun 8, 2020 continuous-control Continuous Control
Code Code Available 0Conservative Q-Learning for Offline Reinforcement Learning Jun 8, 2020 continuous-control Continuous Control
Code Code Available 1Dual Policy Distillation Jun 7, 2020 continuous-control Continuous Control
Code Code Available 0Prediction and Generalisation over Directed Actions by Grid Cells Jun 5, 2020 continuous-control Continuous Control
Code Code Available 0Wasserstein Distance guided Adversarial Imitation Learning with Reward Shape Exploration Jun 5, 2020 continuous-control Continuous Control
Code Code Available 1Meta-Model-Based Meta-Policy Optimization Jun 4, 2020 continuous-control Continuous Control
— Unverified 0Refined Continuous Control of DDPG Actors via Parametrised Activation Jun 4, 2020 continuous-control Continuous Control
— Unverified 0Deep Reinforcement learning for real autonomous mobile robot navigation in indoor environments May 28, 2020 continuous-control Continuous Control
Code Code Available 1MOPO: Model-based Offline Policy Optimization May 27, 2020 continuous-control Continuous Control
Code Code Available 1Gradient Monitored Reinforcement Learning May 25, 2020 Atari Games continuous-control
— Unverified 0Decentralized Deep Reinforcement Learning for a Distributed and Adaptive Locomotion Controller of a Hexapod Robot May 21, 2020 continuous-control Continuous Control
Code Code Available 1Mirror Descent Policy Optimization May 20, 2020 continuous-control Continuous Control
Code Code Available 1Language Conditioned Imitation Learning over Unstructured Data May 15, 2020 continuous-control Continuous Control
— Unverified 0Unbiased Deep Reinforcement Learning: A General Training Framework for Existing and Future Algorithms May 12, 2020 continuous-control Continuous Control
— Unverified 0Smooth Exploration for Robotic Reinforcement Learning May 12, 2020 continuous-control Continuous Control
Code Code Available 2Delay-Aware Model-Based Reinforcement Learning for Continuous Control May 11, 2020 continuous-control Continuous Control
Code Code Available 1Controlling Overestimation Bias with Truncated Mixture of Continuous Distributional Quantile Critics May 8, 2020 continuous-control Continuous Control
— Unverified 0Off-Policy Adversarial Inverse Reinforcement Learning May 3, 2020 continuous-control Continuous Control
Code Code Available 1Disagreement-Regularized Imitation Learning May 1, 2020 continuous-control Continuous Control
Code Code Available 1Option Discovery using Deep Skill Chaining May 1, 2020 continuous-control Continuous Control
Code Code Available 1Keep Doing What Worked: Behavior Modelling Priors for Offline Reinforcement Learning May 1, 2020 continuous-control Continuous Control
— Unverified 0Toward Evaluating Robustness of Deep Reinforcement Learning with Continuous Control May 1, 2020 continuous-control Continuous Control
— Unverified 0DSAC: Distributional Soft Actor Critic for Risk-Sensitive Reinforcement Learning Apr 30, 2020 continuous-control Continuous Control
— Unverified 0How to Learn a Useful Critic? Model-based Action-Gradient-Estimator Policy Optimization Apr 29, 2020 continuous-control Continuous Control
Code Code Available 1Image Augmentation Is All You Need: Regularizing Deep Reinforcement Learning from Pixels Apr 28, 2020 All Atari Games 100k
Code Code Available 1Learning to Guide Random Search Apr 25, 2020 Bayesian Optimization continuous-control
Code Code Available 1Conservation Voltage Reduction (CVR) via Two-Timescale Control in Unbalanced Power Distribution Systems Apr 24, 2020 continuous-control Continuous Control
— Unverified 0PBCS : Efficient Exploration and Exploitation Using a Synergy between Reinforcement Learning and Motion Planning Apr 24, 2020 continuous-control Continuous Control
— Unverified 0Divide-and-Conquer Monte Carlo Tree Search For Goal-Directed Planning Apr 23, 2020 continuous-control Continuous Control
— Unverified 0Learning Constrained Adaptive Differentiable Predictive Control Policies With Guarantees Apr 23, 2020 Continuous Control Imitation Learning
Code Code Available 1Continual Reinforcement Learning with Multi-Timescale Replay Apr 16, 2020 Continual Learning continuous-control
Code Code Available 1CURL: Contrastive Unsupervised Representations for Reinforcement Learning Apr 8, 2020 Atari Games Atari Games 100k
Code Code Available 1Uniform State Abstraction For Reinforcement Learning Apr 6, 2020 continuous-control Continuous Control
— Unverified 0Weakly-Supervised Reinforcement Learning for Controllable Behavior Apr 6, 2020 continuous-control Continuous Control
— Unverified 0Intrinsic Exploration as Multi-Objective RL Apr 6, 2020 continuous-control Continuous Control
— Unverified 0Learning Sparse Rewarded Tasks from Sub-Optimal Demonstrations Apr 1, 2020 continuous-control Continuous Control
Code Code Available 0Exploration in Action Space Mar 31, 2020 continuous-control Continuous Control
Code Code Available 0Controllable Person Image Synthesis with Attribute-Decomposed GAN Mar 27, 2020 Attribute continuous-control
Code Code Available 1An empirical investigation of the challenges of real-world reinforcement learning Mar 24, 2020 continuous-control Continuous Control
Code Code Available 1PFPN: Continuous Control of Physically Simulated Characters using Particle Filtering Policy Network Mar 16, 2020 continuous-control Continuous Control
Code Code Available 1Online Meta-Critic Learning for Off-Policy Actor-Critic Methods Mar 11, 2020 continuous-control Continuous Control
Code Code Available 1ABC-LMPC: Safe Sample-Based Learning MPC for Stochastic Nonlinear Dynamical Systems with Adjustable Boundary Conditions Mar 3, 2020 continuous-control Continuous Control
— Unverified 0PlaNet of the Bayesians: Reconsidering and Improving Deep Planning Network by Incorporating Bayesian Inference Mar 1, 2020 Bayesian Inference continuous-control
— Unverified 0Off-Policy Deep Reinforcement Learning with Analogous Disentangled Exploration Feb 25, 2020 continuous-control Continuous Control
Code Code Available 0