Learning the Linear Quadratic Regulator from Nonlinear Observations | Oct 8, 2020 | Continuous Control | Unverified 0
Reinforcement Learning for Many-Body Ground-State Preparation Inspired by Counterdiabatic Driving | Oct 7, 2020 | Continuous Control | Unverified 0
Reinforcement Learning with Random Delays | Oct 6, 2020 | Anatomy, Continuous Control | Code Available 1
Learning Diverse Options via InfoMax Termination Critic | Oct 6, 2020 | Continuous Control, Diversity | Code Available 0
My Body is a Cage: the Role of Morphology in Graph-Based Incompatible Control | Oct 5, 2020 | Continuous Control | Code Available 1
Heteroscedastic Bayesian Optimisation for Stochastic Model Predictive Control | Oct 1, 2020 | Bayesian Optimisation, Continuous Control | Unverified 0
Bridging the gap between Markowitz planning and deep reinforcement learning | Sep 30, 2020 | Asset Management, Autonomous Driving | Unverified 0
Neural Lyapunov Model Predictive Control | Sep 28, 2020 | Continuous Control | Unverified 0
Adaptive Discretization for Continuous Control using Particle Filtering Policy Network | Sep 28, 2020 | Continuous Control | Unverified 0
What About Taking Policy as Input of Value Function: Policy-extended Value Function Approximator | Sep 28, 2020 | Continuous Control | Unverified 0
Autonomous Learning of Features for Control: Experiments with Embodied and Situated Agents | Sep 15, 2020 | Continuous Control | Unverified 0
Multi-Agent Reinforcement Learning in Cournot Games | Sep 14, 2020 | Continuous Control | Unverified 0
DyNODE: Neural Ordinary Differential Equations for Dynamics Modeling in Continuous Control | Sep 9, 2020 | Continuous Control | Code Available 1
Visualizing the Loss Landscape of Actor Critic Methods with Applications in Inventory Optimization | Sep 4, 2020 | Continuous Control | Unverified 0
On the model-based stochastic value gradient for continuous reinforcement learning | Aug 28, 2020 | Continuous Control, Humanoid Control | Code Available 1
Learning Off-Policy with Online Planning | Aug 23, 2020 | ARC, Continuous Control | Code Available 1
ReLMoGen: Leveraging Motion Generation in Reinforcement Learning for Mobile Manipulation | Aug 18, 2020 | Continuous Control | Unverified 0
Overcoming Model Bias for Robust Offline Deep Reinforcement Learning | Aug 12, 2020 | Continuous Control | Unverified 0
Contrastive Variational Reinforcement Learning for Complex Observations | Aug 6, 2020 | Atari Games, Continuous Control | Code Available 1
ClipUp: A Simple and Powerful Optimizer for Distribution-based Policy Evolution | Aug 5, 2020 | Continuous Control | Code Available 1
Proximal Deterministic Policy Gradient | Aug 3, 2020 | Continuous Control | Unverified 0
Statistical Bootstrapping for Uncertainty Estimation in Off-Policy Evaluation | Jul 27, 2020 | Continuous Control | Unverified 0
Learning Compositional Neural Programs for Continuous Control | Jul 27, 2020 | Continuous Control | Unverified 0
Predictive Information Accelerates Learning in RL | Jul 24, 2020 | Continuous Control | Code Available 1
Understanding and Mitigating the Limitations of Prioritized Experience Replay | Jul 19, 2020 | Autonomous Driving, Continuous Control | Code Available 0
Control as Hybrid Inference | Jul 11, 2020 | Continuous Control | Unverified 0
Task-Agnostic Exploration via Policy Gradient of a Non-Parametric State Entropy Estimate | Jul 9, 2020 | Continuous Control | Code Available 1
Lipschitzness Is All You Need To Tame Off-policy Generative Adversarial Imitation Learning | Jun 28, 2020 | All, Continuous Control | Code Available 0
Deep Bayesian Quadrature Policy Optimization | Jun 28, 2020 | Continuous Control | Code Available 1
DDPG++: Striving for Simplicity in Continuous-control Off-Policy Reinforcement Learning | Jun 26, 2020 | Continuous Control | Unverified 0
Off-Dynamics Reinforcement Learning: Training for Transfer with Domain Classifiers | Jun 24, 2020 | Continuous Control | Code Available 0
dm_control: Software and Tasks for Continuous Control | Jun 22, 2020 | Continuous Control | Unverified 0
Information Theoretic Regret Bounds for Online Nonlinear Control | Jun 22, 2020 | Continuous Control | Code Available 0
Towards Tractable Optimism in Model-Based Reinforcement Learning | Jun 21, 2020 | Continuous Control | Unverified 0
Generating Adjacency-Constrained Subgoals in Hierarchical Reinforcement Learning | Jun 20, 2020 | Continuous Control | Code Available 1
WD3: Taming the Estimation Bias in Deep Reinforcement Learning | Jun 18, 2020 | Continuous Control | Unverified 0
Reparameterized Variational Divergence Minimization for Stable Imitation | Jun 18, 2020 | Continuous Control | Unverified 0
COLREG-Compliant Collision Avoidance for Unmanned Surface Vehicle using Deep Reinforcement Learning | Jun 16, 2020 | Autonomous Vehicles, Collision Avoidance | Unverified 0
Data Driven Control with Learned Dynamics: Model-Based versus Model-Free Approach | Jun 16, 2020 | Continuous Control | Unverified 0
Parameter-Based Value Functions | Jun 16, 2020 | Continuous Control | Code Available 0
Model-based Adversarial Meta-Reinforcement Learning | Jun 16, 2020 | Continuous Control | Code Available 1
Analytic Manifold Learning: Unifying and Evaluating Representations for Continuous Control | Jun 15, 2020 | Continuous Control | Code Available 1
Diversity Policy Gradient for Sample Efficient Quality-Diversity Optimization | Jun 15, 2020 | Continuous Control | Unverified 0
Provably Efficient Model-based Policy Adaptation | Jun 14, 2020 | Continuous Control | Unverified 0
Online Hyper-parameter Tuning in Off-policy Learning via Evolutionary Strategies | Jun 13, 2020 | Continuous Control | Unverified 0
Lifelong Learning of Factored Policies via Policy Gradients | Jun 12, 2020 | Continuous Control | Unverified 0
Skill Discovery for Exploration and Planning using Deep Skill Graphs | Jun 12, 2020 | Continuous Control | Unverified 0
A Policy Gradient Method for Task-Agnostic Exploration | Jun 12, 2020 | Continuous Control | Code Available 1
Self-Imitation Learning via Generalized Lower Bound Q-learning | Jun 12, 2020 | Continuous Control | Unverified 0
Continuous Control for Searching and Planning with a Learned Model | Jun 12, 2020 | Continuous Control | Unverified 0