| Regularized Anderson Acceleration for Off-Policy Deep Reinforcement Learning | Sep 7, 2019 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 0 |
| Skill Transfer in Deep Reinforcement Learning under Morphological Heterogeneity | Aug 14, 2019 | DecoderDeep Reinforcement Learning | —Unverified | 0 |
| Towards Model-based Reinforcement Learning for Industry-near Environments | Jul 27, 2019 | Deep Reinforcement LearningModel-based Reinforcement Learning | CodeCode Available | 0 |
| A Unified Bellman Optimality Principle Combining Reward Maximization and Empowerment | Jul 26, 2019 | MuJoCoReinforcement Learning | —Unverified | 0 |
| Learning Policies through Quantile Regression | Jun 27, 2019 | MuJoCoquantile regression | —Unverified | 0 |
| ORRB -- OpenAI Remote Rendering Backend | Jun 26, 2019 | MuJoCo | CodeCode Available | 0 |
| Exploring Model-based Planning with Policy Networks | Jun 20, 2019 | Benchmarkingmodel | CodeCode Available | 0 |
| Calibrated Model-Based Deep Reinforcement Learning | Jun 19, 2019 | Deep Reinforcement Learningmodel | CodeCode Available | 0 |
| Reward Prediction Error as an Exploration Objective in Deep RL | Jun 19, 2019 | Atari GamesContinuous Control | —Unverified | 0 |
| Robust Reinforcement Learning for Continuous Control with Model Misspecification | Jun 18, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| Language as an Abstraction for Hierarchical Deep Reinforcement Learning | Jun 18, 2019 | Deep Reinforcement LearningInstruction Following | CodeCode Available | 0 |
| Learning Powerful Policies by Using Consistent Dynamics Model | Jun 11, 2019 | Atari Gamesmodel | CodeCode Available | 0 |
| Sequence Modeling of Temporal Credit Assignment for Episodic Reinforcement Learning | May 31, 2019 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 0 |
| Learning Efficient and Effective Exploration Policies with Counterfactual Meta Policy | May 28, 2019 | counterfactualEfficient Exploration | —Unverified | 0 |
| Disentangling Dynamics and Returns: Value Function Decomposition with Future Prediction | May 27, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| Policy Search by Target Distribution Learning for Continuous Control | May 27, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| Imitation Learning from Video by Leveraging Proprioception | May 22, 2019 | Imitation LearningMuJoCo | —Unverified | 0 |
| Evolving Rewards to Automate Reinforcement Learning | May 18, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| Leveraging exploration in off-policy algorithms via normalizing flows | May 16, 2019 | continuous-controlContinuous Control | CodeCode Available | 0 |
| P3O: Policy-on Policy-off Policy Optimization | May 5, 2019 | MuJoCoReinforcement Learning | CodeCode Available | 0 |
| Collaborative Evolutionary Reinforcement Learning | May 2, 2019 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Composing Complex Skills by Learning Transition Policies with Proximity Reward Induction | May 1, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| Hierarchical RL Using an Ensemble of Proprioceptive Periodic Policies | May 1, 2019 | MuJoCo | —Unverified | 0 |
| SUPERVISED POLICY UPDATE | May 1, 2019 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 0 |
| Dot-to-Dot: Explainable Hierarchical Reinforcement Learning for Robotic Manipulation | Apr 14, 2019 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |