| Regulatory Focus: Promotion and Prevention Inclinations in Policy Search | Sep 25, 2019 | Atari Games, continuous-control | Unverified | 0 |
| Risk Averse Value Expansion for Sample Efficient and Robust Policy Learning | Sep 25, 2019 | Model-based Reinforcement Learning, MuJoCo | Unverified | 0 |
| CrossNorm: On Normalization for Off-Policy Reinforcement Learning | Sep 25, 2019 | MuJoCo, reinforcement-learning | Unverified | 0 |
| Stabilizing Off-Policy Reinforcement Learning with Conservative Policy Gradients | Sep 25, 2019 | Deep Reinforcement Learning, MuJoCo | Unverified | 0 |
| Bootstrapping the Expressivity with Model-based Planning | Sep 25, 2019 | model, MuJoCo | Code Available | 0 |
| Policy Tree Network | Sep 25, 2019 | Model-based Reinforcement Learning, MuJoCo | Unverified | 0 |
| Towards Simplicity in Deep Reinforcement Learning: Streamlined Off-Policy Learning | Sep 25, 2019 | continuous-control, Continuous Control | Unverified | 0 |
| Learning Latent Representations for Inverse Dynamics using Generalized Experiences | Sep 25, 2019 | Deep Reinforcement Learning, MuJoCo | Unverified | 0 |
| Safe Policy Learning for Continuous Control | Sep 25, 2019 | continuous-control, Continuous Control | Unverified | 0 |
| Multi-task Batch Reinforcement Learning with Metric Learning | Sep 25, 2019 | Meta Reinforcement Learning, Metric Learning | Unverified | 0 |
| MDP Playground: An Analysis and Debug Testbed for Reinforcement Learning | Sep 17, 2019 | MuJoCo, OpenAI Gym | Code Available | 0 |
| Attraction-Repulsion Actor-Critic for Continuous Control Reinforcement Learning | Sep 17, 2019 | continuous-control, Continuous Control | Unverified | 0 |
| Biased Estimates of Advantages over Path Ensembles | Sep 15, 2019 | Atari Games, continuous-control | Unverified | 0 |
| Policy Prediction Network: Model-Free Behavior Policy with Model-Based Learning in Continuous Action Space | Sep 15, 2019 | continuous-control, Continuous Control | Unverified | 0 |
| Mutual-Information Regularization in Markov Decision Processes and Actor-Critic Learning | Sep 11, 2019 | MuJoCo, Q-Learning | Unverified | 0 |
| Regularized Anderson Acceleration for Off-Policy Deep Reinforcement Learning | Sep 7, 2019 | Deep Reinforcement Learning, MuJoCo | Code Available | 0 |
| Skill Transfer in Deep Reinforcement Learning under Morphological Heterogeneity | Aug 14, 2019 | Decoder, Deep Reinforcement Learning | Unverified | 0 |
| Towards Model-based Reinforcement Learning for Industry-near Environments | Jul 27, 2019 | Deep Reinforcement Learning, Model-based Reinforcement Learning | Code Available | 0 |
| A Unified Bellman Optimality Principle Combining Reward Maximization and Empowerment | Jul 26, 2019 | MuJoCo, Reinforcement Learning | Unverified | 0 |
| Learning Policies through Quantile Regression | Jun 27, 2019 | MuJoCo, quantile regression | Unverified | 0 |
| ORRB -- OpenAI Remote Rendering Backend | Jun 26, 2019 | MuJoCo | Code Available | 0 |
| Exploring Model-based Planning with Policy Networks | Jun 20, 2019 | Benchmarking, model | Code Available | 0 |
| Calibrated Model-Based Deep Reinforcement Learning | Jun 19, 2019 | Deep Reinforcement Learning, model | Code Available | 0 |
| Reward Prediction Error as an Exploration Objective in Deep RL | Jun 19, 2019 | Atari Games, Continuous Control | Unverified | 0 |
| Robust Reinforcement Learning for Continuous Control with Model Misspecification | Jun 18, 2019 | continuous-control, Continuous Control | Unverified | 0 |