| Learning Intrinsic Symbolic Rewards in Reinforcement Learning | Oct 8, 2020 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| Reinforcement Learning with Random Delays | Oct 6, 2020 | Anatomycontinuous-control | CodeCode Available | 1 |
| FORK: A Forward-Looking Actor For Model-Free Reinforcement Learning | Oct 4, 2020 | GPUMuJoCo | CodeCode Available | 1 |
| What About Taking Policy as Input of Value Function: Policy-extended Value Function Approximator | Sep 28, 2020 | continuous-controlContinuous Control | —Unverified | 0 |
| Population-Guided Imitation Learning | Sep 27, 2020 | Atari GamesImitation Learning | —Unverified | 0 |
| robosuite: A Modular Simulation Framework and Benchmark for Robot Learning | Sep 25, 2020 | Gesture GenerationMuJoCo | CodeCode Available | 2 |
| Revisiting Design Choices in Proximal Policy Optimization | Sep 23, 2020 | MuJoCo | CodeCode Available | 1 |
| Soft policy optimization using dual-track advantage estimator | Sep 15, 2020 | MuJoCoReinforcement Learning (RL) | —Unverified | 0 |
| Sample-Efficient Automated Deep Reinforcement Learning | Sep 3, 2020 | Deep Reinforcement LearningHyperparameter Optimization | CodeCode Available | 1 |
| Constrained Markov Decision Processes via Backward Value Functions | Aug 26, 2020 | MuJoCoReinforcement Learning (RL) | —Unverified | 0 |