| Offline Imitation Learning with a Misspecified Simulator | Dec 1, 2020 | Decision MakingFriction | —Unverified | 0 |
| Continuous Transition: Improving Sample Efficiency for Continuous Control Problems via MixUp | Nov 30, 2020 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Weighted Entropy Modification for Soft Actor-Critic | Nov 18, 2020 | MuJoCoreinforcement-learning | —Unverified | 0 |
| Proximal Policy Optimization via Enhanced Exploration Efficiency | Nov 11, 2020 | continuous-controlContinuous Control | —Unverified | 0 |
| Sim2Sim Evaluation of a Novel Data-Efficient Differentiable Physics Engine for Tensegrity Robots | Nov 10, 2020 | MuJoCo | —Unverified | 0 |
| RealAnt: An Open-Source Low-Cost Quadruped for Education and Research in Real-World Reinforcement Learning | Nov 5, 2020 | MuJoCoreinforcement-learning | CodeCode Available | 1 |
| Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping | Nov 5, 2020 | MuJoCoReinforcement Learning (RL) | —Unverified | 0 |
| Cooperative Heterogeneous Deep Reinforcement Learning | Nov 2, 2020 | continuous-controlContinuous Control | —Unverified | 0 |
| Can Reinforcement Learning for Continuous Control Generalize Across Physics Engines? | Oct 27, 2020 | continuous-controlContinuous Control | —Unverified | 0 |
| Robust Constrained Reinforcement Learning for Continuous Control with Model Misspecification | Oct 20, 2020 | continuous-controlContinuous Control | —Unverified | 0 |
| Knowledge Transfer in Multi-Task Deep Reinforcement Learning for Continuous Control | Oct 15, 2020 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Human-guided Robot Behavior Learning: A GAN-assisted Preference-based Reinforcement Learning Approach | Oct 15, 2020 | Generative Adversarial NetworkMuJoCo | CodeCode Available | 0 |
| Self-Imitation Learning for Robot Tasks with Sparse and Delayed Rewards | Oct 14, 2020 | Imitation LearningMuJoCo | CodeCode Available | 0 |
| Balancing Constraints and Rewards with Meta-Gradient D4PG | Oct 13, 2020 | MuJoCoReinforcement Learning (RL) | —Unverified | 0 |
| Hindsight Experience Replay with Kronecker Product Approximate Curvature | Oct 9, 2020 | MuJoCo | —Unverified | 0 |
| Learning Intrinsic Symbolic Rewards in Reinforcement Learning | Oct 8, 2020 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| Reinforcement Learning with Random Delays | Oct 6, 2020 | Anatomycontinuous-control | CodeCode Available | 1 |
| FORK: A Forward-Looking Actor For Model-Free Reinforcement Learning | Oct 4, 2020 | GPUMuJoCo | CodeCode Available | 1 |
| What About Taking Policy as Input of Value Function: Policy-extended Value Function Approximator | Sep 28, 2020 | continuous-controlContinuous Control | —Unverified | 0 |
| Population-Guided Imitation Learning | Sep 27, 2020 | Atari GamesImitation Learning | —Unverified | 0 |
| robosuite: A Modular Simulation Framework and Benchmark for Robot Learning | Sep 25, 2020 | Gesture GenerationMuJoCo | CodeCode Available | 2 |
| Revisiting Design Choices in Proximal Policy Optimization | Sep 23, 2020 | MuJoCo | CodeCode Available | 1 |
| Soft policy optimization using dual-track advantage estimator | Sep 15, 2020 | MuJoCoReinforcement Learning (RL) | —Unverified | 0 |
| Sample-Efficient Automated Deep Reinforcement Learning | Sep 3, 2020 | Deep Reinforcement LearningHyperparameter Optimization | CodeCode Available | 1 |
| Constrained Markov Decision Processes via Backward Value Functions | Aug 26, 2020 | MuJoCoReinforcement Learning (RL) | —Unverified | 0 |