| Hierarchical Reinforcement Learning of Locomotion Policies in Response to Approaching Objects: A Preliminary Study | Mar 20, 2022 | Deep Reinforcement Learning, Hierarchical Reinforcement Learning | —Unverified | 0 |
| Safe adaptation in multiagent competition | Mar 14, 2022 | MuJoCo | —Unverified | 0 |
| Context is Everything: Implicit Identification for Dynamics Adaptation | Mar 10, 2022 | MuJoCo | —Unverified | 0 |
| AutoDIME: Automatic Design of Interesting Multi-Agent Environments | Mar 4, 2022 | Diagnostic, MuJoCo | —Unverified | 0 |
| A Recurrent Differentiable Engine for Modeling Tensegrity Robots Trainable with Low-Frequency Data | Feb 28, 2022 | MuJoCo | —Unverified | 0 |
| User-Oriented Robust Reinforcement Learning | Feb 15, 2022 | MuJoCo, reinforcement-learning | —Unverified | 0 |
| DNS: Determinantal Point Process Based Neural Network Sampler for Ensemble Reinforcement Learning | Jan 31, 2022 | continuous-control, Continuous Control | Code Available | 0 |
| STOPS: Short-Term-based Volatility-controlled Policy Search and its Global Convergence | Jan 24, 2022 | MuJoCo | —Unverified | 0 |
| Recursive Least Squares Advantage Actor-Critic Algorithms | Jan 15, 2022 | Computational Efficiency, continuous-control | —Unverified | 0 |
| Comparing Model-free and Model-based Algorithms for Offline Reinforcement Learning | Jan 14, 2022 | model, MuJoCo | —Unverified | 0 |