| Value Gradient weighted Model-Based Reinforcement Learning | Apr 4, 2022 | model, Model-based Reinforcement Learning | Code Available | 1 |
| Hierarchical Reinforcement Learning of Locomotion Policies in Response to Approaching Objects: A Preliminary Study | Mar 20, 2022 | Deep Reinforcement Learning, Hierarchical Reinforcement Learning | Unverified | 0 |
| Safe adaptation in multiagent competition | Mar 14, 2022 | MuJoCo | Unverified | 0 |
| Context is Everything: Implicit Identification for Dynamics Adaptation | Mar 10, 2022 | MuJoCo | Unverified | 0 |
| AutoDIME: Automatic Design of Interesting Multi-Agent Environments | Mar 4, 2022 | Diagnostic, MuJoCo | Unverified | 0 |
| A Recurrent Differentiable Engine for Modeling Tensegrity Robots Trainable with Low-Frequency Data | Feb 28, 2022 | MuJoCo | Unverified | 0 |
| User-Oriented Robust Reinforcement Learning | Feb 15, 2022 | MuJoCo, reinforcement-learning | Unverified | 0 |
| Deconstructing the Inductive Biases of Hamiltonian Neural Networks | Feb 10, 2022 | MuJoCo | Code Available | 1 |
| Lipschitz-constrained Unsupervised Skill Discovery | Feb 2, 2022 | Diversity, MuJoCo | Code Available | 1 |
| DNS: Determinantal Point Process Based Neural Network Sampler for Ensemble Reinforcement Learning | Jan 31, 2022 | continuous-control, Continuous Control | Code Available | 0 |
| STOPS: Short-Term-based Volatility-controlled Policy Search and its Global Convergence | Jan 24, 2022 | MuJoCo | Unverified | 0 |
| Recursive Least Squares Advantage Actor-Critic Algorithms | Jan 15, 2022 | Computational Efficiency, continuous-control | Unverified | 0 |
| Comparing Model-free and Model-based Algorithms for Offline Reinforcement Learning | Jan 14, 2022 | model, MuJoCo | Unverified | 0 |
| SimSR: Simple Distance-based State Representation for Deep Reinforcement Learning | Dec 31, 2021 | Deep Reinforcement Learning, MuJoCo | Code Available | 1 |
| Self Reward Design with Fine-grained Interpretability | Dec 30, 2021 | Deep Reinforcement Learning, Fairness | Code Available | 0 |
| Multiagent Model-based Credit Assignment for Continuous Control | Dec 27, 2021 | continuous-control, Continuous Control | Unverified | 0 |
| CEM-GD: Cross-Entropy Method with Gradient Descent Planner for Model-Based Reinforcement Learning | Dec 14, 2021 | continuous-control, Continuous Control | Code Available | 0 |
| OstrichRL: A Musculoskeletal Ostrich Simulation to Study Bio-mechanical Locomotion | Dec 11, 2021 | MuJoCo, reinforcement-learning | Code Available | 1 |
| Residual Pathway Priors for Soft Equivariance Constraints | Dec 2, 2021 | MuJoCo | Code Available | 1 |
| Offline Model-based Adaptable Policy Learning | Dec 1, 2021 | Decision Making, model | Code Available | 1 |
| EDGE: Explaining Deep Reinforcement Learning Policies | Dec 1, 2021 | Deep Reinforcement Learning, MuJoCo | Code Available | 1 |
| Cross-modal Domain Adaptation for Cost-Efficient Visual Reinforcement Learning | Dec 1, 2021 | Domain Adaptation, MuJoCo | Code Available | 1 |
| Continuous Control With Ensemble Deep Deterministic Policy Gradients | Nov 30, 2021 | continuous-control, Continuous Control | Code Available | 0 |
| Generalized Decision Transformer for Offline Hindsight Information Matching | Nov 19, 2021 | continuous-control, Continuous Control | Code Available | 1 |
| Uncertainty-aware Low-Rank Q-Matrix Estimation for Deep Reinforcement Learning | Nov 19, 2021 | continuous-control, Continuous Control | Unverified | 0 |