| ROER: Regularized Optimal Experience Replay | Jul 4, 2024 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Memory Sequence Length of Data Sampling Impacts the Adaptation of Meta-Reinforcement Learning Agents | Jun 18, 2024 | continuous-controlContinuous Control | —Unverified | 0 |
| Robust Model-Based Reinforcement Learning with an Adversarial Auxiliary Model | Jun 14, 2024 | Board Gamesmodel | CodeCode Available | 0 |
| Residual Learning and Context Encoding for Adaptive Offline-to-Online Reinforcement Learning | Jun 12, 2024 | D4RLMuJoCo | CodeCode Available | 0 |
| RRLS : Robust Reinforcement Learning Suite | Jun 12, 2024 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Learning Reward and Policy Jointly from Demonstration and Preference Improves Alignment | Jun 11, 2024 | MuJoCoreinforcement-learning | —Unverified | 0 |
| Skill-aware Mutual Information Optimisation for Generalisation in Reinforcement Learning | Jun 7, 2024 | Contrastive LearningMeta Reinforcement Learning | CodeCode Available | 1 |
| DEER: A Delay-Resilient Framework for Reinforcement Learning with Variable Delays | Jun 5, 2024 | MuJoCoReinforcement Learning (RL) | —Unverified | 0 |
| Value Improved Actor Critic Algorithms | Jun 3, 2024 | MuJoCo | —Unverified | 0 |
| Enhancing Efficiency of Safe Reinforcement Learning via Sample Manipulation | May 31, 2024 | MuJoCoreinforcement-learning | CodeCode Available | 5 |