| On the Perturbed States for Transformed Input-robust Reinforcement Learning | Jul 31, 2024 | Denoising, MuJoCo | Code Available | 0 |
| SOAP-RL: Sequential Option Advantage Propagation for Reinforcement Learning in POMDP Environments | Jul 26, 2024 | MuJoCo | Code Available | 0 |
| Maximum Entropy On-Policy Actor-Critic via Entropy Advantage Estimation | Jul 25, 2024 | MuJoCo | Unverified | 0 |
| Learning Constraint Network from Demonstrations via Positive-Unlabeled Learning with Memory Replay | Jul 23, 2024 | MuJoCo | Unverified | 0 |
| Proximal Policy Distillation | Jul 21, 2024 | continuous-control, Continuous Control | Code Available | 0 |
| Temporal Abstraction in Reinforcement Learning with Offline Data | Jul 21, 2024 | Hierarchical Reinforcement Learning, MuJoCo | Unverified | 0 |
| LLM-Empowered State Representation for Reinforcement Learning | Jul 18, 2024 | MuJoCo, reinforcement-learning | Code Available | 1 |
| Energy-Guided Diffusion Sampling for Offline-to-Online Reinforcement Learning | Jul 17, 2024 | MuJoCo, reinforcement-learning | Code Available | 1 |
| Constrained Intrinsic Motivation for Reinforcement Learning | Jul 12, 2024 | MuJoCo, reinforcement-learning | Code Available | 0 |
| A Review of Nine Physics Engines for Reinforcement Learning Research | Jul 11, 2024 | Decision Making, MuJoCo | Unverified | 0 |