| Trust the Model Where It Trusts Itself -- Model-Based Actor-Critic with Uncertainty-Aware Rollout Adaption | May 29, 2024 | modelModel-based Reinforcement Learning | CodeCode Available | 0 |
| Imitating from auxiliary imperfect demonstrations via Adversarial Density Weighted Regression | May 28, 2024 | Imitation LearningMuJoCo | CodeCode Available | 0 |
| A Pontryagin Perspective on Reinforcement Learning | May 28, 2024 | MuJoCoreinforcement-learning | —Unverified | 0 |
| Symmetric Reinforcement Learning Loss for Robust Learning on Diverse Tasks and Model Scales | May 27, 2024 | Atari GamesMuJoCo | CodeCode Available | 0 |
| Any-step Dynamics Model Improves Future Predictions for Online and Offline Reinforcement Learning | May 27, 2024 | Gym halfcheetah-mediumGym halfcheetah-medium-expert | CodeCode Available | 2 |
| Diffusion-based Reinforcement Learning via Q-weighted Variational Policy Optimization | May 25, 2024 | continuous-controlContinuous Control | CodeCode Available | 2 |
| Adaptive Q-Network: On-the-fly Target Selection for Deep Reinforcement Learning | May 25, 2024 | Atari GamesAutoML | —Unverified | 0 |
| Diffusion Actor-Critic with Entropy Regulator | May 24, 2024 | Decision MakingMuJoCo | CodeCode Available | 2 |
| Variational Delayed Policy Optimization | May 23, 2024 | MuJoCoReinforcement Learning (RL) | CodeCode Available | 0 |
| Maximum Entropy Reinforcement Learning via Energy-Based Normalizing Flow | May 22, 2024 | IngenuityMuJoCo | CodeCode Available | 1 |