| Imitation Learning from Purified Demonstrations | Oct 11, 2023 | Decision Making, Imitation Learning | Code Available | 0 |
| Improved Communication Efficiency in Federated Natural Policy Gradient via ADMM-based Gradient Updates | Oct 9, 2023 | MuJoCo | Unverified | 0 |
| FP3O: Enabling Proximal Policy Optimization in Multi-Agent Cooperation with Parameter-Sharing Versatility | Oct 8, 2023 | MuJoCo, Multi-agent Reinforcement Learning | Unverified | 0 |
| On Representation Complexity of Model-based and Model-free Reinforcement Learning | Oct 3, 2023 | model, MuJoCo | Unverified | 0 |
| CasIL: Cognizing and Imitating Skills via a Dual Cognition-Action Architecture | Sep 28, 2023 | Imitation Learning, MuJoCo | Unverified | 0 |
| Adapting Double Q-Learning for Continuous Reinforcement Learning | Sep 25, 2023 | MuJoCo, Q-Learning | Unverified | 0 |
| Iterative Reachability Estimation for Safe Reinforcement Learning | Sep 24, 2023 | MuJoCo, reinforcement-learning | Unverified | 0 |
| Practical Probabilistic Model-based Deep Reinforcement Learning by Integrating Dropout Uncertainty and Trajectory Sampling | Sep 20, 2023 | Deep Reinforcement Learning, Model-based Reinforcement Learning | Code Available | 1 |
| Text2Reward: Reward Shaping with Language Models for Reinforcement Learning | Sep 20, 2023 | MuJoCo, reinforcement-learning | Code Available | 2 |
| A Bayesian Approach to Robust Inverse Reinforcement Learning | Sep 15, 2023 | Imitation Learning, MuJoCo | Code Available | 1 |