| Action Robust Reinforcement Learning and Applications in Continuous Control | Jan 26, 2019 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Off-Policy Average Reward Actor-Critic with Deterministic Policy Search | May 20, 2023 | MuJoCo | CodeCode Available | 0 |
| Balancing Value Underestimation and Overestimation with Realistic Actor-Critic | Oct 19, 2021 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Sample Dropout: A Simple yet Effective Variance Reduction Technique in Deep Policy Optimization | Feb 5, 2023 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 0 |
| On Learning Intrinsic Rewards for Policy Gradient Methods | Apr 17, 2018 | Atari GamesDecision Making | CodeCode Available | 0 |
| Application of linear regression method to the deep reinforcement learning in continuous action cases | Mar 19, 2025 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 0 |
| Constrained Intrinsic Motivation for Reinforcement Learning | Jul 12, 2024 | MuJoCoreinforcement-learning | CodeCode Available | 0 |
| Mimicking Better by Matching the Approximate Action Distribution | Jun 16, 2023 | Imitation LearningMuJoCo | CodeCode Available | 0 |
| Scalable Kernel Inverse Optimization | Oct 31, 2024 | MuJoCo | CodeCode Available | 0 |
| On Rollouts in Model-Based Reinforcement Learning | Jan 28, 2025 | modelModel-based Reinforcement Learning | CodeCode Available | 0 |