| Efficiently Training On-Policy Actor-Critic Networks in Robotic Deep Reinforcement Learning with Demonstration-like Sampled Exploration | Sep 27, 2021 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| Improved Soft Actor-Critic: Mixing Prioritized Off-Policy Samples with On-Policy Experience | Sep 24, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Parameter-free Reduction of the Estimation Bias in Deep Reinforcement Learning for Deterministic Policy Gradients | Sep 24, 2021 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Trust Region Policy Optimisation in Multi-Agent Reinforcement Learning | Sep 23, 2021 | LEMMAMuJoCo | CodeCode Available | 1 |
| Membership Inference Attacks Against Temporally Correlated Data in Deep Reinforcement Learning | Sep 8, 2021 | Adversarial Attackcontinuous-control | —Unverified | 0 |
| Hindsight Reward Tweaking via Conditional Deep Reinforcement Learning | Sep 6, 2021 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| Improved Robustness and Safety for Pre-Adaptation of Meta Reinforcement Learning with Prior Regularization | Aug 19, 2021 | Autonomous VehiclesDecision Making | —Unverified | 0 |
| Settling the Variance of Multi-Agent Policy Gradients | Aug 19, 2021 | MuJoCoReinforcement Learning (RL) | CodeCode Available | 1 |
| A general class of surrogate functions for stable and efficient reinforcement learning | Aug 12, 2021 | MuJoCoPolicy Gradient Methods | CodeCode Available | 0 |
| A Pragmatic Look at Deep Imitation Learning | Aug 4, 2021 | Behavioural cloningD4RL | —Unverified | 0 |