| Time Discretization-Invariant Safe Action Repetition for Policy Gradient Methods | Nov 6, 2021 | MuJoCoPolicy Gradient Methods | CodeCode Available | 0 |
| Smooth Imitation Learning via Smooth Costs and Smooth Policies | Nov 3, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Conditioning Sparse Variational Gaussian Processes for Online Decision-making | Oct 28, 2021 | Active LearningDecision Making | CodeCode Available | 1 |
| Policy Search using Dynamic Mirror Descent MPC for Model Free Off Policy RL | Oct 23, 2021 | Model Predictive ControlMuJoCo | —Unverified | 0 |
| CIM-PPO:Proximal Policy Optimization with Liu-Correntropy Induced Metric | Oct 20, 2021 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| Balancing Value Underestimation and Overestimation with Realistic Actor-Critic | Oct 19, 2021 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Wasserstein Unsupervised Reinforcement Learning | Oct 15, 2021 | Hierarchical Reinforcement LearningMuJoCo | —Unverified | 0 |
| On-Policy Model Errors in Reinforcement Learning | Oct 15, 2021 | modelMuJoCo | —Unverified | 0 |
| Theoretically Principled Deep RL Acceleration via Nearest Neighbor Function Approximation | Oct 9, 2021 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| Multi-Agent Constrained Policy Optimisation | Oct 6, 2021 | MuJoCoMulti-agent Reinforcement Learning | CodeCode Available | 1 |