| Self Reward Design with Fine-grained Interpretability | Dec 30, 2021 | Deep Reinforcement LearningFairness | CodeCode Available | 0 |
| Multiagent Model-based Credit Assignment for Continuous Control | Dec 27, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| CEM-GD: Cross-Entropy Method with Gradient Descent Planner for Model-Based Reinforcement Learning | Dec 14, 2021 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Continuous Control With Ensemble Deep Deterministic Policy Gradients | Nov 30, 2021 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Uncertainty-aware Low-Rank Q-Matrix Estimation for Deep Reinforcement Learning | Nov 19, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Aggressive Q-Learning with Ensembles: Achieving Both High Sample Efficiency and High Asymptotic Performance | Nov 17, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Improving Learning from Demonstrations by Learning from Experience | Nov 16, 2021 | Imitation LearningMuJoCo | —Unverified | 0 |
| GRI: General Reinforced Imitation and its Application to Vision-Based Autonomous Driving | Nov 16, 2021 | Autonomous DrivingCARLA MAP Leaderboard | —Unverified | 0 |
| V-MAO: Generative Modeling for Multi-Arm Manipulation of Articulated Objects | Nov 7, 2021 | MuJoCoObject | —Unverified | 0 |
| Time Discretization-Invariant Safe Action Repetition for Policy Gradient Methods | Nov 6, 2021 | MuJoCoPolicy Gradient Methods | CodeCode Available | 0 |