| Diverse Imitation Learning via Self-Organizing Generative Models | Sep 29, 2021 | Imitation Learning, MuJoCo | Unverified | 0 |
| Evaluating Robustness of Cooperative MARL | Sep 29, 2021 | continuous-control, Continuous Control | Unverified | 0 |
| Fight fire with fire: countering bad shortcuts in imitation learning with good shortcuts | Sep 29, 2021 | Autonomous Driving, continuous-control | Unverified | 0 |
| Hypothesis Driven Coordinate Ascent for Reinforcement Learning | Sep 29, 2021 | MuJoCo, OpenAI Gym | Unverified | 0 |
| Maximizing Ensemble Diversity in Deep Reinforcement Learning | Sep 29, 2021 | Atari Games, Decision Making | Unverified | 0 |
| OVD-Explorer: A General Information-theoretic Exploration Approach for Reinforcement Learning | Sep 29, 2021 | MuJoCo, reinforcement-learning | Unverified | 0 |
| SPP-RL: State Planning Policy Reinforcement Learning | Sep 29, 2021 | MuJoCo, reinforcement-learning | Unverified | 0 |
| Efficiently Training On-Policy Actor-Critic Networks in Robotic Deep Reinforcement Learning with Demonstration-like Sampled Exploration | Sep 27, 2021 | Deep Reinforcement Learning, MuJoCo | Unverified | 0 |
| Improved Soft Actor-Critic: Mixing Prioritized Off-Policy Samples with On-Policy Experience | Sep 24, 2021 | continuous-control, Continuous Control | Unverified | 0 |
| Parameter-free Reduction of the Estimation Bias in Deep Reinforcement Learning for Deterministic Policy Gradients | Sep 24, 2021 | continuous-control, Continuous Control | Code Available | 0 |