| Generalized Maximum Entropy Reinforcement Learning via Reward Shaping | Sep 29, 2021 | MuJoCoreinforcement-learning | —Unverified | 0 |
| Hypothesis Driven Coordinate Ascent for Reinforcement Learning | Sep 29, 2021 | MuJoCoOpenAI Gym | —Unverified | 0 |
| Fight fire with fire: countering bad shortcuts in imitation learning with good shortcuts | Sep 29, 2021 | Autonomous Drivingcontinuous-control | —Unverified | 0 |
| Auto-Encoding Inverse Reinforcement Learning | Sep 29, 2021 | Decision MakingImitation Learning | —Unverified | 0 |
| Maximizing Ensemble Diversity in Deep Reinforcement Learning | Sep 29, 2021 | Atari GamesDecision Making | —Unverified | 0 |
| SPP-RL: State Planning Policy Reinforcement Learning | Sep 29, 2021 | MuJoCoreinforcement-learning | —Unverified | 0 |
| OVD-Explorer: A General Information-theoretic Exploration Approach for Reinforcement Learning | Sep 29, 2021 | MuJoCoreinforcement-learning | —Unverified | 0 |
| Distributional Decision Transformer for Hindsight Information Matching | Sep 29, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Diverse Imitation Learning via Self-OrganizingGenerative Models | Sep 29, 2021 | Imitation LearningMuJoCo | —Unverified | 0 |
| Evaluating Robustness of Cooperative MARL | Sep 29, 2021 | continuous-controlContinuous Control | —Unverified | 0 |