| Continuous Neural Algorithmic Planners | Nov 29, 2022 | continuous-controlContinuous Control | —Unverified | 0 |
| Continuous Mean-Zero Disagreement-Regularized Imitation Learning (CMZ-DRIL) | Mar 2, 2024 | Imitation LearningMuJoCo | —Unverified | 0 |
| A Unified Bellman Optimality Principle Combining Reward Maximization and Empowerment | Jul 26, 2019 | MuJoCoReinforcement Learning | —Unverified | 0 |
| Improving Context-Based Meta-Reinforcement Learning with Self-Supervised Trajectory Contrastive Learning | Mar 10, 2021 | Contrastive LearningMeta Reinforcement Learning | —Unverified | 0 |
| Improving Actor-Critic Reinforcement Learning via Hamiltonian Monte Carlo Method | Mar 22, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Continuously Discovering Novel Strategies via Reward-Switching Policy Optimization | Apr 4, 2022 | continuous-controlContinuous Control | —Unverified | 0 |
| Improved Soft Actor-Critic: Mixing Prioritized Off-Policy Samples with On-Policy Experience | Sep 24, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Improved Communication Efficiency in Federated Natural Policy Gradient via ADMM-based Gradient Updates | Oct 9, 2023 | MuJoCo | —Unverified | 0 |
| Attraction-Repulsion Actor-Critic for Continuous Control Reinforcement Learning | Sep 17, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| Adversarial Imitation Learning via Random Search | Aug 21, 2020 | Computational EfficiencyDeep Reinforcement Learning | —Unverified | 0 |