| Hierarchical RL Using an Ensemble of Proprioceptive Periodic Policies | May 1, 2019 | MuJoCo | —Unverified | 0 | 0 |
| Hierarchical Subspaces of Policies for Continual Offline Reinforcement Learning | Dec 19, 2024 | Continual LearningMuJoCo | —Unverified | 0 | 0 |
| Hindsight Experience Replay with Kronecker Product Approximate Curvature | Oct 9, 2020 | MuJoCo | —Unverified | 0 | 0 |
| Hindsight Reward Tweaking via Conditional Deep Reinforcement Learning | Sep 6, 2021 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 | 0 |
| Hybrid Value Estimation for Off-policy Evaluation and Offline Reinforcement Learning | Jun 4, 2022 | MuJoCoOff-policy evaluation | —Unverified | 0 | 0 |
| Hypothesis Driven Coordinate Ascent for Reinforcement Learning | Sep 29, 2021 | MuJoCoOpenAI Gym | —Unverified | 0 | 0 |
| IL-SOAR : Imitation Learning with Soft Optimistic Actor cRitic | Feb 27, 2025 | Imitation LearningMuJoCo | —Unverified | 0 | 0 |
| Imitating Opponent to Win: Adversarial Policy Imitation Learning in Two-player Competitive Games | Oct 30, 2022 | Deep Reinforcement LearningImitation Learning | —Unverified | 0 | 0 |
| Imitation from Diverse Behaviors: Wasserstein Quality Diversity Imitation Learning with Single-Step Archive Exploration | Nov 11, 2024 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Forward and inverse reinforcement learning sharing network weights and hyperparameters | Aug 17, 2020 | Imitation LearningMuJoCo | —Unverified | 0 | 0 |