| Gaussian Process Policy Optimization | Mar 2, 2020 | MuJoCo, reinforcement-learning | —Unverified | 0 | 0 |
| Generalized Hidden Parameter MDPs: Transferable Model-based RL in a Handful of Trials | Feb 8, 2020 | MuJoCo | —Unverified | 0 | 0 |
| Generalized Maximum Entropy Reinforcement Learning via Reward Shaping | Sep 29, 2021 | MuJoCo, reinforcement-learning | —Unverified | 0 | 0 |
| Generalized Off-Policy Actor-Critic | Mar 27, 2019 | counterfactual, MuJoCo | —Unverified | 0 | 0 |
| Generative Adversarial Self-Imitation Learning | Dec 3, 2018 | Imitation Learning, MuJoCo | —Unverified | 0 | 0 |
| Genetic Imitation Learning by Reward Extrapolation | Jan 3, 2023 | Imitation Learning, MuJoCo | —Unverified | 0 | 0 |
| Global Convergence of Natural Policy Gradient with Hessian-aided Momentum Variance Reduction | Jan 2, 2024 | MuJoCo, Policy Gradient Methods | —Unverified | 0 | 0 |
| Good Better Best: Self-Motivated Imitation Learning for noisy Demonstrations | Oct 24, 2023 | Imitation Learning, MuJoCo | —Unverified | 0 | 0 |
| Gradientless Descent: High-Dimensional Zeroth-Order Optimization | Nov 14, 2019 | MuJoCo, Vocal Bursts Intensity Prediction | —Unverified | 0 | 0 |
| Gradient Monitored Reinforcement Learning | May 25, 2020 | Atari Games, continuous-control | —Unverified | 0 | 0 |
| GRI: General Reinforced Imitation and its Application to Vision-Based Autonomous Driving | Nov 16, 2021 | Autonomous Driving, CARLA MAP Leaderboard | —Unverified | 0 | 0 |
| Group Distributionally Robust Reinforcement Learning with Hierarchical Latent Variables | Oct 21, 2022 | MuJoCo, reinforcement-learning | —Unverified | 0 | 0 |
| GST: Group-Sparse Training for Accelerating Deep Reinforcement Learning | Jan 24, 2021 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Hamiltonian Policy Optimization | Feb 28, 2021 | continuous-control, Continuous Control | —Unverified | 0 | 0 |
| Hamiltonian Policy Optimization in Reinforcement Learning | Mar 23, 2021 | continuous-control, Continuous Control | —Unverified | 0 | 0 |
| Hard Contacts with Soft Gradients: Refining Differentiable Simulators for Learning and Control | Jun 17, 2025 | MuJoCo | —Unverified | 0 | 0 |
| Hellinger Distance Constrained Regression | Jan 1, 2021 | MuJoCo, regression | —Unverified | 0 | 0 |
| Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to Cooperative MARL | Aug 2, 2022 | MuJoCo, Multi-agent Reinforcement Learning | —Unverified | 0 | 0 |
| Hierarchical Prompt Decision Transformer: Improving Few-Shot Policy Generalization with Global and Adaptive Guidance | Dec 1, 2024 | MuJoCo | —Unverified | 0 | 0 |
| Hierarchical Reinforcement Learning of Locomotion Policies in Response to Approaching Objects: A Preliminary Study | Mar 20, 2022 | Deep Reinforcement Learning, Hierarchical Reinforcement Learning | —Unverified | 0 | 0 |
| Hierarchical RL Using an Ensemble of Proprioceptive Periodic Policies | May 1, 2019 | MuJoCo | —Unverified | 0 | 0 |
| Hierarchical Subspaces of Policies for Continual Offline Reinforcement Learning | Dec 19, 2024 | Continual Learning, MuJoCo | —Unverified | 0 | 0 |
| Hindsight Experience Replay with Kronecker Product Approximate Curvature | Oct 9, 2020 | MuJoCo | —Unverified | 0 | 0 |
| Hindsight Reward Tweaking via Conditional Deep Reinforcement Learning | Sep 6, 2021 | Deep Reinforcement Learning, MuJoCo | —Unverified | 0 | 0 |
| Hybrid Value Estimation for Off-policy Evaluation and Offline Reinforcement Learning | Jun 4, 2022 | MuJoCo, Off-policy evaluation | —Unverified | 0 | 0 |
| Hypothesis Driven Coordinate Ascent for Reinforcement Learning | Sep 29, 2021 | MuJoCo, OpenAI Gym | —Unverified | 0 | 0 |
| IL-SOAR : Imitation Learning with Soft Optimistic Actor cRitic | Feb 27, 2025 | Imitation Learning, MuJoCo | —Unverified | 0 | 0 |
| Imitating Opponent to Win: Adversarial Policy Imitation Learning in Two-player Competitive Games | Oct 30, 2022 | Deep Reinforcement Learning, Imitation Learning | —Unverified | 0 | 0 |
| Imitation from Diverse Behaviors: Wasserstein Quality Diversity Imitation Learning with Single-Step Archive Exploration | Nov 11, 2024 | continuous-control, Continuous Control | —Unverified | 0 | 0 |
| Forward and inverse reinforcement learning sharing network weights and hyperparameters | Aug 17, 2020 | Imitation Learning, MuJoCo | —Unverified | 0 | 0 |
| Imitation Learning from Video by Leveraging Proprioception | May 22, 2019 | Imitation Learning, MuJoCo | —Unverified | 0 | 0 |
| Improved Communication Efficiency in Federated Natural Policy Gradient via ADMM-based Gradient Updates | Oct 9, 2023 | MuJoCo | —Unverified | 0 | 0 |
| Improved Soft Actor-Critic: Mixing Prioritized Off-Policy Samples with On-Policy Experience | Sep 24, 2021 | continuous-control, Continuous Control | —Unverified | 0 | 0 |
| Improving Actor-Critic Reinforcement Learning via Hamiltonian Monte Carlo Method | Mar 22, 2021 | continuous-control, Continuous Control | —Unverified | 0 | 0 |
| Improving Context-Based Meta-Reinforcement Learning with Self-Supervised Trajectory Contrastive Learning | Mar 10, 2021 | Contrastive Learning, Meta Reinforcement Learning | —Unverified | 0 | 0 |
| Improving Learning from Demonstrations by Learning from Experience | Nov 16, 2021 | Imitation Learning, MuJoCo | —Unverified | 0 | 0 |
| Improving On-policy Learning with Statistical Reward Accumulation | Sep 7, 2018 | Atari Games, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Improving Sample Efficiency in Evolutionary RL Using Off-Policy Ranking | Aug 22, 2022 | MuJoCo, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| Inferring DQN structure for high-dimensional continuous control | Jan 1, 2020 | continuous-control, Continuous Control | —Unverified | 0 | 0 |
| Intrinsically Guided Exploration in Meta Reinforcement Learning | Jan 1, 2021 | Deep Reinforcement Learning, Efficient Exploration | —Unverified | 0 | 0 |
| Invariant Representations for Reinforcement Learning without Reconstruction | Jan 1, 2021 | Causal Inference, MuJoCo | —Unverified | 0 | 0 |
| Inverse Delayed Reinforcement Learning | Dec 4, 2024 | MuJoCo, reinforcement-learning | —Unverified | 0 | 0 |
| Inverse Reinforcement Learning from a Gradient-based Learner | Jul 15, 2020 | MuJoCo, reinforcement-learning | —Unverified | 0 | 0 |
| IRIS: An Immersive Robot Interaction System | Feb 5, 2025 | MuJoCo, Unity | —Unverified | 0 | 0 |
| Iterated Q-Network: Beyond One-Step Bellman Updates in Deep Reinforcement Learning | Mar 4, 2024 | Atari Games, continuous-control | —Unverified | 0 | 0 |
| Iterative Reachability Estimation for Safe Reinforcement Learning | Sep 24, 2023 | MuJoCo, reinforcement-learning | —Unverified | 0 | 0 |
| Learning Reward and Policy Jointly from Demonstration and Preference Improves Alignment | Jun 11, 2024 | MuJoCo, reinforcement-learning | —Unverified | 0 | 0 |
| Keyframe-Focused Visual Imitation Learning | Jun 11, 2021 | continuous-control, Continuous Control | —Unverified | 0 | 0 |
| Language to Rewards for Robotic Skill Synthesis | Jun 14, 2023 | In-Context Learning, Logical Reasoning | —Unverified | 0 | 0 |
| Latent Space Energy-based Neural ODEs | Sep 5, 2024 | MuJoCo | —Unverified | 0 | 0 |