| Policy Tree Network | Sep 25, 2019 | Model-based Reinforcement LearningMuJoCo | —Unverified | 0 |
| Population-Guided Imitation Learning | Sep 27, 2020 | Atari GamesImitation Learning | —Unverified | 0 |
| Practical Marginalized Importance Sampling with the Successor Representation | Jan 1, 2021 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| Multi-task Batch Reinforcement Learning with Metric Learning | Sep 25, 2019 | Meta Reinforcement LearningMetric Learning | —Unverified | 0 |
| Improved Robustness and Safety for Pre-Adaptation of Meta Reinforcement Learning with Prior Regularization | Aug 19, 2021 | Autonomous VehiclesDecision Making | —Unverified | 0 |
| Prompting Decision Transformer for Few-Shot Policy Generalization | Jun 27, 2022 | Few-Shot LearningInductive Bias | —Unverified | 0 |
| Proximal Policy Optimization via Enhanced Exploration Efficiency | Nov 11, 2020 | continuous-controlContinuous Control | —Unverified | 0 |
| Pure Planning to Pure Policies and In Between with a Recursive Tree Planner | May 21, 2024 | MuJoCo | —Unverified | 0 |
| Quality Diversity Imitation Learning | Oct 8, 2024 | continuous-controlContinuous Control | —Unverified | 0 |
| Learning Policies through Quantile Regression | Jun 27, 2019 | MuJoCoquantile regression | —Unverified | 0 |
| Reward Prediction Error as an Exploration Objective in Deep RL | Jun 19, 2019 | Atari GamesContinuous Control | —Unverified | 0 |
| α-Rank: Multi-Agent Evaluation by Evolution | Mar 4, 2019 | Mathematical ProofsMuJoCo | —Unverified | 0 |
| RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted Behaviors | Dec 14, 2024 | Adversarial AttackDeep Reinforcement Learning | —Unverified | 0 |
| Recruitment-imitation Mechanism for Evolutionary Reinforcement Learning | Dec 13, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| Recursive Least Squares Advantage Actor-Critic Algorithms | Jan 15, 2022 | Computational Efficiencycontinuous-control | —Unverified | 0 |
| Regularly Updated Deterministic Policy Gradient Algorithm | Jul 1, 2020 | MuJoCoQ-Learning | —Unverified | 0 |
| Regulatory Focus: Promotion and Prevention Inclinations in Policy Search | Sep 25, 2019 | Atari Gamescontinuous-control | —Unverified | 0 |
| Reinforcement Learning using Guided Observability | Apr 22, 2021 | Decision MakingMuJoCo | —Unverified | 0 |
| Relationship Explainable Multi-objective Reinforcement Learning with Semantic Explainability Generation | Sep 26, 2019 | MuJoCoMulti-Objective Reinforcement Learning | —Unverified | 0 |
| Relative Policy-Transition Optimization for Fast Policy Transfer | Jun 13, 2022 | continuous-controlContinuous Control | —Unverified | 0 |
| Residual Policy Gradient: A Reward View of KL-regularized Objective | Mar 14, 2025 | Imitation LearningMuJoCo | —Unverified | 0 |
| Resolving Copycat Problems in Visual Imitation Learning via Residual Action Prediction | Jul 20, 2022 | Imitation LearningMuJoCo | —Unverified | 0 |
| Reward function shape exploration in adversarial imitation learning: an empirical study | Apr 14, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Reward Shaping Using Convolutional Neural Network | Oct 30, 2022 | MuJoCo | —Unverified | 0 |
| Risk Averse Value Expansion for Sample Efficient and Robust Policy Learning | Sep 25, 2019 | Model-based Reinforcement LearningMuJoCo | —Unverified | 0 |
| Risk-Sensitive Generative Adversarial Imitation Learning | Aug 13, 2018 | Imitation LearningMuJoCo | —Unverified | 0 |
| Surfer: Progressive Reasoning with World Models for Robotic Manipulation | Jun 20, 2023 | Decision MakingMuJoCo | —Unverified | 0 |
| Robust Adversarial Reinforcement Learning via Bounded Rationality Curricula | Nov 3, 2023 | MuJoCoreinforcement-learning | —Unverified | 0 |
| Robust Constrained Reinforcement Learning for Continuous Control with Model Misspecification | Oct 20, 2020 | continuous-controlContinuous Control | —Unverified | 0 |
| On the Benefits of Inducing Local Lipschitzness for Robust Generative Adversarial Imitation Learning | Jun 30, 2021 | Imitation LearningMuJoCo | —Unverified | 0 |
| Robust Imitation of Diverse Behaviors | Jul 10, 2017 | Imitation LearningMuJoCo | —Unverified | 0 |
| Robust Model Based Reinforcement Learning Using L_1 Adaptive Control | Mar 21, 2024 | Model-based Reinforcement LearningMuJoCo | —Unverified | 0 |
| Robust Offline Reinforcement Learning with Gradient Penalty and Constraint Relaxation | Oct 19, 2022 | D4RLMuJoCo | —Unverified | 0 |
| Robust Reinforcement Learning for Continuous Control with Model Misspecification | Jun 18, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| Robust Reinforcement Learning through Efficient Adversarial Herding | Jun 12, 2023 | MuJoCoreinforcement-learning | —Unverified | 0 |
| rQdia: Regularizing Q-Value Distributions With Image Augmentation | Jun 26, 2025 | continuous-controlContinuous Control | —Unverified | 0 |
| Safe adaptation in multiagent competition | Mar 14, 2022 | MuJoCo | —Unverified | 0 |
| Safe Domain Randomization via Uncertainty-Aware Out-of-Distribution Detection and Policy Adaptation | Jul 8, 2025 | MuJoCoOut-of-Distribution Detection | —Unverified | 0 |
| Safe Policy Learning for Continuous Control | Sep 25, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| SALE-Based Offline Reinforcement Learning with Ensemble Q-Networks | Jan 7, 2025 | D4RLDiversity | —Unverified | 0 |
| Sample-efficient Adversarial Imitation Learning | Mar 14, 2023 | Decision MakingImitation Learning | —Unverified | 0 |
| Scalable Multi-agent Covering Option Discovery based on Kronecker Graphs | Jul 21, 2023 | MuJoCoRepresentation Learning | —Unverified | 0 |
| SEERL: Sample Efficient Ensemble Reinforcement Learning | Jan 15, 2020 | continuous-controlContinuous Control | —Unverified | 0 |
| Self-Supervised Continuous Control without Policy Gradient | Jan 1, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Relevance-Guided Modeling of Object Dynamics for Reinforcement Learning | Mar 3, 2020 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| Taming Multi-Agent Reinforcement Learning with Estimator Variance Reduction | Sep 2, 2022 | MuJoCoMulti-agent Reinforcement Learning | —Unverified | 0 |
| SEREN: Knowing When to Explore and When to Exploit | May 30, 2022 | MuJoCoReinforcement Learning (RL) | —Unverified | 0 |
| Similarity-based Knowledge Transfer for Cross-Domain Reinforcement Learning | Dec 5, 2023 | MuJoCoreinforcement-learning | —Unverified | 0 |
| Simple Emergent Action Representations from Multi-Task Policy Training | Oct 18, 2022 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| Simultaneous Training of First- and Second-Order Optimizers in Population-Based Reinforcement Learning | Aug 27, 2024 | MuJoCoReinforcement Learning (RL) | —Unverified | 0 |