| Reward Prediction Error as an Exploration Objective in Deep RL | Jun 19, 2019 | Atari GamesContinuous Control | —Unverified | 0 |
| α-Rank: Multi-Agent Evaluation by Evolution | Mar 4, 2019 | Mathematical ProofsMuJoCo | —Unverified | 0 |
| RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted Behaviors | Dec 14, 2024 | Adversarial AttackDeep Reinforcement Learning | —Unverified | 0 |
| Recruitment-imitation Mechanism for Evolutionary Reinforcement Learning | Dec 13, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| Recursive Least Squares Advantage Actor-Critic Algorithms | Jan 15, 2022 | Computational Efficiencycontinuous-control | —Unverified | 0 |
| Regularly Updated Deterministic Policy Gradient Algorithm | Jul 1, 2020 | MuJoCoQ-Learning | —Unverified | 0 |
| Regulatory Focus: Promotion and Prevention Inclinations in Policy Search | Sep 25, 2019 | Atari Gamescontinuous-control | —Unverified | 0 |
| Reinforcement Learning using Guided Observability | Apr 22, 2021 | Decision MakingMuJoCo | —Unverified | 0 |
| Relationship Explainable Multi-objective Reinforcement Learning with Semantic Explainability Generation | Sep 26, 2019 | MuJoCoMulti-Objective Reinforcement Learning | —Unverified | 0 |
| Relative Policy-Transition Optimization for Fast Policy Transfer | Jun 13, 2022 | continuous-controlContinuous Control | —Unverified | 0 |