| Revisiting Estimation Bias in Policy Gradients for Deep Reinforcement Learning | Jan 20, 2023 | continuous-control, Continuous Control | —Unverified | 0 |
| Revisiting Experience Replayable Conditions | Feb 15, 2024 | Deep Reinforcement Learning, Metric Learning | —Unverified | 0 |
| Revisiting Space Mission Planning: A Reinforcement Learning-Guided Approach for Multi-Debris Rendezvous | Sep 25, 2024 | Deep Reinforcement Learning, reinforcement-learning | —Unverified | 0 |
| Revisiting the Master-Slave Architecture in Multi-Agent Deep Reinforcement Learning | Dec 20, 2017 | Deep Reinforcement Learning, reinforcement-learning | —Unverified | 0 |
| Reward Function Optimization of a Deep Reinforcement Learning Collision Avoidance System | Dec 1, 2022 | Collision Avoidance, Deep Reinforcement Learning | —Unverified | 0 |
| Rewarding Episodic Visitation Discrepancy for Exploration in Reinforcement Learning | Sep 19, 2022 | Atari Games, Benchmarking | —Unverified | 0 |
| Reward Shaping for Happier Autonomous Cyber Security Agents | Oct 20, 2023 | Deep Reinforcement Learning, reinforcement-learning | —Unverified | 0 |
| Reward Shifting for Optimistic Exploration and Conservative Exploitation | Sep 29, 2021 | continuous-control, Continuous Control | —Unverified | 0 |
| Rewards with Negative Examples for Reinforced Topic-Focused Abstractive Summarization | Nov 1, 2021 | Abstractive Text Summarization, Deep Reinforcement Learning | —Unverified | 0 |
| RGMDT: Return-Gap-Minimizing Decision Tree Extraction in Non-Euclidean Metric Space | Oct 21, 2024 | Clustering, D4RL | —Unverified | 0 |