| Reinforcement Learning for Robotics and Control with Active Uncertainty Reduction | May 15, 2019 | ManagementOpenAI Gym | —Unverified | 0 | 0 |
| Reinforcement Learning for Safe Occupancy Strategies in Educational Spaces during an Epidemic | Dec 23, 2023 | ManagementQ-Learning | —Unverified | 0 | 0 |
| Reinforcement Learning for Slate-based Recommender Systems: A Tractable Decomposition and Practical Methodology | May 29, 2019 | Q-LearningRecommendation Systems | —Unverified | 0 | 0 |
| Reinforcement Learning for Stock Transactions | May 22, 2025 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Reinforcement Learning for Task Specifications with Action-Constraints | Jan 2, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Reinforcement Learning for Thermostatically Controlled Loads Control using Modelica and Python | May 9, 2020 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Reinforcement Learning for Traffic Signal Control: Comparison with Commercial Systems | Apr 21, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Reinforcement Learning from Diffusion Feedback: Q* for Image Search | Nov 27, 2023 | Data AugmentationDiversity | —Unverified | 0 | 0 |
| Deep Reinforcement Learning for FlipIt Security Game | Feb 28, 2020 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| Reinforcement Learning in Non-Markovian Environments | Nov 3, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Reinforcement Learning in R | Sep 29, 2018 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Reinforcement Learning in Switching Non-Stationary Markov Decision Processes: Algorithms and Convergence Analysis | Mar 24, 2025 | Decision MakingQ-Learning | —Unverified | 0 | 0 |
| Reinforcement Learning Models of Human Behavior: Reward Processing in Mental Disorders | Sep 11, 2019 | Decision MakingQ-Learning | —Unverified | 0 | 0 |
| Reinforcement Learning of Markov Decision Processes with Peak Constraints | Jan 23, 2019 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Reinforcement Learning Problem Solving with Large Language Models | Apr 29, 2024 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| On-demand Cold Start Frequency Reduction with Off-Policy Reinforcement Learning in Serverless Computing | Aug 15, 2023 | Cloud ComputingCPU | —Unverified | 0 | 0 |
| Reinforcement learning to maximise wind turbine energy generation | Feb 17, 2024 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Reinforcement Learning: Tutorial and Survey | Jul 18, 2024 | Deep Reinforcement LearningGeneral Reinforcement Learning | —Unverified | 0 | 0 |
| Reinforcement Learning under Model Mismatch | Jun 15, 2017 | modelQ-Learning | —Unverified | 0 | 0 |
| Reinforcement Learning under Partial Observability Guided by Learned Environment Models | Jun 23, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Reinforcement Learning using Augmented Neural Networks | Jun 20, 2018 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Reinforcement learning using Deep Q Networks and Q learning accurately localizes brain tumors on MRI with very small training sets | Oct 21, 2020 | Keypoint DetectionQ-Learning | —Unverified | 0 | 0 |
| Reinforcement Learning with Expert Trajectory For Quantitative Trading | May 9, 2021 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Reinforcement Learning with External Knowledge and Two-Stage Q-functions for Predicting Popular Reddit Threads | Apr 20, 2017 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Reinforcement Learning With Reward Machines in Stochastic Games | May 27, 2023 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| Reinforcement Learning with Structured Hierarchical Grammar Representations of Actions | Oct 7, 2019 | Atari GamesQ-Learning | —Unverified | 0 | 0 |
| Reinforcenment Learning-Aided NOMA Random Access: An AoI-Based Timeliness Perspective | Oct 4, 2024 | Q-Learning | —Unverified | 0 | 0 |
| A Framework of decision-relevant observability: Reinforcement Learning converges under relative ignorability | Apr 10, 2025 | Causal InferenceDecision Making | —Unverified | 0 | 0 |
| RELS-DQN: A Robust and Efficient Local Search Framework for Combinatorial Optimization | Apr 11, 2023 | Combinatorial OptimizationMarketing | —Unverified | 0 | 0 |
| Replay For Safety | Dec 8, 2021 | Q-Learning | —Unverified | 0 | 0 |
| Representation Learning for Context-Dependent Decision-Making | May 12, 2022 | Decision MakingQ-Learning | —Unverified | 0 | 0 |
| Representing Entropy : A short proof of the equivalence between soft Q-learning and policy gradients | Jan 1, 2018 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Reputation Bootstrapping for Composite Services using CP-nets | May 27, 2021 | Q-Learning | —Unverified | 0 | 0 |
| Residual Policy Gradient: A Reward View of KL-regularized Objective | Mar 14, 2025 | Imitation LearningMuJoCo | —Unverified | 0 | 0 |
| Residual Q-Learning: Offline and Online Policy Customization without Value | Jun 15, 2023 | Imitation LearningQ-Learning | —Unverified | 0 | 0 |
| Resilient UAV Trajectory Planning via Few-Shot Meta-Offline Reinforcement Learning | Feb 3, 2025 | Meta-LearningOffline RL | —Unverified | 0 | 0 |
| The state-of-the-art review on resource allocation problem using artificial intelligence methods on various computing paradigms | Mar 23, 2022 | Cloud ComputingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| REValueD: Regularised Ensemble Value-Decomposition for Factorisable Markov Decision Processes | Jan 16, 2024 | Multi-agent Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| Reverse Experience Replay | Oct 19, 2019 | Q-Learning | —Unverified | 0 | 0 |
| Reversible Action Design for Combinatorial Optimization with Reinforcement Learning | Feb 14, 2021 | Combinatorial OptimizationQ-Learning | —Unverified | 0 | 0 |
| Reversible Action Design for Combinatorial Optimization with ReinforcementLearning | Nov 24, 2021 | Combinatorial OptimizationQ-Learning | —Unverified | 0 | 0 |
| Reward-Directed Score-Based Diffusion Models via q-Learning | Sep 7, 2024 | Q-LearningReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Risk-Averse Reinforcement Learning via Dynamic Time-Consistent Risk Measures | Jan 14, 2023 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Risk-Sensitive Compact Decision Trees for Autonomous Execution in Presence of Simulated Market Response | Jun 5, 2019 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Risk-sensitive Reinforcement Learning | Nov 8, 2013 | Decision MakingQ-Learning | —Unverified | 0 | 0 |
| Risk-Sensitive Reinforcement Learning: Near-Optimal Risk-Sample Tradeoff in Regret | Jun 22, 2020 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| RL-GA: A Reinforcement Learning-Based Genetic Algorithm for Electromagnetic Detection Satellite Scheduling Problem | Jun 12, 2022 | Q-Learningreinforcement-learning | —Unverified | 0 | 0 |
| Robbins-Monro conditions for persistent exploration learning strategies | Aug 1, 2018 | Q-Learning | —Unverified | 0 | 0 |
| Robotic Search & Rescue via Online Multi-task Reinforcement Learning | Nov 29, 2015 | Lifelong learningQ-Learning | —Unverified | 0 | 0 |
| Robust and Data-efficient Q-learning by Composite Value-estimation | Sep 29, 2021 | Q-Learning | —Unverified | 0 | 0 |