| Policy Optimization with Model-based Explorations | Nov 18, 2018 | Atari GamesDecision Making | —Unverified | 0 | 0 |
| Policy Prediction Network: Model-Free Behavior Policy with Model-Based Learning in Continuous Action Space | Sep 15, 2019 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Policy-shaped prediction: avoiding distractions in model-based reinforcement learning | Dec 8, 2024 | Model-based Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Policy Tree Network | Sep 25, 2019 | Model-based Reinforcement LearningMuJoCo | —Unverified | 0 | 0 |
| Posterior Coreset Construction with Kernelized Stein Discrepancy for Model-Based Reinforcement Learning | Jun 2, 2022 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Predictive Control Using Learned State Space Models via Rolling Horizon Evolution | Jun 25, 2021 | Decision MakingModel-based Reinforcement Learning | —Unverified | 0 | 0 |
| Learning-to-defer for sequential medical decision-making under uncertainty | Sep 13, 2021 | Decision MakingDecision Making Under Uncertainty | —Unverified | 0 | 0 |
| Pre-Trained Video Generative Models as World Simulators | Feb 10, 2025 | Model-based Reinforcement Learning | —Unverified | 0 | 0 |
| PrivilegedDreamer: Explicit Imagination of Privileged Information for Rapid Adaptation of Learned Policies | Feb 17, 2025 | Autonomous DrivingDomain Adaptation | —Unverified | 0 | 0 |
| Procedural Generalization by Planning with Self-Supervised World Models | Nov 2, 2021 | BenchmarkingMeta-Learning | —Unverified | 0 | 0 |
| Prototypical context-aware dynamics generalization for high-dimensional model-based reinforcement learning | Nov 23, 2022 | Model-based Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Provably Efficient Causal Model-Based Reinforcement Learning for Systematic Generalization | Feb 14, 2022 | Decision MakingModel-based Reinforcement Learning | —Unverified | 0 | 0 |
| Provably Efficient Reinforcement Learning for Online Adaptive Influence Maximization | Jun 29, 2022 | Model-based Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Q-CP: Learning Action Values for Cooperative Planning | Mar 1, 2018 | Model-based Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| Raising Student Completion Rates with Adaptive Curriculum and Contextual Bandits | Jul 28, 2022 | Model-based Reinforcement LearningMulti-Armed Bandits | —Unverified | 0 | 0 |
| Raw2Drive: Reinforcement Learning with Aligned World Models for End-to-End Autonomous Driving (in CARLA v2) | May 22, 2025 | Autonomous DrivingBench2Drive | —Unverified | 0 | 0 |
| Ready Policy One: World Building Through Active Learning | Feb 7, 2020 | Active Learningcontinuous-control | —Unverified | 0 | 0 |
| Regularity as Intrinsic Reward for Free Play | Dec 3, 2023 | Model-based Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Regularizing Model-Based Planning with Energy-Based Models | Oct 12, 2019 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Regularizing Trajectory Optimization with Denoising Autoencoders | Mar 28, 2019 | DenoisingModel-based Reinforcement Learning | —Unverified | 0 | 0 |
| Reinforcement learning-based architecture search for quantum machine learning | Jun 4, 2024 | Model-based Reinforcement LearningQuantum Machine Learning | —Unverified | 0 | 0 |
| Infinite-Horizon Reinforcement Learning with Multinomial Logistic Function Approximation | Jun 19, 2024 | Model-based Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Reinforcement learning for non-prehensile manipulation: Transfer from simulation to physical system | Mar 28, 2018 | Model-based Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Reinforcement Learning on Computational Resource Allocation of Cloud-based Wireless Networks | Oct 10, 2020 | CPUManagement | —Unverified | 0 | 0 |
| Biomanufacturing Harvest Optimization with Small Data | Jan 11, 2021 | Decision MakingModel-based Reinforcement Learning | —Unverified | 0 | 0 |
| Reinforcement Learning with a Disentangled Universal Value Function for Item Recommendation | Apr 7, 2021 | Model-based Reinforcement LearningRecommendation Systems | —Unverified | 0 | 0 |
| Reinforcement Learning with Efficient Active Feature Acquisition | Nov 2, 2020 | Decision MakingModel-based Reinforcement Learning | —Unverified | 0 | 0 |
| Reinforcement Learning with Non-Exponential Discounting | Sep 27, 2022 | Decision MakingModel-based Reinforcement Learning | —Unverified | 0 | 0 |
| Reinforcement Twinning: from digital twins to model-based reinforcement learning | Nov 7, 2023 | Model-based Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Relative Entropy Regularized Reinforcement Learning for Efficient Encrypted Policy Synthesis | Jun 14, 2025 | Model-based Reinforcement LearningPrivacy Preserving | —Unverified | 0 | 0 |
| Replay Buffer with Local Forgetting for Adapting to Local Environment Changes in Deep Model-Based Reinforcement Learning | Mar 15, 2023 | Model-based Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Representation Balancing Offline Model-based Reinforcement Learning | Jan 1, 2021 | modelModel-based Reinforcement Learning | —Unverified | 0 | 0 |
| Revisiting Design Choices in Offline Model-Based Reinforcement Learning | Oct 8, 2021 | Bayesian OptimizationModel-based Reinforcement Learning | —Unverified | 0 | 0 |
| Revisiting Design Choices in Offline Model Based Reinforcement Learning | May 21, 2021 | Bayesian OptimizationModel-based Reinforcement Learning | —Unverified | 0 | 0 |
| Revisiting Model-based Value Expansion | Mar 28, 2022 | GPUmodel | —Unverified | 0 | 0 |
| Revisit Policy Optimization in Matrix Form | Sep 19, 2019 | FormModel-based Reinforcement Learning | —Unverified | 0 | 0 |
| Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning | Oct 9, 2023 | D4RLModel-based Reinforcement Learning | —Unverified | 0 | 0 |
| Reward-Free Model-Based Reinforcement Learning with Linear Function Approximation | Oct 12, 2021 | Model-based Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Reward-Respecting Subtasks for Model-Based Reinforcement Learning | Feb 7, 2022 | Model-based Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Risk Averse Value Expansion for Sample Efficient and Robust Policy Learning | Sep 25, 2019 | Model-based Reinforcement LearningMuJoCo | —Unverified | 0 | 0 |
| Risk-Sensitive and Robust Model-Based Reinforcement Learning and Planning | Apr 2, 2023 | Decision MakingModel-based Reinforcement Learning | —Unverified | 0 | 0 |
| Risk Sensitive Model-Based Reinforcement Learning using Uncertainty Guided Planning | Nov 9, 2021 | Model-based Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| RLOC: Neurobiologically Inspired Hierarchical Reinforcement Learning Algorithm for Continuous Control of Nonlinear Dynamical Systems | Mar 7, 2019 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| RL-QN: A Reinforcement Learning Framework for Optimal Control of Queueing Systems | Nov 14, 2020 | Model-based Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Robo-PlaNet: Learning to Poke in a Day | Nov 9, 2019 | Model-based Reinforcement LearningPosition | —Unverified | 0 | 0 |
| Robotic World Model: A Neural Network Simulator for Robust Policy Optimization in Robotics | Jan 17, 2025 | Model-based Reinforcement Learning | —Unverified | 0 | 0 |
| Decision-Focused Model-based Reinforcement Learning for Reward Transfer | Apr 6, 2023 | Decision MakingModel-based Reinforcement Learning | —Unverified | 0 | 0 |
| Robust Model Based Reinforcement Learning Using L_1 Adaptive Control | Mar 21, 2024 | Model-based Reinforcement LearningMuJoCo | —Unverified | 0 | 0 |
| Robust Model-based Reinforcement Learning for Autonomous Greenhouse Control | Aug 26, 2021 | Decision MakingModel-based Reinforcement Learning | —Unverified | 0 | 0 |
| S2VG: Soft Stochastic Value Gradient method | Sep 25, 2019 | Model-based Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |