| Proto-Value Networks: Scaling Representation Learning with Auxiliary Tasks | Apr 25, 2023 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Provable Performance Bounds for Digital Twin-driven Deep Reinforcement Learning in Wireless Networks: A Novel Digital-Twin Bisimulation Metric | Feb 25, 2025 | Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Provably Efficient Causal Reinforcement Learning with Confounded Observational Data | Jun 22, 2020 | Autonomous DrivingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Provably Efficient Representation Selection in Low-rank Markov Decision Processes: From Online to Offline RL | Jun 22, 2021 | Deep Reinforcement LearningOffline RL | —Unverified | 0 | 0 |
| Provably Safe Deep Reinforcement Learning for Robotic Manipulation in Human Environments | May 12, 2022 | Deep Reinforcement LearningMotion Planning | —Unverified | 0 | 0 |
| Proximal Policy Optimization Based Reinforcement Learning for Joint Bidding in Energy and Frequency Regulation Markets | Dec 13, 2022 | Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Proximal Policy Optimization via Enhanced Exploration Efficiency | Nov 11, 2020 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Proximal Policy Optimization with Adaptive Threshold for Symmetric Relative Density Ratio | Mar 18, 2022 | Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Proximal Policy Optimization with Graph Neural Networks for Optimal Power Flow | Dec 23, 2022 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Proximal Policy Optimization with Relative Pearson Divergence | Oct 7, 2020 | Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Proxy Experience Replay: Federated Distillation for Distributed Reinforcement Learning | May 13, 2020 | ClusteringData Augmentation | —Unverified | 0 | 0 |
| Pseudo-Model-Free Hedging for Variable Annuities via Deep Reinforcement Learning | Jul 7, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Psychotherapy AI Companion with Reinforcement Learning Recommendations and Interpretable Policy Dynamics | Mar 16, 2023 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| PTR-PPO: Proximal Policy Optimization with Prioritized Trajectory Replay | Dec 7, 2021 | Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Pull-Based Query Scheduling for Goal-Oriented Semantic Communication | Mar 9, 2025 | Deep Reinforcement LearningScheduling | —Unverified | 0 | 0 |
| Puppeteer and Marionette: Learning Anticipatory Quadrupedal Locomotion Based on Interactions of a Central Pattern Generator and Supraspinal Drive | Feb 26, 2023 | Deep Reinforcement LearningModel Predictive Control | —Unverified | 0 | 0 |
| Putting the Iterative Training of Decision Trees to the Test on a Real-World Robotic Task | Dec 6, 2024 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| PyDCM: Custom Data Center Models with Reinforcement Learning for Sustainability | Oct 5, 2023 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| QAmplifyNet: Pushing the Boundaries of Supply Chain Backorder Prediction Using Interpretable Hybrid Quantum-Classical Neural Network | Jul 24, 2023 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Qd-tree: Learning Data Layouts for Big Data Analytics | Apr 22, 2020 | BlockingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Qgraph-bounded Q-learning: Stabilizing Model-Free Off-Policy Deep Reinforcement Learning | Jul 15, 2020 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| Q-LDA: Uncovering Latent Patterns in Text-based Sequential Decision Processes | Dec 1, 2017 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Q-learning as a monotone scheme | May 30, 2024 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| QoE Optimization for Live Video Streaming in UAV-to-UAV Communications via Deep Reinforcement Learning | Feb 21, 2021 | Deep Reinforcement Learning | —Unverified | 0 | 0 |
| QoS and Jamming-Aware Wireless Networking Using Deep Reinforcement Learning | Oct 13, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |