| Proto-Value Networks: Scaling Representation Learning with Auxiliary Tasks | Apr 25, 2023 | Atari Games, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Provable Performance Bounds for Digital Twin-driven Deep Reinforcement Learning in Wireless Networks: A Novel Digital-Twin Bisimulation Metric | Feb 25, 2025 | Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Provably Efficient Causal Reinforcement Learning with Confounded Observational Data | Jun 22, 2020 | Autonomous Driving, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Provably Efficient Representation Selection in Low-rank Markov Decision Processes: From Online to Offline RL | Jun 22, 2021 | Deep Reinforcement Learning, Offline RL | —Unverified | 0 | 0 |
| Provably Safe Deep Reinforcement Learning for Robotic Manipulation in Human Environments | May 12, 2022 | Deep Reinforcement Learning, Motion Planning | —Unverified | 0 | 0 |
| Proximal Policy Optimization Based Reinforcement Learning for Joint Bidding in Energy and Frequency Regulation Markets | Dec 13, 2022 | Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Proximal Policy Optimization via Enhanced Exploration Efficiency | Nov 11, 2020 | continuous-control, Continuous Control | —Unverified | 0 | 0 |
| Proximal Policy Optimization with Adaptive Threshold for Symmetric Relative Density Ratio | Mar 18, 2022 | Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Proximal Policy Optimization with Graph Neural Networks for Optimal Power Flow | Dec 23, 2022 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Proximal Policy Optimization with Relative Pearson Divergence | Oct 7, 2020 | Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Proxy Experience Replay: Federated Distillation for Distributed Reinforcement Learning | May 13, 2020 | Clustering, Data Augmentation | —Unverified | 0 | 0 |
| Pseudo-Model-Free Hedging for Variable Annuities via Deep Reinforcement Learning | Jul 7, 2021 | Deep Reinforcement Learning, reinforcement-learning | —Unverified | 0 | 0 |
| Psychotherapy AI Companion with Reinforcement Learning Recommendations and Interpretable Policy Dynamics | Mar 16, 2023 | Deep Reinforcement Learning, reinforcement-learning | —Unverified | 0 | 0 |
| PTR-PPO: Proximal Policy Optimization with Prioritized Trajectory Replay | Dec 7, 2021 | Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Pull-Based Query Scheduling for Goal-Oriented Semantic Communication | Mar 9, 2025 | Deep Reinforcement Learning, Scheduling | —Unverified | 0 | 0 |
| Puppeteer and Marionette: Learning Anticipatory Quadrupedal Locomotion Based on Interactions of a Central Pattern Generator and Supraspinal Drive | Feb 26, 2023 | Deep Reinforcement Learning, Model Predictive Control | —Unverified | 0 | 0 |
| Putting the Iterative Training of Decision Trees to the Test on a Real-World Robotic Task | Dec 6, 2024 | Deep Reinforcement Learning, reinforcement-learning | —Unverified | 0 | 0 |
| PyDCM: Custom Data Center Models with Reinforcement Learning for Sustainability | Oct 5, 2023 | Deep Reinforcement Learning, reinforcement-learning | —Unverified | 0 | 0 |
| QAmplifyNet: Pushing the Boundaries of Supply Chain Backorder Prediction Using Interpretable Hybrid Quantum-Classical Neural Network | Jul 24, 2023 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Qd-tree: Learning Data Layouts for Big Data Analytics | Apr 22, 2020 | Blocking, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Qgraph-bounded Q-learning: Stabilizing Model-Free Off-Policy Deep Reinforcement Learning | Jul 15, 2020 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| Q-LDA: Uncovering Latent Patterns in Text-based Sequential Decision Processes | Dec 1, 2017 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Q-learning as a monotone scheme | May 30, 2024 | Deep Reinforcement Learning, Q-Learning | —Unverified | 0 | 0 |
| QoE Optimization for Live Video Streaming in UAV-to-UAV Communications via Deep Reinforcement Learning | Feb 21, 2021 | Deep Reinforcement Learning | —Unverified | 0 | 0 |
| QoS and Jamming-Aware Wireless Networking Using Deep Reinforcement Learning | Oct 13, 2019 | Deep Reinforcement Learning, reinforcement-learning | —Unverified | 0 | 0 |
| QoS-Aware Scheduling in New Radio Using Deep Reinforcement Learning | Jul 14, 2021 | Deep Reinforcement Learning, reinforcement-learning | —Unverified | 0 | 0 |
| Quality-Aware Multimodal Saliency Detection via Deep Reinforcement Learning | Nov 27, 2018 | Decision Making, Decoder | —Unverified | 0 | 0 |
| Quality of service based radar resource management using deep reinforcement learning | Oct 20, 2020 | Decision Making, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| QuantFactor REINFORCE: Mining Steady Formulaic Alpha Factors with Variance-bounded REINFORCE | Sep 8, 2024 | Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Automated Quantification of CT Patterns Associated with COVID-19 from Chest CT | Apr 2, 2020 | Deep Reinforcement Learning, Reinforcement Learning | —Unverified | 0 | 0 |
| Quantifying Multimodality in World Models | Dec 14, 2021 | Deep Reinforcement Learning, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| Quantitative Day Trading from Natural Language using Reinforcement Learning | Jun 1, 2021 | Deep Reinforcement Learning, reinforcement-learning | —Unverified | 0 | 0 |
| Quantity vs. Quality: On Hyperparameter Optimization for Deep Reinforcement Learning | Jul 29, 2020 | Bayesian Optimization, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Quantum Architecture Search via Continual Reinforcement Learning | Dec 10, 2021 | Continual Learning, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Quantum Circuit Optimization with AlphaTensor | Feb 22, 2024 | Deep Reinforcement Learning, Tensor Decomposition | —Unverified | 0 | 0 |
| Quantum Control based on Deep Reinforcement Learning | Dec 14, 2022 | Deep Reinforcement Learning, reinforcement-learning | —Unverified | 0 | 0 |
| Quantum Deep Hedging | Mar 29, 2023 | Deep Reinforcement Learning, Quantum Machine Learning | —Unverified | 0 | 0 |
| Quantum Machine Learning Architecture Search via Deep Reinforcement Learning | Jul 29, 2024 | Deep Reinforcement Learning, Quantum Machine Learning | —Unverified | 0 | 0 |
| Quantum Power Electronics: From Theory to Implementation | Mar 8, 2023 | Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Quantum Reinforcement Learning-Based Two-Stage Unit Commitment Framework for Enhanced Power Systems Robustness | Oct 28, 2024 | Computational Efficiency, Decision Making | —Unverified | 0 | 0 |
| Deep Reinforcement Learning via L-BFGS Optimization | Nov 6, 2018 | Atari Games, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Quasi-Newton Optimization Methods For Deep Learning Applications | Sep 4, 2019 | Deep Learning, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Question Answering with Texts and Tables through Deep Reinforcement Learning | Jul 5, 2024 | Deep Reinforcement Learning, Question Answering | —Unverified | 0 | 0 |
| Quick Learner Automated Vehicle Adapting its Roadmanship to Varying Traffic Cultures with Meta Reinforcement Learning | Apr 18, 2021 | Deep Reinforcement Learning, Meta Reinforcement Learning | —Unverified | 0 | 0 |
| Reward Prediction Error as an Exploration Objective in Deep RL | Jun 19, 2019 | Atari Games, Continuous Control | —Unverified | 0 | 0 |
| QXplore: Q-Learning Exploration by Maximizing Temporal Difference Error | Sep 25, 2019 | continuous-control, Continuous Control | —Unverified | 0 | 0 |
| R2 Indicator and Deep Reinforcement Learning Enhanced Adaptive Multi-Objective Evolutionary Algorithm | Apr 11, 2024 | Deep Reinforcement Learning, reinforcement-learning | —Unverified | 0 | 0 |
| R3L: Connecting Deep Reinforcement Learning to Recurrent Neural Networks for Image Denoising via Residual Recovery | Jul 12, 2021 | Benchmarking, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| RACA: Relation-Aware Credit Assignment for Ad-Hoc Cooperation in Multi-Agent Deep Reinforcement Learning | Jun 2, 2022 | Deep Reinforcement Learning, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| RACE-SM: Reinforcement Learning Based Autonomous Control for Social On-Ramp Merging | Mar 5, 2024 | Deep Reinforcement Learning | —Unverified | 0 | 0 |