| Personalized Lane Change Decision Algorithm Using Deep Reinforcement Learning Approach | Dec 17, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Personalized QoE Enhancement for Adaptive Video Streaming: A Digital Twin-Assisted Scheme | May 9, 2022 | Deep Reinforcement LearningManagement | —Unverified | 0 | 0 |
| Perspective Taking in Deep Reinforcement Learning Agents | Jul 3, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Perturbation-based exploration methods in deep reinforcement learning | Nov 10, 2020 | Atari GamesBenchmarking | —Unverified | 0 | 0 |
| Pessimistic Model Selection for Offline Deep Reinforcement Learning | Nov 29, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Petri Net Machines for Human-Agent Interaction | Sep 13, 2019 | Deep Reinforcement LearningReinforcement Learning | —Unverified | 0 | 0 |
| PGN: A perturbation generation network against deep reinforcement learning | Dec 20, 2023 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| PGPS : Coupling Policy Gradient with Population-based Search | Jan 1, 2021 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 | 0 |
| Physical Deep Reinforcement Learning Towards Safety Guarantee | Mar 29, 2023 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Physical Informed-Inspired Deep Reinforcement Learning Based Bi-Level Programming for Microgrid Scheduling | Oct 15, 2024 | AutoMLComputational Efficiency | —Unverified | 0 | 0 |
| Physics-Based Trajectory Design for Cellular-Connected UAV in Rainy Environments Based on Deep Reinforcement Learning | Aug 31, 2023 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 | 0 |
| Physics-Guided Hierarchical Reward Mechanism for Learning-Based Robotic Grasping | May 26, 2022 | Computational EfficiencyDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Physics-informed Dyna-Style Model-Based Deep Reinforcement Learning for Dynamic Control | Jul 31, 2021 | Deep Reinforcement LearningModel-based Reinforcement Learning | —Unverified | 0 | 0 |
| Physics-informed Modularized Neural Network for Advanced Building Control by Deep Reinforcement Learning | Apr 7, 2025 | Deep Reinforcement LearningPhysics-informed machine learning | —Unverified | 0 | 0 |
| Physics-model-guided Worst-case Sampling for Safe Reinforcement Learning | Dec 17, 2024 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Closed Drafting as a Case Study for First-Principle Interpretability, Memory, and Generalizability in Deep Reinforcement Learning | Oct 31, 2023 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| PICTS: A Novel Deep Reinforcement Learning Approach for Dynamic P-I Control in Scanning Probe Microscopy | Feb 11, 2025 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Placement Optimization of Aerial Base Stations with Deep Reinforcement Learning | Nov 19, 2019 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Placement Optimization with Deep Reinforcement Learning | Mar 18, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| PRIMA: Planner-Reasoner Inside a Multi-task Reasoning Agent | Feb 1, 2022 | Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Plasticity Loss in Deep Reinforcement Learning: A Survey | Nov 7, 2024 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Playing Atari with Capsule Networks: A systematic comparison of CNN and CapsNets-based agents. | Jan 1, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |
| Playing optical tweezers with deep reinforcement learning: in virtual, physical and augmented environments | Nov 5, 2020 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Playing Text-Based Games with Common Sense | Dec 4, 2020 | Common Sense ReasoningDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Point Cloud Scene Completion with Joint Color and Semantic Estimation from Single RGB-D Image | Oct 12, 2022 | Deep Reinforcement LearningImage Inpainting | —Unverified | 0 | 0 |
| Poisoning Deep Reinforcement Learning Agents with In-Distribution Triggers | Jun 14, 2021 | Data PoisoningDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Policies for the Dynamic Traveling Maintainer Problem with Alerts | May 31, 2021 | Combinatorial OptimizationDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Policy-Based Bayesian Experimental Design for Non-Differentiable Implicit Models | Mar 8, 2022 | Deep Reinforcement LearningExperimental Design | —Unverified | 0 | 0 |
| Policy Design for Active Sequential Hypothesis Testing using Deep Learning | Oct 11, 2018 | Deep LearningDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Policy Distillation with Selective Input Gradient Regularization for Efficient Interpretability | May 18, 2022 | Autonomous DrivingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Policy Entropy for Out-of-Distribution Classification | May 25, 2020 | BenchmarkingClassification | —Unverified | 0 | 0 |
| PolicyGNN: Aggregation Optimization for Graph Neural Networks | Feb 1, 2020 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Policy Gradient For Multidimensional Action Spaces: Action Sampling and Entropy Bonus | Jan 1, 2018 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Policy Networks with Two-Stage Training for Dialogue Systems | Jun 10, 2016 | Deep Reinforcement LearningDialogue State Tracking | —Unverified | 0 | 0 |
| Policy Optimization by Genetic Distillation | Nov 3, 2017 | Deep Reinforcement LearningImitation Learning | —Unverified | 0 | 0 |
| Policy Optimization with Smooth Guidance Learned from State-Only Demonstrations | Dec 30, 2023 | Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Policy Prediction Network: Model-Free Behavior Policy with Model-Based Learning in Continuous Action Space | Sep 15, 2019 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Policy Search in Continuous Action Domains: an Overview | Mar 13, 2018 | Bayesian OptimizationDeep Reinforcement Learning | —Unverified | 0 | 0 |
| POMDPs in Continuous Time and Discrete Spaces | Oct 2, 2020 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Population-aware Online Mirror Descent for Mean-Field Games by Deep Reinforcement Learning | Mar 6, 2024 | Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Population-coding and Dynamic-neurons improved Spiking Actor Network for Reinforcement Learning | Jun 15, 2021 | Deep Reinforcement LearningOpenAI Gym | —Unverified | 0 | 0 |
| Portfolio Management using Deep Reinforcement Learning | May 1, 2024 | Algorithmic TradingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Portfolio Optimization with 2D Relative-Attentional Gated Transformer | Dec 27, 2020 | Deep Reinforcement LearningPortfolio Optimization | —Unverified | 0 | 0 |
| Position-Agnostic Autonomous Navigation in Vineyards with Deep Reinforcement Learning | Jun 28, 2022 | Autonomous NavigationDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Power Allocation in Cache-Aided NOMA Systems: Optimization and Deep Reinforcement Learning Approaches | Sep 24, 2019 | Deep Reinforcement LearningFairness | —Unverified | 0 | 0 |
| PowerNet: Multi-agent Deep Reinforcement Learning for Scalable Powergrid Control | Nov 24, 2020 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | —Unverified | 0 | 0 |
| PPO-UE: Proximal Policy Optimization via Uncertainty-Aware Exploration | Dec 13, 2022 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Practicable Black-box Evasion Attacks on Link Prediction in Dynamic Graphs -- A Graph Sequential Embedding Method | Dec 17, 2024 | Deep Reinforcement LearningLink Prediction | —Unverified | 0 | 0 |
| Practical Marginalized Importance Sampling with the Successor Representation | Jan 1, 2021 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 | 0 |
| Precision medicine as a control problem: Using simulation and deep reinforcement learning to discover adaptive, personalized multi-cytokine therapy for sepsis | Feb 8, 2018 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |