| Poisoning Deep Reinforcement Learning Agents with In-Distribution Triggers | Jun 14, 2021 | Data PoisoningDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Policies for the Dynamic Traveling Maintainer Problem with Alerts | May 31, 2021 | Combinatorial OptimizationDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Policy-Based Bayesian Experimental Design for Non-Differentiable Implicit Models | Mar 8, 2022 | Deep Reinforcement LearningExperimental Design | —Unverified | 0 | 0 |
| Policy Design for Active Sequential Hypothesis Testing using Deep Learning | Oct 11, 2018 | Deep LearningDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Policy Distillation with Selective Input Gradient Regularization for Efficient Interpretability | May 18, 2022 | Autonomous DrivingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Policy Entropy for Out-of-Distribution Classification | May 25, 2020 | BenchmarkingClassification | —Unverified | 0 | 0 |
| PolicyGNN: Aggregation Optimization for Graph Neural Networks | Feb 1, 2020 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Policy Gradient For Multidimensional Action Spaces: Action Sampling and Entropy Bonus | Jan 1, 2018 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Policy Networks with Two-Stage Training for Dialogue Systems | Jun 10, 2016 | Deep Reinforcement LearningDialogue State Tracking | —Unverified | 0 | 0 |
| Policy Optimization by Genetic Distillation | Nov 3, 2017 | Deep Reinforcement LearningImitation Learning | —Unverified | 0 | 0 |
| Policy Optimization with Smooth Guidance Learned from State-Only Demonstrations | Dec 30, 2023 | Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Policy Prediction Network: Model-Free Behavior Policy with Model-Based Learning in Continuous Action Space | Sep 15, 2019 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Policy Search in Continuous Action Domains: an Overview | Mar 13, 2018 | Bayesian OptimizationDeep Reinforcement Learning | —Unverified | 0 | 0 |
| POMDPs in Continuous Time and Discrete Spaces | Oct 2, 2020 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Population-aware Online Mirror Descent for Mean-Field Games by Deep Reinforcement Learning | Mar 6, 2024 | Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Population-coding and Dynamic-neurons improved Spiking Actor Network for Reinforcement Learning | Jun 15, 2021 | Deep Reinforcement LearningOpenAI Gym | —Unverified | 0 | 0 |
| Portfolio Management using Deep Reinforcement Learning | May 1, 2024 | Algorithmic TradingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Portfolio Optimization with 2D Relative-Attentional Gated Transformer | Dec 27, 2020 | Deep Reinforcement LearningPortfolio Optimization | —Unverified | 0 | 0 |
| Position-Agnostic Autonomous Navigation in Vineyards with Deep Reinforcement Learning | Jun 28, 2022 | Autonomous NavigationDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Power Allocation in Cache-Aided NOMA Systems: Optimization and Deep Reinforcement Learning Approaches | Sep 24, 2019 | Deep Reinforcement LearningFairness | —Unverified | 0 | 0 |
| PowerNet: Multi-agent Deep Reinforcement Learning for Scalable Powergrid Control | Nov 24, 2020 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | —Unverified | 0 | 0 |
| PPO-UE: Proximal Policy Optimization via Uncertainty-Aware Exploration | Dec 13, 2022 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Practicable Black-box Evasion Attacks on Link Prediction in Dynamic Graphs -- A Graph Sequential Embedding Method | Dec 17, 2024 | Deep Reinforcement LearningLink Prediction | —Unverified | 0 | 0 |
| Practical Marginalized Importance Sampling with the Successor Representation | Jan 1, 2021 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 | 0 |
| Precision medicine as a control problem: Using simulation and deep reinforcement learning to discover adaptive, personalized multi-cytokine therapy for sepsis | Feb 8, 2018 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 | 0 |