| Policy Distillation with Selective Input Gradient Regularization for Efficient Interpretability | May 18, 2022 | Autonomous DrivingDeep Reinforcement Learning | —Unverified | 0 |
| Policy Entropy for Out-of-Distribution Classification | May 25, 2020 | BenchmarkingClassification | —Unverified | 0 |
| PolicyGNN: Aggregation Optimization for Graph Neural Networks | Feb 1, 2020 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| Policy Gradient For Multidimensional Action Spaces: Action Sampling and Entropy Bonus | Jan 1, 2018 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| Policy Networks with Two-Stage Training for Dialogue Systems | Jun 10, 2016 | Deep Reinforcement LearningDialogue State Tracking | —Unverified | 0 |
| Policy Optimization by Genetic Distillation | Nov 3, 2017 | Deep Reinforcement LearningImitation Learning | —Unverified | 0 |
| Policy Optimization with Smooth Guidance Learned from State-Only Demonstrations | Dec 30, 2023 | Deep Reinforcement Learning | —Unverified | 0 |
| Policy Prediction Network: Model-Free Behavior Policy with Model-Based Learning in Continuous Action Space | Sep 15, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| Policy Search in Continuous Action Domains: an Overview | Mar 13, 2018 | Bayesian OptimizationDeep Reinforcement Learning | —Unverified | 0 |
| POMDPs in Continuous Time and Discrete Spaces | Oct 2, 2020 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Population-aware Online Mirror Descent for Mean-Field Games by Deep Reinforcement Learning | Mar 6, 2024 | Deep Reinforcement Learning | —Unverified | 0 |
| Population-coding and Dynamic-neurons improved Spiking Actor Network for Reinforcement Learning | Jun 15, 2021 | Deep Reinforcement LearningOpenAI Gym | —Unverified | 0 |
| Global Rewards in Multi-Agent Deep Reinforcement Learning for Autonomous Mobility on Demand Systems | Dec 14, 2023 | counterfactualDeep Reinforcement Learning | CodeCode Available | 0 |
| POMDP inference and robust solution via deep reinforcement learning: An application to railway optimal maintenance | Jul 16, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Meta Reinforcement Learning with Task Embedding and Shared Policy | May 16, 2019 | Deep Reinforcement LearningMeta-Learning | CodeCode Available | 0 |
| Deep RTS: A Game Environment for Deep Reinforcement Learning in Real-Time Strategy Games | Aug 15, 2018 | Deep Reinforcement LearningReal-Time Strategy Games | CodeCode Available | 0 |
| The Pump Scheduling Problem: A Real-World Scenario for Reinforcement Learning | Oct 20, 2022 | Deep Reinforcement LearningOffline RL | CodeCode Available | 0 |
| Global and Local Analysis of Interestingness for Competency-Aware Deep Reinforcement Learning | Nov 11, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| DASHA: Decentralized Autofocusing System with Hierarchical Agents | Aug 29, 2021 | Deep Reinforcement Learningobject-detection | CodeCode Available | 0 |
| Understanding the Evolution of Linear Regions in Deep Reinforcement Learning | Oct 24, 2022 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Deep reinforcement learning with time-scale invariant memory | Dec 19, 2024 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Dealing with Sparse Rewards in Reinforcement Learning | Oct 21, 2019 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Neural Replicator Dynamics | Jun 1, 2019 | counterfactualDeep Reinforcement Learning | CodeCode Available | 0 |
| SPQR: Controlling Q-ensemble Independence with Spiked Random Model for Reinforcement Learning | Jan 6, 2024 | Deep Reinforcement LearningDiversity | CodeCode Available | 0 |
| MICo: Improved representations via sampling-based state similarity for Markov decision processes | Jun 3, 2021 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |