| Policy Distillation with Selective Input Gradient Regularization for Efficient Interpretability | May 18, 2022 | Autonomous DrivingDeep Reinforcement Learning | —Unverified | 0 |
| Policy Entropy for Out-of-Distribution Classification | May 25, 2020 | BenchmarkingClassification | —Unverified | 0 |
| PolicyGNN: Aggregation Optimization for Graph Neural Networks | Feb 1, 2020 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| Policy Gradient For Multidimensional Action Spaces: Action Sampling and Entropy Bonus | Jan 1, 2018 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| Policy Networks with Two-Stage Training for Dialogue Systems | Jun 10, 2016 | Deep Reinforcement LearningDialogue State Tracking | —Unverified | 0 |
| Policy Optimization by Genetic Distillation | Nov 3, 2017 | Deep Reinforcement LearningImitation Learning | —Unverified | 0 |
| Policy Optimization with Smooth Guidance Learned from State-Only Demonstrations | Dec 30, 2023 | Deep Reinforcement Learning | —Unverified | 0 |
| Policy Prediction Network: Model-Free Behavior Policy with Model-Based Learning in Continuous Action Space | Sep 15, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| Policy Search in Continuous Action Domains: an Overview | Mar 13, 2018 | Bayesian OptimizationDeep Reinforcement Learning | —Unverified | 0 |
| POMDPs in Continuous Time and Discrete Spaces | Oct 2, 2020 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Population-aware Online Mirror Descent for Mean-Field Games by Deep Reinforcement Learning | Mar 6, 2024 | Deep Reinforcement Learning | —Unverified | 0 |
| Population-coding and Dynamic-neurons improved Spiking Actor Network for Reinforcement Learning | Jun 15, 2021 | Deep Reinforcement LearningOpenAI Gym | —Unverified | 0 |
| Global Rewards in Multi-Agent Deep Reinforcement Learning for Autonomous Mobility on Demand Systems | Dec 14, 2023 | counterfactualDeep Reinforcement Learning | CodeCode Available | 0 |
| POMDP inference and robust solution via deep reinforcement learning: An application to railway optimal maintenance | Jul 16, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Meta Reinforcement Learning with Task Embedding and Shared Policy | May 16, 2019 | Deep Reinforcement LearningMeta-Learning | CodeCode Available | 0 |
| Deep RTS: A Game Environment for Deep Reinforcement Learning in Real-Time Strategy Games | Aug 15, 2018 | Deep Reinforcement LearningReal-Time Strategy Games | CodeCode Available | 0 |
| The Pump Scheduling Problem: A Real-World Scenario for Reinforcement Learning | Oct 20, 2022 | Deep Reinforcement LearningOffline RL | CodeCode Available | 0 |
| Global and Local Analysis of Interestingness for Competency-Aware Deep Reinforcement Learning | Nov 11, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| DASHA: Decentralized Autofocusing System with Hierarchical Agents | Aug 29, 2021 | Deep Reinforcement Learningobject-detection | CodeCode Available | 0 |
| Understanding the Evolution of Linear Regions in Deep Reinforcement Learning | Oct 24, 2022 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Deep reinforcement learning with time-scale invariant memory | Dec 19, 2024 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Dealing with Sparse Rewards in Reinforcement Learning | Oct 21, 2019 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Neural Replicator Dynamics | Jun 1, 2019 | counterfactualDeep Reinforcement Learning | CodeCode Available | 0 |
| SPQR: Controlling Q-ensemble Independence with Spiked Random Model for Reinforcement Learning | Jan 6, 2024 | Deep Reinforcement LearningDiversity | CodeCode Available | 0 |
| MICo: Improved representations via sampling-based state similarity for Markov decision processes | Jun 3, 2021 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Gossip-based Actor-Learner Architectures for Deep Reinforcement Learning | Jun 9, 2019 | Deep Reinforcement LearningGPU | CodeCode Available | 0 |
| GRAC: Self-Guided and Self-Regularized Actor-Critic | Sep 18, 2020 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Wasserstein Auto-encoded MDPs: Formal Verification of Efficiently Distilled RL Policies with Many-sided Guarantees | Mar 22, 2023 | Deep Reinforcement Learning | CodeCode Available | 0 |
| MicroRacer: a didactic environment for Deep Reinforcement Learning | Mar 20, 2022 | Car RacingDeep Reinforcement Learning | CodeCode Available | 0 |
| GFN-SR: Symbolic Regression with Generative Flow Networks | Dec 1, 2023 | Deep Reinforcement LearningInterpretable Machine Learning | CodeCode Available | 0 |
| The State of Sparse Training in Deep Reinforcement Learning | Jun 17, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| GEP-PG: Decoupling Exploration and Exploitation in Deep Reinforcement Learning Algorithms | Feb 14, 2018 | Deep Reinforcement LearningDiversity | CodeCode Available | 0 |
| Graph Attention-based Deep Reinforcement Learning for solving the Chinese Postman Problem with Load-dependent costs | Oct 24, 2023 | ARCDeep Reinforcement Learning | CodeCode Available | 0 |
| DCUR: Data Curriculum for Teaching via Samples with Reinforcement Learning | Sep 15, 2021 | Deep Reinforcement LearningOffline RL | CodeCode Available | 0 |
| Graph Backup: Data Efficient Backup Exploiting Markovian Transitions | May 31, 2022 | Atari Gamescounterfactual | CodeCode Available | 0 |
| MineRL: A Large-Scale Dataset of Minecraft Demonstrations | Jul 29, 2019 | BenchmarkingDeep Reinforcement Learning | CodeCode Available | 0 |
| Variational Inference with Tail-adaptive f-Divergence | Oct 29, 2018 | Deep Reinforcement LearningReinforcement Learning | CodeCode Available | 0 |
| Robust Policy Optimization in Deep Reinforcement Learning | Dec 14, 2022 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Generative Question Refinement with Deep Reinforcement Learning in Retrieval-based QA System | Aug 13, 2019 | Deep Reinforcement LearningQuestion Answering | CodeCode Available | 0 |
| Generative Market Equilibrium Models with Stable Adversarial Learning via Reinforcement | Apr 5, 2025 | Deep Reinforcement Learning | CodeCode Available | 0 |
| PPO Dash: Improving Generalization in Deep Reinforcement Learning | Jul 15, 2019 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| AirPilot: Interpretable PPO-based DRL Auto-Tuned Nonlinear PID Drone Controller for Robust Autonomous Flights | Mar 30, 2024 | Deep Reinforcement LearningDrone Controller | CodeCode Available | 0 |
| Visual Foresight: Model-Based Deep Reinforcement Learning for Vision-Based Robotic Control | Dec 3, 2018 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Graph Neural Network-Based Reinforcement Learning for Controlling Biological Networks: The GATTACA Framework | May 5, 2025 | Deep Reinforcement LearningGraph Neural Network | CodeCode Available | 0 |
| Generative Actor-Critic: An Off-policy Algorithm Using the Push-forward Model | May 8, 2021 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Generating a Graph Colouring Heuristic with Deep Q-Learning and Graph Neural Networks | Apr 8, 2023 | Deep Reinforcement LearningGraph Neural Network | CodeCode Available | 0 |
| Data Assimilation in Chaotic Systems Using Deep Reinforcement Learning | Jan 1, 2024 | Autonomous VehiclesDeep Reinforcement Learning | CodeCode Available | 0 |
| UNIDOOR: A Universal Framework for Action-Level Backdoor Attacks in Deep Reinforcement Learning | Jan 26, 2025 | Backdoor AttackDeep Reinforcement Learning | CodeCode Available | 0 |
| MINOS: Multimodal Indoor Simulator for Navigation in Complex Environments | Dec 11, 2017 | Deep Reinforcement LearningNavigate | CodeCode Available | 0 |
| Deep Reinforcement Learning with Swin Transformers | Jun 30, 2022 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |