| Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic | Nov 7, 2016 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Federated Control with Hierarchical Multi-Agent Deep Reinforcement Learning | Dec 22, 2017 | Deep Reinforcement LearningEfficient Exploration | CodeCode Available | 0 |
| QT-Opt: Scalable Deep Reinforcement Learning for Vision-Based Robotic Manipulation | Jun 27, 2018 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Counterfactual State Explanations for Reinforcement Learning Agents via Generative Deep Learning | Jan 29, 2021 | counterfactualDeep Reinforcement Learning | CodeCode Available | 0 |
| Importance Prioritized Policy Distillation | Aug 25, 2022 | Atari GamesDecision Making | CodeCode Available | 0 |
| Faults in Deep Reinforcement Learning Programs: A Taxonomy and A Detection Approach | Jan 1, 2021 | Deep Reinforcement LearningFault Detection | CodeCode Available | 0 |
| Deep Reinforcement Learning Methods for Structure-Guided Processing Path Optimization | Sep 21, 2020 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Counterfactual Explainer Framework for Deep Reinforcement Learning Models Using Policy Distillation | May 25, 2023 | counterfactualCounterfactual Explanation | CodeCode Available | 0 |
| TrojDRL: Trojan Attacks on Deep Reinforcement Learning Agents | Mar 1, 2019 | Data PoisoningDeep Reinforcement Learning | CodeCode Available | 0 |
| TrolleyMod v1.0: An Open-Source Simulation and Data-Collection Platform for Ethical Decision Making in Autonomous Vehicles | Nov 14, 2018 | Autonomous VehiclesDecision Making | CodeCode Available | 0 |
| Improved robustness of reinforcement learning policies upon conversion to spiking neuronal network platforms applied to ATARI games | Mar 26, 2019 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Student-Initiated Action Advising via Advice Novelty | Oct 1, 2020 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Correcting Momentum in Temporal Difference Learning | Jun 7, 2021 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 0 |
| An Intentional Forgetting-Driven Self-Healing Method For Deep Reinforcement Learning Systems | Aug 23, 2023 | Continual LearningDeep Reinforcement Learning | CodeCode Available | 0 |
| Quantile-Based Deep Reinforcement Learning using Two-Timescale Policy Gradient Algorithms | May 12, 2023 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Truly Proximal Policy Optimization | Mar 19, 2019 | Deep Reinforcement LearningReinforcement Learning | CodeCode Available | 0 |
| Improving Automatic Source Code Summarization via Deep Reinforcement Learning | Nov 17, 2018 | Code SummarizationDecoder | CodeCode Available | 0 |
| Bayesian Optimization with Robust Bayesian Neural Networks | Dec 1, 2016 | Bayesian OptimizationDeep Reinforcement Learning | CodeCode Available | 0 |
| QuaRL: Quantization for Fast and Environmentally Sustainable Reinforcement Learning | Oct 2, 2019 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Fast Matrix Multiplication Without Tears: A Constraint Programming Approach | Jun 1, 2023 | Deep Reinforcement LearningProblem Decomposition | CodeCode Available | 0 |
| Improving Coordination in Small-Scale Multi-Agent Deep Reinforcement Learning through Memory-driven Communication | Jan 12, 2019 | Deep Reinforcement LearningReinforcement Learning | CodeCode Available | 0 |
| Deep Innovation Protection: Confronting the Credit Assignment Problem in Training Heterogeneous Neural Architectures | Dec 29, 2019 | Deep Reinforcement LearningMultiobjective Optimization | CodeCode Available | 0 |
| Improving Deep Reinforcement Learning by Reducing the Chain Effect of Value and Policy Churn | Sep 7, 2024 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 0 |
| Urban Driving with Multi-Objective Deep Reinforcement Learning | Nov 21, 2018 | Autonomous DrivingDeep Reinforcement Learning | CodeCode Available | 0 |
| Deep reinforcement learning from human preferences | Jun 12, 2017 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Improving Environment Robustness of Deep Reinforcement Learning Approaches for Autonomous Racing Using Bayesian Optimization-based Curriculum Learning | Dec 16, 2023 | Autonomous DrivingAutonomous Racing | CodeCode Available | 0 |
| Improving Exploration in Evolution Strategies for Deep Reinforcement Learning via a Population of Novelty-Seeking Agents | Dec 18, 2017 | Deep Reinforcement LearningPolicy Gradient Methods | CodeCode Available | 0 |
| Improving Exploration in Soft-Actor-Critic with Normalizing Flows Policies | Jun 6, 2019 | Deep Reinforcement LearningReinforcement Learning | CodeCode Available | 0 |
| Convex Is Back: Solving Belief MDPs With Convexity-Informed Deep Reinforcement Learning | Feb 13, 2025 | Deep Reinforcement Learning | CodeCode Available | 0 |
| Scalable Volt-VAR Optimization using RLlib-IMPALA Framework: A Reinforcement Learning Approach | Feb 24, 2024 | Deep Reinforcement LearningDistributed Computing | CodeCode Available | 0 |
| Fast deep reinforcement learning using online adjustments from the past | Oct 18, 2018 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Trust Region-Guided Proximal Policy Optimization | Jan 29, 2019 | Deep Reinforcement LearningReinforcement Learning | CodeCode Available | 0 |
| Improving Generalization on the ProcGen Benchmark with Simple Architectural Changes and Scale | Oct 13, 2024 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Trust-Region Twisted Policy Improvement | Apr 8, 2025 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 0 |
| Quantum Deep Reinforcement Learning for Robot Navigation Tasks | Feb 24, 2022 | BIG-bench Machine LearningDeep Reinforcement Learning | CodeCode Available | 0 |
| Towards Better Interpretability in Deep Q-Networks | Sep 15, 2018 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Improving Optimization Bounds using Machine Learning: Decision Diagrams meet Deep Reinforcement Learning | Sep 10, 2018 | BIG-bench Machine LearningCombinatorial Optimization | CodeCode Available | 0 |
| Conversational Tree Search: A New Hybrid Dialog Task | Mar 17, 2023 | Deep Reinforcement LearningInformation Retrieval | CodeCode Available | 0 |
| Improving Policy Optimization with Generalist-Specialist Learning | Jun 26, 2022 | Deep Reinforcement LearningImitation Learning | CodeCode Available | 0 |
| Bayesian Optimization for Iterative Learning | Sep 20, 2019 | Bayesian OptimizationDeep Reinforcement Learning | CodeCode Available | 0 |
| Improving Robustness of Deep Reinforcement Learning Agents: Environment Attack based on the Critic Network | Apr 7, 2021 | Adversarial AttackDeep Reinforcement Learning | CodeCode Available | 0 |
| Budgeted Reinforcement Learning in Continuous State Space | Mar 3, 2019 | Autonomous DrivingDeep Reinforcement Learning | CodeCode Available | 0 |
| AC-Teach: A Bayesian Actor-Critic Method for Policy Learning with an Ensemble of Suboptimal Teachers | Sep 9, 2019 | Deep Reinforcement LearningReinforcement Learning | CodeCode Available | 0 |
| An Intelligent SDWN Routing Algorithm Based on Network Situational Awareness and Deep Reinforcement Learning | May 12, 2023 | Deep Reinforcement Learning | CodeCode Available | 0 |
| Deep Reinforcement Learning from Hierarchical Preference Design | Sep 6, 2023 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Query-based Targeted Action-Space Adversarial Policies on Deep Reinforcement Learning Agents | Nov 13, 2020 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Towards Closing the Sim-to-Real Gap in Collaborative Multi-Robot Deep Reinforcement Learning | Aug 18, 2020 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | CodeCode Available | 0 |
| Queueing Network Controls via Deep Reinforcement Learning | Jul 31, 2020 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Super Reinforcement Bros: Playing Super Mario Bros with Reinforcement Learning | Dec 14, 2020 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Q-Value Weighted Regression: Reinforcement Learning with Limited Data | Feb 12, 2021 | Atari Gamescontinuous-control | CodeCode Available | 0 |