| Improving Environment Robustness of Deep Reinforcement Learning Approaches for Autonomous Racing Using Bayesian Optimization-based Curriculum Learning | Dec 16, 2023 | Autonomous DrivingAutonomous Racing | CodeCode Available | 0 |
| Improving Exploration in Evolution Strategies for Deep Reinforcement Learning via a Population of Novelty-Seeking Agents | Dec 18, 2017 | Deep Reinforcement LearningPolicy Gradient Methods | CodeCode Available | 0 |
| Improving Exploration in Soft-Actor-Critic with Normalizing Flows Policies | Jun 6, 2019 | Deep Reinforcement LearningReinforcement Learning | CodeCode Available | 0 |
| Convex Is Back: Solving Belief MDPs With Convexity-Informed Deep Reinforcement Learning | Feb 13, 2025 | Deep Reinforcement Learning | CodeCode Available | 0 |
| Scalable Volt-VAR Optimization using RLlib-IMPALA Framework: A Reinforcement Learning Approach | Feb 24, 2024 | Deep Reinforcement LearningDistributed Computing | CodeCode Available | 0 |
| Fast deep reinforcement learning using online adjustments from the past | Oct 18, 2018 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Trust Region-Guided Proximal Policy Optimization | Jan 29, 2019 | Deep Reinforcement LearningReinforcement Learning | CodeCode Available | 0 |
| Improving Generalization on the ProcGen Benchmark with Simple Architectural Changes and Scale | Oct 13, 2024 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Trust-Region Twisted Policy Improvement | Apr 8, 2025 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 0 |
| Quantum Deep Reinforcement Learning for Robot Navigation Tasks | Feb 24, 2022 | BIG-bench Machine LearningDeep Reinforcement Learning | CodeCode Available | 0 |
| Towards Better Interpretability in Deep Q-Networks | Sep 15, 2018 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Improving Optimization Bounds using Machine Learning: Decision Diagrams meet Deep Reinforcement Learning | Sep 10, 2018 | BIG-bench Machine LearningCombinatorial Optimization | CodeCode Available | 0 |
| Conversational Tree Search: A New Hybrid Dialog Task | Mar 17, 2023 | Deep Reinforcement LearningInformation Retrieval | CodeCode Available | 0 |
| Improving Policy Optimization with Generalist-Specialist Learning | Jun 26, 2022 | Deep Reinforcement LearningImitation Learning | CodeCode Available | 0 |
| Bayesian Optimization for Iterative Learning | Sep 20, 2019 | Bayesian OptimizationDeep Reinforcement Learning | CodeCode Available | 0 |
| Improving Robustness of Deep Reinforcement Learning Agents: Environment Attack based on the Critic Network | Apr 7, 2021 | Adversarial AttackDeep Reinforcement Learning | CodeCode Available | 0 |
| Budgeted Reinforcement Learning in Continuous State Space | Mar 3, 2019 | Autonomous DrivingDeep Reinforcement Learning | CodeCode Available | 0 |
| AC-Teach: A Bayesian Actor-Critic Method for Policy Learning with an Ensemble of Suboptimal Teachers | Sep 9, 2019 | Deep Reinforcement LearningReinforcement Learning | CodeCode Available | 0 |
| An Intelligent SDWN Routing Algorithm Based on Network Situational Awareness and Deep Reinforcement Learning | May 12, 2023 | Deep Reinforcement Learning | CodeCode Available | 0 |
| Deep Reinforcement Learning from Hierarchical Preference Design | Sep 6, 2023 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Query-based Targeted Action-Space Adversarial Policies on Deep Reinforcement Learning Agents | Nov 13, 2020 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Towards Closing the Sim-to-Real Gap in Collaborative Multi-Robot Deep Reinforcement Learning | Aug 18, 2020 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | CodeCode Available | 0 |
| Queueing Network Controls via Deep Reinforcement Learning | Jul 31, 2020 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Super Reinforcement Bros: Playing Super Mario Bros with Reinforcement Learning | Dec 14, 2020 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Q-Value Weighted Regression: Reinforcement Learning with Limited Data | Feb 12, 2021 | Atari Gamescontinuous-control | CodeCode Available | 0 |