| A Study on Overfitting in Deep Reinforcement Learning | Apr 18, 2018 | Deep Reinforcement LearningInductive Bias | CodeCode Available | 0 |
| Learn to Steer through Deep Reinforcement Learning | Oct 27, 2018 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Soft Actor-Critic with Beta Policy via Implicit Reparameterization Gradients | Sep 8, 2024 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Leave no Trace: Learning to Reset for Safe and Autonomous Reinforcement Learning | Nov 18, 2017 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| A little less conversation, a little more action, please: Investigating the physical common-sense of LLMs in a 3D embodied environment | Oct 30, 2024 | Common Sense ReasoningDeep Reinforcement Learning | CodeCode Available | 0 |
| A Study of Plasticity Loss in On-Policy Deep Reinforcement Learning | May 29, 2024 | Continual LearningDeep Reinforcement Learning | CodeCode Available | 0 |
| Left Ventricle Contouring in Cardiac Images Based on Deep Reinforcement Learning | Jun 8, 2021 | Deep Reinforcement LearningImage Segmentation | CodeCode Available | 0 |
| Diversity-based Deep Reinforcement Learning Towards Multidimensional Difficulty for Fighting Game AI | Nov 4, 2022 | Deep Reinforcement LearningDiversity | CodeCode Available | 0 |
| TRACED: Transition-aware Regret Approximation with Co-learnability for Environment Design | Jun 24, 2025 | Deep Reinforcement LearningZero-shot Generalization | CodeCode Available | 0 |
| Assessing the Potential of Classical Q-learning in General Game Playing | Oct 14, 2018 | Board GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Value Activation for Bias Alleviation: Generalized-activated Deep Double Deterministic Policy Gradients | Dec 21, 2021 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Weight Normalization: A Simple Reparameterization to Accelerate Training of Deep Neural Networks | Feb 25, 2016 | Deep Reinforcement LearningImage Classification | CodeCode Available | 0 |
| Let's Play Again: Variability of Deep Reinforcement Learning Agents in Atari Environments | Apr 12, 2019 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Deep Reinforcement Learning Agents for Strategic Production Policies in Microeconomic Market Simulations | Oct 27, 2024 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Risk Conditioned Neural Motion Planning | Aug 4, 2021 | Deep Reinforcement LearningMotion Planning | CodeCode Available | 0 |
| Leveraging Contact Forces for Learning to Grasp | Sep 19, 2018 | Deep Reinforcement LearningObject | CodeCode Available | 0 |
| The Curse of Diversity in Ensemble-Based Exploration | May 7, 2024 | Attributecontinuous-control | CodeCode Available | 0 |
| Leveraging Demonstrations for Deep Reinforcement Learning on Robotics Problems with Sparse Rewards | Jul 27, 2017 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Risk-Sensitive Soft Actor-Critic for Robust Deep Reinforcement Learning under Distribution Shifts | Feb 15, 2024 | Combinatorial OptimizationDeep Reinforcement Learning | CodeCode Available | 0 |
| Parameter-free Reduction of the Estimation Bias in Deep Reinforcement Learning for Deterministic Policy Gradients | Sep 24, 2021 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Leveraging Knowledge Distillation for Efficient Deep Reinforcement Learning in Resource-Constrained Environments | Oct 16, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Deep Reinforced Attention Regression for Partial Sketch Based Image Retrieval | Nov 21, 2021 | Deep Reinforcement LearningImage Retrieval | CodeCode Available | 0 |
| RL^2: Fast Reinforcement Learning via Slow Reinforcement Learning | Nov 9, 2016 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Revisiting Parameter Sharing in Multi-Agent Deep Reinforcement Learning | May 27, 2020 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | CodeCode Available | 0 |
| Distributional Reward Estimation for Effective Multi-Agent Deep Reinforcement Learning | Oct 14, 2022 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | CodeCode Available | 0 |
| Closed-loop deep learning: generating forward models with back-propagation | Jan 9, 2020 | Deep LearningDeep Reinforcement Learning | CodeCode Available | 0 |
| Parameter Space Noise for Exploration | Jun 6, 2017 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Leveraging Sequentiality in Reinforcement Learning from a Single Demonstration | Nov 9, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Parametric PDE Control with Deep Reinforcement Learning and Differentiable L0-Sparse Polynomial Policies | Mar 22, 2024 | Deep Reinforcement LearningDictionary Learning | CodeCode Available | 0 |
| Distributional Bellman Operators over Mean Embeddings | Dec 9, 2023 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Clipped-Objective Policy Gradients for Pessimistic Policy Optimization | Nov 10, 2023 | Deep Reinforcement LearningMulti-Task Learning | CodeCode Available | 0 |
| Deep Quality-Value (DQV) Learning | Sep 30, 2018 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Assessing Generalization in Deep Reinforcement Learning | Oct 29, 2018 | Deep Reinforcement LearningOut-of-Distribution Generalization | CodeCode Available | 0 |
| LIFT: Reinforcement Learning in Computer Systems by Learning From Demonstrations | Aug 23, 2018 | Deep Reinforcement LearningManagement | CodeCode Available | 0 |
| Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical Guarantees | Jul 10, 2018 | continuous-controlContinuous Control | CodeCode Available | 0 |
| TooBadRL: Trigger Optimization to Boost Effectiveness of Backdoor Attacks on Deep Reinforcement Learning | Jun 11, 2025 | Deep Reinforcement LearningSequential Decision Making | CodeCode Available | 0 |
| Solving Common-Payoff Games with Approximate Policy Iteration | Jan 11, 2021 | DecoderDeep Reinforcement Learning | CodeCode Available | 0 |
| Experimental Study on The Effect of Multi-step Deep Reinforcement Learning in POMDPs | Sep 12, 2022 | Deep Reinforcement Learning | CodeCode Available | 0 |
| AI Safety Gridworlds | Nov 27, 2017 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Solving Deep Reinforcement Learning Tasks with Evolution Strategies and Linear Policy Networks | Feb 10, 2024 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Network Randomization: A Simple Technique for Generalization in Deep Reinforcement Learning | Oct 11, 2019 | Data AugmentationDeep Reinforcement Learning | CodeCode Available | 0 |
| A Semantic-Aware Multiple Access Scheme for Distributed, Dynamic 6G-Based Applications | Jan 12, 2024 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| PathletRL++: Optimizing Trajectory Pathlet Extraction and Dictionary Formation via Reinforcement Learning | Dec 4, 2024 | Deep Reinforcement Learning | CodeCode Available | 0 |
| RLgraph: Modular Computation Graphs for Deep Reinforcement Learning | Oct 21, 2018 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Artificial Intelligence for Prosthetics - challenge solutions | Feb 7, 2019 | Deep Reinforcement LearningImitation Learning | CodeCode Available | 0 |
| Deep Reinforcement Learning using Genetic Algorithm for Parameter Optimization | Feb 19, 2019 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Vision-based Navigation Using Deep Reinforcement Learning | Aug 8, 2019 | Deep Reinforcement LearningEfficient Neural Network | CodeCode Available | 0 |
| Pay Attention to What and Where? Interpretable Feature Extractor in Vision-based Deep Reinforcement Learning | Apr 14, 2025 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Deep Q-Network for Angry Birds | Oct 4, 2019 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Distributed-Training-and-Execution Multi-Agent Reinforcement Learning for Power Control in HetNet | Dec 15, 2022 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | CodeCode Available | 0 |