| CEM-RL: Combining evolutionary and gradient-based methods for policy search | Oct 2, 2018 | Deep Reinforcement LearningReinforcement Learning | CodeCode Available | 0 |
| Deep Multi-Agent Reinforcement Learning with Relevance Graphs | Nov 30, 2018 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | CodeCode Available | 0 |
| Making Deep Q-learning methods robust to time discretization | Jan 28, 2019 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Direct Random Search for Fine Tuning of Deep Reinforcement Learning Policies | Sep 12, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Differentially Encoded Observation Spaces for Perceptive Reinforcement Learning | Oct 3, 2023 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Understanding Multi-Step Deep Reinforcement Learning: A Systematic Study of the DQN Target | Jan 22, 2019 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Deep learning-based numerical methods for high-dimensional parabolic partial differential equations and backward stochastic differential equations | Jun 15, 2017 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Deep Feature Space: A Geometrical Perspective | Jun 30, 2020 | Deep Reinforcement LearningDescriptive | CodeCode Available | 0 |
| Diagnosing Bottlenecks in Deep Q-learning Algorithms | Feb 26, 2019 | continuous-controlContinuous Control | CodeCode Available | 0 |
| PixelRL: Fully Convolutional Network with Reinforcement Learning for Image Processing | Dec 16, 2019 | Deep Reinforcement LearningDenoising | CodeCode Available | 0 |
| Causal Campbell-Goodhart's law and Reinforcement Learning | Nov 2, 2020 | Causal InferenceDecision Making | CodeCode Available | 0 |
| Map-based Experience Replay: A Memory-Efficient Solution to Catastrophic Forgetting in Reinforcement Learning | May 3, 2023 | Deep Reinforcement Learning | CodeCode Available | 0 |
| Sparsifying Parametric Models with L0 Regularization | Sep 5, 2024 | Deep Reinforcement LearningDictionary Learning | CodeCode Available | 0 |
| The MineRL 2019 Competition on Sample Efficient Reinforcement Learning using Human Priors | Apr 22, 2019 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Dexterous Robotic Manipulation using Deep Reinforcement Learning and Knowledge Transfer for Complex Sparse Reward-based Tasks | May 19, 2022 | Deep Reinforcement LearningTransfer Learning | CodeCode Available | 0 |
| Trajectory-Based Off-Policy Deep Reinforcement Learning | May 14, 2019 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Dex: Incremental Learning for Complex Environments in Deep Reinforcement Learning | Jun 19, 2017 | Continual LearningDeep Reinforcement Learning | CodeCode Available | 0 |
| Margin Trader: A Reinforcement Learning Framework for Portfolio Management with Margin and Constraints | Nov 25, 2023 | Deep Reinforcement LearningManagement | CodeCode Available | 0 |
| Playing Atari Games with Deep Reinforcement Learning and Human Checkpoint Replay | Jul 18, 2016 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Multi-task Learning and Catastrophic Forgetting in Continual Reinforcement Learning | Sep 22, 2019 | Continual LearningDeep Reinforcement Learning | CodeCode Available | 0 |
| Developing a Chatbot system using Deep Learning based for Universities consultancy | Feb 28, 2022 | ChatbotDeep Learning | CodeCode Available | 0 |
| Deep Bayesian Bandits Showdown: An Empirical Comparison of Bayesian Deep Networks for Thompson Sampling | Feb 26, 2018 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Deterministic Implementations for Reproducibility in Deep Reinforcement Learning | Sep 15, 2018 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Playing Atari with Six Neurons | Jun 4, 2018 | Atari GamesDecision Making | CodeCode Available | 0 |
| Detecting Adversarial Attacks on Neural Network Policies with Visual Foresight | Oct 2, 2017 | Autonomous VehiclesDecision Making | CodeCode Available | 0 |
| Playing Doom with SLAM-Augmented Deep Reinforcement Learning | Dec 1, 2016 | Deep Reinforcement Learningobject-detection | CodeCode Available | 0 |
| Design Optimization of Nuclear Fusion Reactor through Deep Reinforcement Learning | Sep 12, 2024 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Massively Parallel Methods for Deep Reinforcement Learning | Jul 15, 2015 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Playing FPS Games with Deep Reinforcement Learning | Sep 18, 2016 | Deep Reinforcement LearningFPS Games | CodeCode Available | 0 |
| Deep Attention Q-Network for Personalized Treatment Recommendation | Jul 4, 2023 | Deep AttentionDeep Reinforcement Learning | CodeCode Available | 0 |
| Deploying Deep Reinforcement Learning Systems: A Taxonomy of Challenges | Aug 23, 2023 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Robust Deep Reinforcement Learning Scheduling via Weight Anchoring | Apr 20, 2023 | Continual LearningDeep Reinforcement Learning | CodeCode Available | 0 |
| Decision Transformer under Random Frame Dropping | Mar 3, 2023 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 0 |
| Playing Text-Adventure Games with Graph-Based Deep Reinforcement Learning | Dec 4, 2018 | Deep Reinforcement LearningEfficient Exploration | CodeCode Available | 0 |
| Catastrophic Interference in Reinforcement Learning: A Solution Based on Context Division and Knowledge Distillation | Sep 1, 2021 | Deep Reinforcement LearningGeneral Reinforcement Learning | CodeCode Available | 0 |
| Decision Theory-Guided Deep Reinforcement Learning for Fast Learning | Feb 8, 2024 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Decision-making and control with diffractive optical networks | Dec 21, 2022 | Autonomous DrivingCar Racing | CodeCode Available | 0 |
| Cascaded LSTMs based Deep Reinforcement Learning for Goal-driven Dialogue | Oct 31, 2019 | Deep Reinforcement LearningDialogue Management | CodeCode Available | 0 |
| Decentralized Computation Offloading for Multi-User Mobile Edge Computing: A Deep Reinforcement Learning Approach | Dec 16, 2018 | Deep Reinforcement LearningEdge-computing | CodeCode Available | 0 |
| Dependability Analysis of Deep Reinforcement Learning based Robotics and Autonomous Systems through Probabilistic Model Checking | Sep 14, 2021 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 0 |
| Regret-Based Defense in Adversarial Reinforcement Learning | Feb 14, 2023 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| A Reinforcement Learning Approach for Robotic Unloading from Visual Observations | Sep 12, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Policies Modulating Trajectory Generators | Oct 7, 2019 | Deep Reinforcement LearningReinforcement Learning | CodeCode Available | 0 |
| Policy Abstraction and Nash Refinement in Tree-Exploiting PSRO | Feb 5, 2025 | Deep Reinforcement Learning | CodeCode Available | 0 |
| DenseLight: Efficient Control for Large-scale Traffic Signals with Dense Feedback | Jun 13, 2023 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 0 |
| Policy Augmentation: An Exploration Strategy for Faster Convergence of Deep Reinforcement Learning Algorithms | Feb 10, 2021 | Deep Reinforcement LearningMatrix Completion | CodeCode Available | 0 |
| DRIBO: Robust Deep Reinforcement Learning via Multi-View Information Bottleneck | Feb 26, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Policy Consolidation for Continual Reinforcement Learning | Feb 1, 2019 | Continual Learningcontinuous-control | CodeCode Available | 0 |
| ARCHER: Aggressive Rewards to Counter bias in Hindsight Experience Replay | Sep 6, 2018 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Policy Distillation | Nov 19, 2015 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |