| Gossip-based Actor-Learner Architectures for Deep Reinforcement Learning | Jun 9, 2019 | Deep Reinforcement Learning, GPU | Code Available | 0 |
| GRAC: Self-Guided and Self-Regularized Actor-Critic | Sep 18, 2020 | Decision Making, Deep Reinforcement Learning | Code Available | 0 |
| Wasserstein Auto-encoded MDPs: Formal Verification of Efficiently Distilled RL Policies with Many-sided Guarantees | Mar 22, 2023 | Deep Reinforcement Learning | Code Available | 0 |
| MicroRacer: a didactic environment for Deep Reinforcement Learning | Mar 20, 2022 | Car Racing, Deep Reinforcement Learning | Code Available | 0 |
| GFN-SR: Symbolic Regression with Generative Flow Networks | Dec 1, 2023 | Deep Reinforcement Learning, Interpretable Machine Learning | Code Available | 0 |
| The State of Sparse Training in Deep Reinforcement Learning | Jun 17, 2022 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| GEP-PG: Decoupling Exploration and Exploitation in Deep Reinforcement Learning Algorithms | Feb 14, 2018 | Deep Reinforcement Learning, Diversity | Code Available | 0 |
| Graph Attention-based Deep Reinforcement Learning for solving the Chinese Postman Problem with Load-dependent costs | Oct 24, 2023 | ARC, Deep Reinforcement Learning | Code Available | 0 |
| DCUR: Data Curriculum for Teaching via Samples with Reinforcement Learning | Sep 15, 2021 | Deep Reinforcement Learning, Offline RL | Code Available | 0 |
| Graph Backup: Data Efficient Backup Exploiting Markovian Transitions | May 31, 2022 | Atari Games, counterfactual | Code Available | 0 |
| MineRL: A Large-Scale Dataset of Minecraft Demonstrations | Jul 29, 2019 | Benchmarking, Deep Reinforcement Learning | Code Available | 0 |
| Variational Inference with Tail-adaptive f-Divergence | Oct 29, 2018 | Deep Reinforcement Learning, Reinforcement Learning | Code Available | 0 |
| Robust Policy Optimization in Deep Reinforcement Learning | Dec 14, 2022 | continuous-control, Continuous Control | Code Available | 0 |
| Generative Question Refinement with Deep Reinforcement Learning in Retrieval-based QA System | Aug 13, 2019 | Deep Reinforcement Learning, Question Answering | Code Available | 0 |
| Generative Market Equilibrium Models with Stable Adversarial Learning via Reinforcement | Apr 5, 2025 | Deep Reinforcement Learning | Code Available | 0 |
| PPO Dash: Improving Generalization in Deep Reinforcement Learning | Jul 15, 2019 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| AirPilot: Interpretable PPO-based DRL Auto-Tuned Nonlinear PID Drone Controller for Robust Autonomous Flights | Mar 30, 2024 | Deep Reinforcement Learning, Drone Controller | Code Available | 0 |
| Visual Foresight: Model-Based Deep Reinforcement Learning for Vision-Based Robotic Control | Dec 3, 2018 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Graph Neural Network-Based Reinforcement Learning for Controlling Biological Networks: The GATTACA Framework | May 5, 2025 | Deep Reinforcement Learning, Graph Neural Network | Code Available | 0 |
| Generative Actor-Critic: An Off-policy Algorithm Using the Push-forward Model | May 8, 2021 | continuous-control, Continuous Control | Code Available | 0 |
| Generating a Graph Colouring Heuristic with Deep Q-Learning and Graph Neural Networks | Apr 8, 2023 | Deep Reinforcement Learning, Graph Neural Network | Code Available | 0 |
| Data Assimilation in Chaotic Systems Using Deep Reinforcement Learning | Jan 1, 2024 | Autonomous Vehicles, Deep Reinforcement Learning | Code Available | 0 |
| UNIDOOR: A Universal Framework for Action-Level Backdoor Attacks in Deep Reinforcement Learning | Jan 26, 2025 | Backdoor Attack, Deep Reinforcement Learning | Code Available | 0 |
| MINOS: Multimodal Indoor Simulator for Navigation in Complex Environments | Dec 11, 2017 | Deep Reinforcement Learning, Navigate | Code Available | 0 |
| Deep Reinforcement Learning with Swin Transformers | Jun 30, 2022 | Atari Games, Deep Reinforcement Learning | Code Available | 0 |