| Accelerated Methods for Deep Reinforcement Learning | Mar 7, 2018 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 2 |
| Flow: A Modular Learning Framework for Mixed Autonomy Traffic | Oct 16, 2017 | Autonomous VehiclesDeep Reinforcement Learning | CodeCode Available | 2 |
| Benchmarking Deep Reinforcement Learning for Continuous Control | Apr 22, 2016 | Action Triplet RecognitionAtari Games | CodeCode Available | 2 |
| Deep Reinforcement Learning with Gradient Eligibility Traces | Jul 12, 2025 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 |
| Re4MPC: Reactive Nonlinear MPC for Multi-model Motion Planning via Deep Reinforcement Learning | Jun 10, 2025 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| The Cell Must Go On: Agar.io for Continual Reinforcement Learning | May 23, 2025 | Continual LearningDeep Reinforcement Learning | CodeCode Available | 1 |
| GATES: Cost-aware Dynamic Workflow Scheduling via Graph Attention Networks and Evolution Strategy | May 18, 2025 | Cloud ComputingDeep Reinforcement Learning | CodeCode Available | 1 |
| Reasoning on a Budget: Miniaturizing DeepSeek R1 with SFT-GRPO Alignment for Instruction-Tuned LLMs | May 16, 2025 | Deep Reinforcement LearningMathematical Reasoning | CodeCode Available | 1 |
| Evaluating Robustness of Deep Reinforcement Learning for Autonomous Surface Vehicle Control in Field Tests | May 15, 2025 | BenchmarkingDeep Reinforcement Learning | CodeCode Available | 1 |
| Enhancing Cooperative Multi-Agent Reinforcement Learning with State Modelling and Adversarial Exploration | May 8, 2025 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Neurophysiologically Realistic Environment for Comparing Adaptive Deep Brain Stimulation Algorithms in Parkinson Disease | Apr 26, 2025 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 1 |
| Learning Decision Trees as Amortized Structure Inference | Mar 10, 2025 | Anomaly DetectionDeep Reinforcement Learning | CodeCode Available | 1 |
| Dynamics-Invariant Quadrotor Control using Scale-Aware Deep Reinforcement Learning | Mar 9, 2025 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Studying the Interplay Between the Actor and Critic Representations in Reinforcement Learning | Mar 8, 2025 | Deep Reinforcement LearningRepresentation Learning | CodeCode Available | 1 |
| Playing Pokémon Red via Deep Reinforcement Learning | Feb 27, 2025 | Deep Reinforcement LearningLanguage Modeling | CodeCode Available | 1 |
| ColorDynamic: Generalizable, Scalable, Real-time, End-to-end Local Planner for Unstructured and Dynamic Environments | Feb 27, 2025 | Data AugmentationDeep Reinforcement Learning | CodeCode Available | 1 |
| Towards Optimal Adversarial Robust Reinforcement Learning with Infinity Measurement Error | Feb 23, 2025 | Adversarial RobustnessDeep Reinforcement Learning | CodeCode Available | 1 |
| Reevaluating Policy Gradient Methods for Imperfect-Information Games | Feb 13, 2025 | counterfactualDeep Reinforcement Learning | CodeCode Available | 1 |
| A Comprehensive Survey on Self-Interpretable Neural Networks | Jan 26, 2025 | Deep Reinforcement LearningSurvey | CodeCode Available | 1 |
| Divergence-Augmented Policy Optimization | Jan 25, 2025 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 1 |
| Hierarchical Deep Reinforcement Learning for Adaptive Resource Management in Integrated Terrestrial and Non-Terrestrial Networks | Jan 16, 2025 | Deep Reinforcement LearningManagement | CodeCode Available | 1 |
| CuAsmRL: Optimizing GPU SASS Schedules via Deep Reinforcement Learning | Jan 14, 2025 | Deep Reinforcement LearningGPU | CodeCode Available | 1 |
| Co-Activation Graph Analysis of Safety-Verified and Explainable Deep Reinforcement Learning Policies | Jan 6, 2025 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| Plug-and-Play PPO: An Adaptive Point Prompt Optimizer Making SAM Greater | Jan 1, 2025 | Deep Reinforcement LearningSegmentation | CodeCode Available | 1 |
| GRAM: Generalization in Deep RL with a Robust Adaptation Module | Dec 5, 2024 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |