| Deep Reinforcement Learning for URLLC data management on top of scheduled eMBB traffic | Mar 2, 2021 | Deep Reinforcement LearningManagement | CodeCode Available | 1 | 5 |
| Optimal Transport Perturbations for Safe Reinforcement Learning with Robustness Guarantees | Jan 31, 2023 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |
| An experimental evaluation of Deep Reinforcement Learning algorithms for HVAC control | Jan 11, 2024 | Deep Reinforcement LearningIncremental Learning | CodeCode Available | 1 | 5 |
| Geometric Deep Reinforcement Learning for Dynamic DAG Scheduling | Nov 9, 2020 | Combinatorial OptimizationDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| PAnDR: Fast Adaptation to New Environments from Offline Experiences via Decoupling Policy and Environment Representations | Apr 6, 2022 | Contrastive LearningDecision Making | CodeCode Available | 1 | 5 |
| Parameterized Decision-making with Multi-modal Perception for Autonomous Driving | Dec 19, 2023 | Autonomous DrivingAutonomous Vehicles | CodeCode Available | 1 | 5 |
| PeersimGym: An Environment for Solving the Task Offloading Problem with Reinforcement Learning | Mar 26, 2024 | Deep Reinforcement LearningDistributed Computing | CodeCode Available | 1 | 5 |
| Performance Comparison of Deep RL Algorithms for Energy Systems Optimal Scheduling | Aug 1, 2022 | Deep Reinforcement Learningenergy management | CodeCode Available | 1 | 5 |
| Gradient Surgery for Multi-Task Learning | Jan 19, 2020 | Deep Reinforcement Learningimage-classification | CodeCode Available | 1 | 5 |
| PIC4rl-gym: a ROS2 modular framework for Robots Autonomous Navigation with Deep Reinforcement Learning | Nov 19, 2022 | Autonomous NavigationBenchmarking | CodeCode Available | 1 | 5 |
| Playing Atari with Deep Reinforcement Learning | Dec 19, 2013 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Asynchronous Methods for Deep Reinforcement Learning | Feb 4, 2016 | Atari GamesCPU | CodeCode Available | 1 | 5 |
| Playing Pokémon Red via Deep Reinforcement Learning | Feb 27, 2025 | Deep Reinforcement LearningLanguage Modeling | CodeCode Available | 1 | 5 |
| PlayVirtual: Augmenting Cycle-Consistent Virtual Trajectories for Reinforcement Learning | Jun 8, 2021 | Continuous Control (100k environment steps)Continuous Control (500k environment steps) | CodeCode Available | 1 | 5 |
| Deep Reinforcement Learning with Double Q-learning | Sep 22, 2015 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Tactical Optimism and Pessimism for Deep Reinforcement Learning | Feb 7, 2021 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |
| A fast balance optimization approach for charging enhancement of lithium-ion battery packs through deep reinforcement learning | Apr 24, 2024 | Deep Reinforcement Learningenergy management | CodeCode Available | 1 | 5 |
| SoundSpaces: Audio-Visual Navigation in 3D Environments | Dec 24, 2019 | Deep Reinforcement LearningNavigate | CodeCode Available | 1 | 5 |
| Re4MPC: Reactive Nonlinear MPC for Multi-model Motion Planning via Deep Reinforcement Learning | Jun 10, 2025 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| A Text-based Deep Reinforcement Learning Framework for Interactive Recommendation | Apr 14, 2020 | Deep Reinforcement LearningInteractive Recommendation | CodeCode Available | 1 | 5 |
| Deep Reinforcement Learning with Task-Adaptive Retrieval via Hypernetwork | Jun 19, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Deep Reinforcement Learning with Population-Coded Spiking Neural Network for Continuous Control | Oct 19, 2020 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |
| Deep Reinforcement Trading with Predictable Returns | Apr 29, 2021 | ClusteringDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Pretrained Encoders are All You Need | Jun 9, 2021 | AllContrastive Learning | CodeCode Available | 1 | 5 |
| A Traffic Light Dynamic Control Algorithm with Deep Reinforcement Learning Based on GNN Prediction | Sep 29, 2020 | Deep Reinforcement LearningGraph Neural Network | CodeCode Available | 1 | 5 |
| Deep reinforcement learning-designed radiofrequency waveform in MRI | May 7, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| Affordance Learning from Play for Sample-Efficient Policy Learning | Mar 1, 2022 | Deep Reinforcement LearningMotion Planning | CodeCode Available | 1 | 5 |
| A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation | Jun 12, 2021 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 | 5 |
| Generalizing Across Multi-Objective Reward Functions in Deep Reinforcement Learning | Sep 17, 2018 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| AutoPhase: Compiler Phase-Ordering for High Level Synthesis with Deep Reinforcement Learning | Jan 15, 2019 | Deep Reinforcement LearningHigh-Level Synthesis | CodeCode Available | 1 | 5 |
| Defeating Proactive Jammers Using Deep Reinforcement Learning for Resource-Constrained IoT Networks | Jul 13, 2023 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| Quantum Architecture Search via Deep Reinforcement Learning | Apr 15, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| Delay-Aware Multi-Agent Reinforcement Learning for Cooperative and Competitive Environments | May 11, 2020 | Autonomous VehiclesDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Augmenting Reinforcement Learning with Behavior Primitives for Diverse Manipulation Tasks | Oct 7, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| DeepMimic: Example-Guided Deep Reinforcement Learning of Physics-Based Character Skills | Apr 8, 2018 | Deep Reinforcement LearningMotion Synthesis | CodeCode Available | 1 | 5 |
| Recurrent Hypernetworks are Surprisingly Strong in Meta-RL | Sep 26, 2023 | Deep Reinforcement LearningFew-Shot Learning | CodeCode Available | 1 | 5 |
| Developing an OpenAI Gym-compatible framework and simulation environment for testing Deep Reinforcement Learning agents solving the Ambulance Location Problem | Jan 12, 2021 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Generative Artificial Intelligence (GAI) for Mobile Communications: A Diffusion Model Perspective | Oct 8, 2024 | Deep Reinforcement LearningManagement | CodeCode Available | 1 | 5 |
| Learning Dexterous Grasping with Object-Centric Visual Affordances | Sep 3, 2020 | Deep Reinforcement LearningObject | CodeCode Available | 1 | 5 |
| Reinforced active learning for image segmentation | Feb 16, 2020 | Active LearningDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Age-Based Scheduling for Mobile Edge Computing: A Deep Reinforcement Learning Approach | Dec 1, 2023 | Deep Reinforcement LearningEdge-computing | CodeCode Available | 1 | 5 |
| GRAM: Generalization in Deep RL with a Robust Adaptation Module | Dec 5, 2024 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| Gym-μRTS: Toward Affordable Full Game Real-time Strategy Games Research with Deep Reinforcement Learning | May 21, 2021 | Deep Reinforcement LearningGPU | CodeCode Available | 1 | 5 |
| Differentiable Trust Region Layers for Deep Reinforcement Learning | Jan 22, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| DIMES: A Differentiable Meta Solver for Combinatorial Optimization Problems | Oct 8, 2022 | Combinatorial OptimizationDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| DisCor: Corrective Feedback in Reinforcement Learning via Distribution Correction | Mar 16, 2020 | Deep Reinforcement LearningMeta-Learning | CodeCode Available | 1 | 5 |
| HyperDQN: A Randomized Exploration Method for Deep Reinforcement Learning | Sep 29, 2021 | Deep Reinforcement LearningEfficient Exploration | CodeCode Available | 1 | 5 |
| Reinforcement Learning for Contact-Rich Tasks: Robotic Peg Insertion Strategies | Dec 14, 2020 | Contact-rich ManipulationDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Discovering General Reinforcement Learning Algorithms with Adversarial Environment Design | Oct 4, 2023 | Deep Reinforcement LearningGeneral Reinforcement Learning | CodeCode Available | 1 | 5 |
| Meta-AAD: Active Anomaly Detection with Deep Reinforcement Learning | Sep 16, 2020 | Anomaly DetectionDeep Reinforcement Learning | CodeCode Available | 1 | 5 |