| ColO-RAN: Developing Machine Learning-based xApps for Open RAN Closed-loop Control on Programmable Experimental Platforms | Dec 17, 2021 | Deep Reinforcement LearningScheduling | CodeCode Available | 1 | 5 |
| Collective eXplainable AI: Explaining Cooperative Strategies and Agent Contribution in Multiagent Reinforcement Learning with Shapley Values | Oct 4, 2021 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| A Comparative Study of Deep Reinforcement Learning-based Transferable Energy Management Strategies for Hybrid Electric Vehicles | Feb 22, 2022 | Deep Reinforcement Learningenergy management | CodeCode Available | 1 | 5 |
| A Comparative Study of Deep Reinforcement Learning Models: DQN vs PPO vs A2C | Jul 19, 2024 | Deep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Combining Reinforcement Learning and Constraint Programming for Combinatorial Optimization | Jun 2, 2020 | Combinatorial OptimizationDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Combining Semantic Guidance and Deep Reinforcement Learning For Generating Human Level Paintings | Nov 25, 2020 | Deep Reinforcement LearningModel-based Reinforcement Learning | CodeCode Available | 1 | 5 |
| Comprehensive Training and Evaluation on Deep Reinforcement Learning for Automated Driving in Various Simulated Driving Maneuvers | Jun 20, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| A Competition Winning Deep Reinforcement Learning Agent in microRTS | Feb 12, 2024 | Deep Reinforcement LearningImitation Learning | CodeCode Available | 1 | 5 |
| ColorDynamic: Generalizable, Scalable, Real-time, End-to-end Local Planner for Unstructured and Dynamic Environments | Feb 27, 2025 | Data AugmentationDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| A Comprehensive Survey on Self-Interpretable Neural Networks | Jan 26, 2025 | Deep Reinforcement LearningSurvey | CodeCode Available | 1 | 5 |
| An Application of Deep Reinforcement Learning to Algorithmic Trading | Apr 7, 2020 | Algorithmic TradingDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning | Jun 3, 2021 | Deep Reinforcement LearningModel-based Reinforcement Learning | CodeCode Available | 1 | 5 |
| A2C is a special case of PPO | May 18, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| A Constraint Enforcement Deep Reinforcement Learning Framework for Optimal Energy Storage Systems Dispatch | Jul 26, 2023 | Deep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Continuous Deep Q-Learning with Model-based Acceleration | Mar 2, 2016 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |
| The Animal-AI Environment: A Virtual Laboratory For Comparative Cognition and Artificial Intelligence Research | Dec 18, 2023 | Deep Reinforcement Learning | CodeCode Available | 1 | 5 |
| An Introduction to Deep Reinforcement Learning | Nov 30, 2018 | BIG-bench Machine LearningDecision Making | CodeCode Available | 1 | 5 |
| Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning | Jul 29, 2022 | Contrastive LearningDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| An Optimistic Perspective on Offline Deep Reinforcement Learning | Jan 1, 2020 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| CORE: Towards Scalable and Efficient Causal Discovery with Reinforcement Learning | Jan 30, 2024 | Causal DiscoveryDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| CoTV: Cooperative Control for Traffic Light Signals and Connected Autonomous Vehicles using Deep Reinforcement Learning | Jan 31, 2022 | Autonomous VehiclesDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Action Branching Architectures for Deep Reinforcement Learning | Nov 24, 2017 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |
| AADG: Automatic Augmentation for Domain Generalization on Retinal Image Segmentation | Jul 27, 2022 | Data AugmentationDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| CrossQ: Batch Normalization in Deep Reinforcement Learning for Greater Sample Efficiency and Simplicity | Feb 14, 2019 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |
| A Platform-Agnostic Deep Reinforcement Learning Framework for Effective Sim2Real Transfer towards Autonomous Driving | Apr 14, 2023 | Autonomous DrivingDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Cryptocurrency Portfolio Management with Deep Reinforcement Learning | Dec 5, 2016 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| An actor-critic algorithm with policy gradients to solve the job shop scheduling problem using deep double recurrent agents | Oct 18, 2021 | Deep Reinforcement LearningJob Shop Scheduling | CodeCode Available | 1 | 5 |
| Collaborative Target Search with a Visual Drone Swarm: An Adaptive Curriculum Embedded Multistage Reinforcement Learning Approach | Apr 26, 2022 | Deep Reinforcement LearningManagement | CodeCode Available | 1 | 5 |
| Acme: A Research Framework for Distributed Reinforcement Learning | Jun 1, 2020 | Deep Reinforcement LearningDQN Replay Dataset | CodeCode Available | 1 | 5 |
| A Closer Look at Invalid Action Masking in Policy Gradient Algorithms | Jun 25, 2020 | Deep Reinforcement LearningReal-Time Strategy Games | CodeCode Available | 1 | 5 |
| 2-Level Reinforcement Learning for Ships on Inland Waterways: Path Planning and Following | Jul 25, 2023 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| Amortizing intractable inference in diffusion models for vision, language, and control | May 31, 2024 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |
| Learning Multi-Pursuit Evasion for Safe Targeted Navigation of Drones | Apr 7, 2023 | Deep Reinforcement Learning | CodeCode Available | 1 | 5 |
| A multi-agent reinforcement learning model of common-pool resource appropriation | Jul 20, 2017 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | CodeCode Available | 1 | 5 |
| An End-to-end Deep Reinforcement Learning Approach for the Long-term Short-term Planning on the Frenet Space | Nov 26, 2020 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Combining Deep Reinforcement Learning and Search for Imperfect-Information Games | Jul 27, 2020 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| Continuous control with deep reinforcement learning | Sep 9, 2015 | Action Detectioncontinuous-control | CodeCode Available | 1 | 5 |
| Data-Efficient Reinforcement Learning with Self-Predictive Representations | Jul 12, 2020 | Atari Games 100kData Augmentation | CodeCode Available | 1 | 5 |
| Chip Placement with Deep Reinforcement Learning | Apr 22, 2020 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| Cleanba: A Reproducible and Efficient Distributed Reinforcement Learning Platform | Sep 29, 2023 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| CDT: Cascading Decision Trees for Explainable Reinforcement Learning | Nov 15, 2020 | Deep Reinforcement LearningExplainable Models | CodeCode Available | 1 | 5 |
| AllenAct: A Framework for Embodied AI Research | Aug 28, 2020 | Deep Reinforcement LearningEmbodied Question Answering | CodeCode Available | 1 | 5 |
| Character Controllers Using Motion VAEs | Mar 26, 2021 | Continuous ControlDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| CARL: Controllable Agent with Reinforcement Learning for Quadruped Locomotion | May 7, 2020 | Deep Reinforcement LearningMotion Synthesis | CodeCode Available | 1 | 5 |
| Re4MPC: Reactive Nonlinear MPC for Multi-model Motion Planning via Deep Reinforcement Learning | Jun 10, 2025 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Catalyst.RL: A Distributed Framework for Reproducible RL Research | Feb 28, 2019 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |
| CADRE: A Cascade Deep Reinforcement Learning Framework for Vision-based Autonomous Urban Driving | Feb 17, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| Building a 3-Player Mahjong AI using Deep Reinforcement Learning | Feb 25, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| Can Increasing Input Dimensionality Improve Deep Reinforcement Learning? | Mar 3, 2020 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Training a Resilient Q-Network against Observational Interference | Feb 18, 2021 | Causal InferenceDeep Reinforcement Learning | CodeCode Available | 1 | 5 |