| An Introduction to Deep Reinforcement Learning | Nov 30, 2018 | BIG-bench Machine LearningDecision Making | CodeCode Available | 1 | 5 |
| Control-Informed Reinforcement Learning for Chemical Processes | Aug 24, 2024 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| Correlation-aware Cooperative Multigroup Broadcast 360° Video Delivery Network: A Hierarchical Deep Reinforcement Learning Approach | Oct 21, 2020 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 1 | 5 |
| Cryptocurrency Portfolio Management with Deep Reinforcement Learning | Dec 5, 2016 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Continuous Deep Q-Learning with Model-based Acceleration | Mar 2, 2016 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |
| Contention Window Optimization in IEEE 802.11ax Networks with Deep Reinforcement Learning | Mar 3, 2020 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| Continuous-Time Fitted Value Iteration for Robust Policies | Oct 5, 2021 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |
| Computational Performance of Deep Reinforcement Learning to find Nash Equilibria | Apr 26, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| Comprehensive Training and Evaluation on Deep Reinforcement Learning for Automated Driving in Various Simulated Driving Maneuvers | Jun 20, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Connecting Deep-Reinforcement-Learning-based Obstacle Avoidance with Conventional Global Planners using Waypoint Generators | Apr 8, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning | Jul 29, 2022 | Contrastive LearningDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Combining Deep Reinforcement Learning and Search for Imperfect-Information Games | Jul 27, 2020 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| An actor-critic algorithm with policy gradients to solve the job shop scheduling problem using deep double recurrent agents | Oct 18, 2021 | Deep Reinforcement LearningJob Shop Scheduling | CodeCode Available | 1 | 5 |
| Combining Reinforcement Learning and Constraint Programming for Combinatorial Optimization | Jun 2, 2020 | Combinatorial OptimizationDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Learning Multi-Pursuit Evasion for Safe Targeted Navigation of Drones | Apr 7, 2023 | Deep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Accelerating Deep Reinforcement Learning for Digital Twin Network Optimization with Evolutionary Strategies | Feb 1, 2022 | Deep Reinforcement LearningManagement | CodeCode Available | 1 | 5 |
| An End-to-end Deep Reinforcement Learning Approach for the Long-term Short-term Planning on the Frenet Space | Nov 26, 2020 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Comparing Observation and Action Representations for Deep Reinforcement Learning in μRTS | Oct 26, 2019 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| A multi-agent reinforcement learning model of common-pool resource appropriation | Jul 20, 2017 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | CodeCode Available | 1 | 5 |
| An Application of Deep Reinforcement Learning to Algorithmic Trading | Apr 7, 2020 | Algorithmic TradingDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| An Efficient Asynchronous Method for Integrating Evolutionary and Gradient-based Policy Search | Dec 10, 2020 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |
| ContainerGym: A Real-World Reinforcement Learning Benchmark for Resource Allocation | Jul 6, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Continuous control with deep reinforcement learning | Sep 9, 2015 | Action Detectioncontinuous-control | CodeCode Available | 1 | 5 |
| Continuous Coordination As a Realistic Scenario for Lifelong Learning | Mar 4, 2021 | Continual LearningDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Combining Semantic Guidance and Deep Reinforcement Learning For Generating Human Level Paintings | Nov 25, 2020 | Deep Reinforcement LearningModel-based Reinforcement Learning | CodeCode Available | 1 | 5 |
| An experimental evaluation of Deep Reinforcement Learning algorithms for HVAC control | Jan 11, 2024 | Deep Reinforcement LearningIncremental Learning | CodeCode Available | 1 | 5 |
| The Animal-AI Environment: A Virtual Laboratory For Comparative Cognition and Artificial Intelligence Research | Dec 18, 2023 | Deep Reinforcement Learning | CodeCode Available | 1 | 5 |
| An Equivalence between Loss Functions and Non-Uniform Sampling in Experience Replay | Jul 12, 2020 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 | 5 |
| Re4MPC: Reactive Nonlinear MPC for Multi-model Motion Planning via Deep Reinforcement Learning | Jun 10, 2025 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| CoTV: Cooperative Control for Traffic Light Signals and Connected Autonomous Vehicles using Deep Reinforcement Learning | Jan 31, 2022 | Autonomous VehiclesDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| An Optimistic Perspective on Offline Deep Reinforcement Learning | Jan 1, 2020 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Cross-Modal Contrastive Learning of Representations for Navigation using Lightweight, Low-Cost Millimeter Wave Radar for Adverse Environmental Conditions | Jan 10, 2021 | Autonomous NavigationContrastive Learning | CodeCode Available | 1 | 5 |
| A Platform-Agnostic Deep Reinforcement Learning Framework for Effective Sim2Real Transfer towards Autonomous Driving | Apr 14, 2023 | Autonomous DrivingDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Aquatic Navigation: A Challenging Benchmark for Deep Reinforcement Learning | May 30, 2024 | Autonomous DrivingBenchmarking | CodeCode Available | 1 | 5 |
| Curiosity-Driven Energy-Efficient Worker Scheduling in Vehicular Crowdsourcing: A Deep Reinforcement Learning Approach | Apr 24, 2020 | Deep Reinforcement LearningFairness | CodeCode Available | 1 | 5 |
| Curriculum-guided Hindsight Experience Replay | Dec 1, 2019 | Deep Reinforcement LearningDiversity | CodeCode Available | 1 | 5 |
| Accelerated Sim-to-Real Deep Reinforcement Learning: Learning Collision Avoidance from Human Player | Feb 21, 2021 | Collision AvoidanceDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Decentralized Deep Reinforcement Learning for a Distributed and Adaptive Locomotion Controller of a Hexapod Robot | May 21, 2020 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |
| A Reinforcement Learning Environment For Job-Shop Scheduling | Apr 8, 2021 | Combinatorial OptimizationDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| A Reinforcement Learning Based Encoder-Decoder Framework for Learning Stock Trading Rules | Jan 8, 2021 | DecoderDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| ColorDynamic: Generalizable, Scalable, Real-time, End-to-end Local Planner for Unstructured and Dynamic Environments | Feb 27, 2025 | Data AugmentationDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Actor Prioritized Experience Replay | Sep 1, 2022 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |
| Deep Active Inference for Partially Observable MDPs | Sep 8, 2020 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 1 | 5 |
| Deep Actor-Critic Learning for Distributed Power Control in Wireless Mobile Networks | Sep 14, 2020 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| A Closer Look at Invalid Action Masking in Policy Gradient Algorithms | Jun 25, 2020 | Deep Reinforcement LearningReal-Time Strategy Games | CodeCode Available | 1 | 5 |
| 2-Level Reinforcement Learning for Ships on Inland Waterways: Path Planning and Following | Jul 25, 2023 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| Acme: A Research Framework for Distributed Reinforcement Learning | Jun 1, 2020 | Deep Reinforcement LearningDQN Replay Dataset | CodeCode Available | 1 | 5 |
| Deep Generalized Schrödinger Bridge | Sep 20, 2022 | Deep Reinforcement Learning | CodeCode Available | 1 | 5 |
| A Traffic Light Dynamic Control Algorithm with Deep Reinforcement Learning Based on GNN Prediction | Sep 29, 2020 | Deep Reinforcement LearningGraph Neural Network | CodeCode Available | 1 | 5 |
| Amortizing intractable inference in diffusion models for vision, language, and control | May 31, 2024 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |