| Contrastive Variational Reinforcement Learning for Complex Observations | Aug 6, 2020 | Atari Games, Continuous Control | Code Available | 1 | 5 |
| CORE: Towards Scalable and Efficient Causal Discovery with Reinforcement Learning | Jan 30, 2024 | Causal Discovery, Deep Reinforcement Learning | Code Available | 1 | 5 |
| Crowd-Robot Interaction: Crowd-aware Robot Navigation with Attention-based Deep Reinforcement Learning | Sep 24, 2018 | Deep Reinforcement Learning, Human Dynamics | Code Available | 1 | 5 |
| Decomposed Soft Actor-Critic Method for Cooperative Multi-Agent Reinforcement Learning | Apr 14, 2021 | counterfactual, Deep Reinforcement Learning | Code Available | 1 | 5 |
| Continuous Coordination As a Realistic Scenario for Lifelong Learning | Mar 4, 2021 | Continual Learning, Deep Reinforcement Learning | Code Available | 1 | 5 |
| ContainerGym: A Real-World Reinforcement Learning Benchmark for Resource Allocation | Jul 6, 2023 | Decision Making, Deep Reinforcement Learning | Code Available | 1 | 5 |
| Continuous Deep Q-Learning with Model-based Acceleration | Mar 2, 2016 | continuous-control, Continuous Control | Code Available | 1 | 5 |
| Computational Performance of Deep Reinforcement Learning to find Nash Equilibria | Apr 26, 2021 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 | 5 |
| An End-to-end Deep Reinforcement Learning Approach for the Long-term Short-term Planning on the Frenet Space | Nov 26, 2020 | Decision Making, Deep Reinforcement Learning | Code Available | 1 | 5 |
| Comparing Observation and Action Representations for Deep Reinforcement Learning in μRTS | Oct 26, 2019 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 | 5 |
| Continuous-Time Fitted Value Iteration for Robust Policies | Oct 5, 2021 | continuous-control, Continuous Control | Code Available | 1 | 5 |
| An actor-critic algorithm with policy gradients to solve the job shop scheduling problem using deep double recurrent agents | Oct 18, 2021 | Deep Reinforcement Learning, Job Shop Scheduling | Code Available | 1 | 5 |
| Combining Deep Reinforcement Learning and Search for Imperfect-Information Games | Jul 27, 2020 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 | 5 |
| Learning Multi-Pursuit Evasion for Safe Targeted Navigation of Drones | Apr 7, 2023 | Deep Reinforcement Learning | Code Available | 1 | 5 |
| Accelerating Deep Reinforcement Learning for Digital Twin Network Optimization with Evolutionary Strategies | Feb 1, 2022 | Deep Reinforcement Learning, Management | Code Available | 1 | 5 |
| A multi-agent reinforcement learning model of common-pool resource appropriation | Jul 20, 2017 | Deep Reinforcement Learning, Multi-agent Reinforcement Learning | Code Available | 1 | 5 |
| Combining Reinforcement Learning and Constraint Programming for Combinatorial Optimization | Jun 2, 2020 | Combinatorial Optimization, Deep Reinforcement Learning | Code Available | 1 | 5 |
| Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in Cooperative Tasks | Jun 14, 2020 | Benchmarking, Deep Reinforcement Learning | Code Available | 1 | 5 |
| Comprehensive Training and Evaluation on Deep Reinforcement Learning for Automated Driving in Various Simulated Driving Maneuvers | Jun 20, 2023 | Decision Making, Deep Reinforcement Learning | Code Available | 1 | 5 |
| An Application of Deep Reinforcement Learning to Algorithmic Trading | Apr 7, 2020 | Algorithmic Trading, Deep Reinforcement Learning | Code Available | 1 | 5 |
| An Efficient Asynchronous Method for Integrating Evolutionary and Gradient-based Policy Search | Dec 10, 2020 | continuous-control, Continuous Control | Code Available | 1 | 5 |
| Connecting Deep-Reinforcement-Learning-based Obstacle Avoidance with Conventional Global Planners using Waypoint Generators | Apr 8, 2021 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 | 5 |
| Contention Window Optimization in IEEE 802.11ax Networks with Deep Reinforcement Learning | Mar 3, 2020 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 | 5 |
| Continuous control with deep reinforcement learning | Sep 9, 2015 | Action Detection, continuous-control | Code Available | 1 | 5 |
| An Introduction to Deep Reinforcement Learning | Nov 30, 2018 | BIG-bench Machine Learning, Decision Making | Code Available | 1 | 5 |
| An experimental evaluation of Deep Reinforcement Learning algorithms for HVAC control | Jan 11, 2024 | Deep Reinforcement Learning, Incremental Learning | Code Available | 1 | 5 |
| An Equivalence between Loss Functions and Non-Uniform Sampling in Experience Replay | Jul 12, 2020 | Deep Reinforcement Learning, MuJoCo | Code Available | 1 | 5 |
| Control-Informed Reinforcement Learning for Chemical Processes | Aug 24, 2024 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 | 5 |
| Re4MPC: Reactive Nonlinear MPC for Multi-model Motion Planning via Deep Reinforcement Learning | Jun 10, 2025 | Decision Making, Deep Reinforcement Learning | Code Available | 1 | 5 |
| Correlation-aware Cooperative Multigroup Broadcast 360° Video Delivery Network: A Hierarchical Deep Reinforcement Learning Approach | Oct 21, 2020 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Code Available | 1 | 5 |
| An Optimistic Perspective on Offline Deep Reinforcement Learning | Jan 1, 2020 | Atari Games, Deep Reinforcement Learning | Code Available | 1 | 5 |
| CPU frequency scheduling of real-time applications on embedded devices with temporal encoding-based deep reinforcement learning | Sep 7, 2023 | CPU, Deep Reinforcement Learning | Code Available | 1 | 5 |
| A Platform-Agnostic Deep Reinforcement Learning Framework for Effective Sim2Real Transfer towards Autonomous Driving | Apr 14, 2023 | Autonomous Driving, Deep Reinforcement Learning | Code Available | 1 | 5 |
| Aquatic Navigation: A Challenging Benchmark for Deep Reinforcement Learning | May 30, 2024 | Autonomous Driving, Benchmarking | Code Available | 1 | 5 |
| CuAsmRL: Optimizing GPU SASS Schedules via Deep Reinforcement Learning | Jan 14, 2025 | Deep Reinforcement Learning, GPU | Code Available | 1 | 5 |
| Curiosity-Driven Energy-Efficient Worker Scheduling in Vehicular Crowdsourcing: A Deep Reinforcement Learning Approach | Apr 24, 2020 | Deep Reinforcement Learning, Fairness | Code Available | 1 | 5 |
| Data-Efficient Reinforcement Learning with Self-Predictive Representations | Jul 12, 2020 | Atari Games 100k, Data Augmentation | Code Available | 1 | 5 |
| Accelerated Sim-to-Real Deep Reinforcement Learning: Learning Collision Avoidance from Human Player | Feb 21, 2021 | Collision Avoidance, Deep Reinforcement Learning | Code Available | 1 | 5 |
| A Reinforcement Learning Environment For Job-Shop Scheduling | Apr 8, 2021 | Combinatorial Optimization, Deep Reinforcement Learning | Code Available | 1 | 5 |
| A Reinforcement Learning Based Encoder-Decoder Framework for Learning Stock Trading Rules | Jan 8, 2021 | Decoder, Deep Reinforcement Learning | Code Available | 1 | 5 |
| ColO-RAN: Developing Machine Learning-based xApps for Open RAN Closed-loop Control on Programmable Experimental Platforms | Dec 17, 2021 | Deep Reinforcement Learning, Scheduling | Code Available | 1 | 5 |
| DeepACO: Neural-enhanced Ant Systems for Combinatorial Optimization | Sep 25, 2023 | Combinatorial Optimization, Deep Reinforcement Learning | Code Available | 1 | 5 |
| Deep Actor-Critic Learning for Distributed Power Control in Wireless Mobile Networks | Sep 14, 2020 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 | 5 |
| Deep Deterministic Portfolio Optimization | Mar 13, 2020 | Deep Reinforcement Learning, Portfolio Optimization | Code Available | 1 | 5 |
| A Closer Look at Invalid Action Masking in Policy Gradient Algorithms | Jun 25, 2020 | Deep Reinforcement Learning, Real-Time Strategy Games | Code Available | 1 | 5 |
| 2-Level Reinforcement Learning for Ships on Inland Waterways: Path Planning and Following | Jul 25, 2023 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 | 5 |
| Acme: A Research Framework for Distributed Reinforcement Learning | Jun 1, 2020 | Deep Reinforcement Learning, DQN Replay Dataset | Code Available | 1 | 5 |
| Deep Intrinsically Motivated Exploration in Continuous Control | Oct 1, 2022 | continuous-control, Continuous Control | Code Available | 1 | 5 |
| Actor Prioritized Experience Replay | Sep 1, 2022 | continuous-control, Continuous Control | Code Available | 1 | 5 |
| Amortizing intractable inference in diffusion models for vision, language, and control | May 31, 2024 | continuous-control, Continuous Control | Code Available | 1 | 5 |