| Beacon, a lightweight deep reinforcement learning benchmark library for flow control | Feb 27, 2024 | BenchmarkingCPU | CodeCode Available | 1 | 5 |
| A2C is a special case of PPO | May 18, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| BeBold: Exploration Beyond the Boundary of Explored Regions | Dec 15, 2020 | Deep Reinforcement LearningEfficient Exploration | CodeCode Available | 1 | 5 |
| A Constraint Enforcement Deep Reinforcement Learning Framework for Optimal Energy Storage Systems Dispatch | Jul 26, 2023 | Deep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Benchmarking Actor-Critic Deep Reinforcement Learning Algorithms for Robotics Control with Action Constraints | Apr 18, 2023 | BenchmarkingDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| A multi-agent reinforcement learning model of common-pool resource appropriation | Jul 20, 2017 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | CodeCode Available | 1 | 5 |
| Benchmarking Deep Reinforcement Learning for Navigation in Denied Sensor Environments | Oct 18, 2024 | Autonomous NavigationBenchmarking | CodeCode Available | 1 | 5 |
| Amortizing intractable inference in diffusion models for vision, language, and control | May 31, 2024 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |
| Benchmarking Reinforcement Learning Techniques for Autonomous Navigation | Oct 10, 2022 | Autonomous NavigationBenchmarking | CodeCode Available | 1 | 5 |
| Learning Multi-Pursuit Evasion for Safe Targeted Navigation of Drones | Apr 7, 2023 | Deep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Delay-Aware Multi-Agent Reinforcement Learning for Cooperative and Competitive Environments | May 11, 2020 | Autonomous VehiclesDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Blockchain Framework for Artificial Intelligence Computation | Feb 23, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| A Deep Reinforcement Learning Approach for Solving the Traveling Salesman Problem with Drone | Dec 22, 2021 | Combinatorial OptimizationComputational Efficiency | CodeCode Available | 1 | 5 |
| An actor-critic algorithm with policy gradients to solve the job shop scheduling problem using deep double recurrent agents | Oct 18, 2021 | Deep Reinforcement LearningJob Shop Scheduling | CodeCode Available | 1 | 5 |
| Bridging RL Theory and Practice with the Effective Horizon | Apr 19, 2023 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 1 | 5 |
| Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning | Oct 23, 2020 | Deep Reinforcement LearningModel-based Reinforcement Learning | CodeCode Available | 1 | 5 |
| CORE: Towards Scalable and Efficient Causal Discovery with Reinforcement Learning | Jan 30, 2024 | Causal DiscoveryDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| Bridging State and History Representations: Understanding Self-Predictive RL | Jan 17, 2024 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 1 | 5 |
| Building a 3-Player Mahjong AI using Deep Reinforcement Learning | Feb 25, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| Action Branching Architectures for Deep Reinforcement Learning | Nov 24, 2017 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |
| MPC-Inspired Reinforcement Learning for Verifiable Model-Free Control | Dec 8, 2023 | Deep Reinforcement LearningModel Predictive Control | CodeCode Available | 1 | 5 |
| Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning | Jul 29, 2022 | Contrastive LearningDeep Reinforcement Learning | CodeCode Available | 1 | 5 |
| CADRE: A Cascade Deep Reinforcement Learning Framework for Vision-based Autonomous Urban Driving | Feb 17, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 | 5 |
| Discriminative Particle Filter Reinforcement Learning for Complex Partial Observations | Feb 23, 2020 | Atari GamesDecision Making | CodeCode Available | 1 | 5 |
| Continuous-Time Fitted Value Iteration for Robust Policies | Oct 5, 2021 | continuous-controlContinuous Control | CodeCode Available | 1 | 5 |