| Symmetry Considerations for Learning Task Symmetric Robot Policies | Mar 7, 2024 | Data Augmentation, Deep Reinforcement Learning | Code Available | 7 | 5 |
| The Dormant Neuron Phenomenon in Deep Reinforcement Learning | Feb 24, 2023 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 6 | 5 |
| Dynamic Datasets and Market Environments for Financial Reinforcement Learning | Apr 25, 2023 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 6 | 5 |
| FinRL-Meta: Market Environments and Benchmarks for Data-Driven Financial Reinforcement Learning | Nov 6, 2022 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 6 | 5 |
| That Chip Has Sailed: A Critique of Unfounded Skepticism Around AI for Chip Design | Nov 15, 2024 | Deep Reinforcement Learning | Code Available | 5 | 5 |
| DeXtreme: Transfer of Agile In-hand Manipulation from Simulation to Reality | Oct 25, 2022 | Deep Reinforcement Learning, GPU | Code Available | 4 | 5 |
| Discovering faster matrix multiplication algorithms with reinforcement learning | Oct 5, 2022 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 4 | 5 |
| RL4CO: an Extensive Reinforcement Learning for Combinatorial Optimization Benchmark | Jun 29, 2023 | Combinatorial Optimization, Computational Efficiency | Code Available | 4 | 5 |
| Learning Bipedal Walking for Humanoids with Current Feedback | Mar 7, 2023 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Code Available | 3 | 5 |
| CleanRL: High-quality Single-file Implementations of Deep Reinforcement Learning Algorithms | Nov 16, 2021 | Benchmarking, Deep Reinforcement Learning | Code Available | 3 | 5 |
| Automatic Intrinsic Reward Shaping for Exploration in Deep Reinforcement Learning | Jan 26, 2023 | Benchmarking, Deep Reinforcement Learning | Code Available | 3 | 5 |
| One Policy to Run Them All: an End-to-end Learning Approach to Multi-Embodiment Locomotion | Sep 10, 2024 | All, Deep Reinforcement Learning | Code Available | 3 | 5 |
| Dopamine: A Research Framework for Deep Reinforcement Learning | Dec 14, 2018 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 3 | 5 |
| Learning Bipedal Walking On Planned Footsteps For Humanoid Robots | Jul 26, 2022 | Deep Reinforcement Learning, MuJoCo | Code Available | 3 | 5 |
| Deep symbolic regression for physics guided by units constraints: toward the automated discovery of physical laws | Mar 6, 2023 | Deep Reinforcement Learning, regression | Code Available | 3 | 5 |
| Deep Reinforcement Learning | Oct 15, 2018 | Deep Reinforcement Learning, Management | Code Available | 3 | 5 |
| Streaming Deep Reinforcement Learning Finally Works | Oct 18, 2024 | Atari Games, Deep Reinforcement Learning | Code Available | 3 | 5 |
| Tianshou: a Highly Modularized Deep Reinforcement Learning Library | Jul 29, 2021 | Deep Reinforcement Learning, MuJoCo | Code Available | 3 | 5 |
| FinRL: A Deep Reinforcement Learning Library for Automated Stock Trading in Quantitative Finance | Nov 19, 2020 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 3 | 5 |
| Distributed Prioritized Experience Replay | Mar 2, 2018 | Atari Games, Deep Reinforcement Learning | Code Available | 3 | 5 |
| Practical Deep Reinforcement Learning Approach for Stock Trading | Nov 19, 2018 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 3 | 5 |
| ADOPT: Modified Adam Can Converge with Any β_2 with the Optimal Rate | Nov 5, 2024 | Deep Reinforcement Learning, image-classification | Code Available | 3 | 5 |
| Class Symbolic Regression: Gotta Fit 'Em All | Dec 4, 2023 | All, Deep Reinforcement Learning | Code Available | 3 | 5 |
| Rainbow: Combining Improvements in Deep Reinforcement Learning | Oct 6, 2017 | Atari Games, Deep Reinforcement Learning | Code Available | 3 | 5 |
| XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library | Dec 25, 2023 | CPU, Deep Reinforcement Learning | Code Available | 3 | 5 |
| Learning to Walk in Minutes Using Massively Parallel Deep Reinforcement Learning | Sep 24, 2021 | Deep Reinforcement Learning, GPU | Code Available | 2 | 5 |
| Learning Efficient Online 3D Bin Packing on Packing Configuration Trees | Sep 29, 2021 | 3D Bin Packing, Deep Reinforcement Learning | Code Available | 2 | 5 |
| MAPF-GPT: Imitation Learning for Multi-Agent Pathfinding at Scale | Aug 29, 2024 | Deep Reinforcement Learning, Imitation Learning | Code Available | 2 | 5 |
| Learning Practically Feasible Policies for Online 3D Bin Packing | Aug 31, 2021 | 3D Bin Packing, Collision Avoidance | Code Available | 2 | 5 |
| Learning to Solve Job Shop Scheduling under Uncertainty | Mar 4, 2024 | Combinatorial Optimization, Deep Reinforcement Learning | Code Available | 2 | 5 |
| Graph Neural Networks and Deep Reinforcement Learning Based Resource Allocation for V2X Communications | Jul 9, 2024 | Deep Reinforcement Learning | Code Available | 2 | 5 |
| Flow: A Modular Learning Framework for Mixed Autonomy Traffic | Oct 16, 2017 | Autonomous Vehicles, Deep Reinforcement Learning | Code Available | 2 | 5 |
| Habitat 2.0: Training Home Assistants to Rearrange their Habitat | Jun 28, 2021 | Deep Reinforcement Learning, GPU | Code Available | 2 | 5 |
| FinRL-Meta: A Universe of Near-Real Market Environments for Data-Driven Deep Reinforcement Learning in Quantitative Finance | Dec 13, 2021 | Deep Reinforcement Learning, GPU | Code Available | 2 | 5 |
| Efficient World Models with Context-Aware Tokenization | Jun 27, 2024 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Code Available | 2 | 5 |
| Flightmare: A Flexible Quadrotor Simulator | Sep 1, 2020 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 2 | 5 |
| Harfang3D Dog-Fight Sandbox: A Reinforcement Learning Research Platform for the Customized Control Tasks of Fighter Aircrafts | Oct 13, 2022 | Atari Games, Decision Making | Code Available | 2 | 5 |
| MaskPlace: Fast Chip Placement via Reinforced Visual Representation Learning | Nov 24, 2022 | Deep Reinforcement Learning, Layout Design | Code Available | 2 | 5 |
| DIAMBRA Arena: a New Reinforcement Learning Platform for Research and Experimentation | Oct 19, 2022 | Deep Reinforcement Learning, Imitation Learning | Code Available | 2 | 5 |
| Developing A Multi-Agent and Self-Adaptive Framework with Deep Reinforcement Learning for Dynamic Portfolio Risk Management | Feb 1, 2024 | Deep Reinforcement Learning, Management | Code Available | 2 | 5 |
| Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models | May 30, 2018 | Deep Reinforcement Learning, Model-based Reinforcement Learning | Code Available | 2 | 5 |
| Deep Reinforcement Learning for Multi-Agent Interaction | Aug 2, 2022 | BIG-bench Machine Learning, Causal Inference | Code Available | 2 | 5 |
| Revocable Deep Reinforcement Learning with Affinity Regularization for Outlier-Robust Graph Matching | Dec 16, 2020 | Combinatorial Optimization, Decision Making | Code Available | 2 | 5 |
| Decoupling Representation Learning from Reinforcement Learning | Sep 14, 2020 | Data Augmentation, Deep Reinforcement Learning | Code Available | 2 | 5 |
| Deep Reinforcement Learning Based Joint Downlink Beamforming and RIS Configuration in RIS-aided MU-MISO Systems Under Hardware Impairments and Imperfect CSI | Oct 10, 2022 | Deep Reinforcement Learning | Code Available | 2 | 5 |
| Deep Reinforcement Learning with Enhanced PPO for Safe Mobile Robot Navigation | May 25, 2024 | Autonomous Navigation, Deep Reinforcement Learning | Code Available | 2 | 5 |
| DouZero: Mastering DouDizhu with Self-Play Deep Reinforcement Learning | Jun 11, 2021 | Card Games, Deep Reinforcement Learning | Code Available | 2 | 5 |
| ElegantRL-Podracer: Scalable and Elastic Library for Cloud-Native Deep Reinforcement Learning | Dec 11, 2021 | Deep Reinforcement Learning, GPU | Code Available | 2 | 5 |
| Accelerated Policy Learning with Parallel Differentiable Simulation | Apr 14, 2022 | Deep Reinforcement Learning | Code Available | 2 | 5 |
| Combinatorial Client-Master Multiagent Deep Reinforcement Learning for Task Offloading in Mobile Edge Computing | Feb 18, 2024 | Deep Reinforcement Learning, Edge-computing | Code Available | 2 | 5 |