| A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning | Jun 3, 2021 | Deep Reinforcement Learning, Model-based Reinforcement Learning | Code Available | 1 |
| Learning Discrete World Models for Heuristic Search | Sep 14, 2024 | Deep Reinforcement Learning, Heuristic Search | Code Available | 1 |
| Beacon, a lightweight deep reinforcement learning benchmark library for flow control | Feb 27, 2024 | Benchmarking, CPU | Code Available | 1 |
| Learning Financial Asset-Specific Trading Rules via Deep Reinforcement Learning | Oct 27, 2020 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Amortizing intractable inference in diffusion models for vision, language, and control | May 31, 2024 | continuous-control, Continuous Control | Code Available | 1 |
| Learning Multi-Pursuit Evasion for Safe Targeted Navigation of Drones | Apr 7, 2023 | Deep Reinforcement Learning | Code Available | 1 |
| Learning Improvement Heuristics for Solving Routing Problems | Dec 12, 2019 | Deep Reinforcement Learning, Reinforcement Learning | Code Available | 1 |
| Learning Large Neighborhood Search Policy for Integer Programming | Nov 1, 2021 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Code Available | 1 |
| A Constraint Enforcement Deep Reinforcement Learning Framework for Optimal Energy Storage Systems Dispatch | Jul 26, 2023 | Deep Reinforcement Learning | Code Available | 1 |
| Learning Off-Policy with Online Planning | Aug 23, 2020 | ARC, Continuous Control | Code Available | 1 |
| A multi-agent reinforcement learning model of common-pool resource appropriation | Jul 20, 2017 | Deep Reinforcement Learning, Multi-agent Reinforcement Learning | Code Available | 1 |
| Balsa: Learning a Query Optimizer Without Expert Demonstrations | Jan 5, 2022 | Deep Reinforcement Learning | Code Available | 1 |
| Learning Symbolic Rules over Abstract Meaning Representations for Textual Reinforcement Learning | Jul 5, 2023 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Learning Synergies between Pushing and Grasping with Self-supervised Deep Reinforcement Learning | Mar 27, 2018 | Deep Reinforcement Learning, Q-Learning | Code Available | 1 |
| An actor-critic algorithm with policy gradients to solve the job shop scheduling problem using deep double recurrent agents | Oct 18, 2021 | Deep Reinforcement Learning, Job Shop Scheduling | Code Available | 1 |
| Learning to Identify Critical States for Reinforcement Learning from Videos | Aug 15, 2023 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| A Deep Reinforcement Learning Algorithm Using Dynamic Attention Model for Vehicle Routing Problems | Feb 9, 2020 | Combinatorial Optimization, Decoder | Code Available | 1 |
| Learning to Play No-Press Diplomacy with Best Response Policy Iteration | Jun 8, 2020 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Code Available | 1 |
| BeBold: Exploration Beyond the Boundary of Explored Regions | Dec 15, 2020 | Deep Reinforcement Learning, Efficient Exploration | Code Available | 1 |
| Deep Reinforcement Learning Guided Improvement Heuristic for Job Shop Scheduling | Nov 20, 2022 | Deep Reinforcement Learning, Graph Neural Network | Code Available | 1 |
| Bridging RL Theory and Practice with the Effective Horizon | Apr 19, 2023 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Code Available | 1 |
| Learning to Track Dynamic Targets in Partially Known Environments | Jun 17, 2020 | Deep Reinforcement Learning, Navigate | Code Available | 1 |
| Learning Trajectories for Visual-Inertial System Calibration via Model-based Heuristic Deep Reinforcement Learning | Nov 4, 2020 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Learning View and Target Invariant Visual Servoing for Navigation | Mar 4, 2020 | Deep Reinforcement Learning, Reinforcement Learning | Code Available | 1 |
| Lenient Multi-Agent Deep Reinforcement Learning | Jul 14, 2017 | Deep Reinforcement Learning, Multi-agent Reinforcement Learning | Code Available | 1 |
| Leveraging Procedural Generation for Learning Autonomous Peg-in-Hole Assembly in Space | May 2, 2024 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| A Deep Reinforcement Learning Approach for Solving the Traveling Salesman Problem with Drone | Dec 22, 2021 | Combinatorial Optimization, Computational Efficiency | Code Available | 1 |
| Logic and the 2-Simplicial Transformer | May 1, 2020 | Deep Reinforcement Learning, Inductive Bias | Code Available | 1 |
| A Deep Reinforcement Learning Approach for Finding Non-Exploitable Strategies in Two-Player Atari Games | Jul 18, 2022 | Atari Games, Deep Reinforcement Learning | Code Available | 1 |
| Making Sense of Vision and Touch: Self-Supervised Learning of Multimodal Representations for Contact-Rich Tasks | Oct 24, 2018 | Contact-rich Manipulation, Deep Reinforcement Learning | Code Available | 1 |
| AutoPhase: Juggling HLS Phase Orderings in Random Forests with Deep Reinforcement Learning | Mar 2, 2020 | Deep Reinforcement Learning, High-Level Synthesis | Code Available | 1 |
| Marathon Environments: Multi-Agent Continuous Control Benchmarks in a Modern Video Game Engine | Feb 25, 2019 | continuous-control, Continuous Control | Code Available | 1 |
| Mask-based Latent Reconstruction for Reinforcement Learning | Jan 28, 2022 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| AutoPhase: Compiler Phase-Ordering for High Level Synthesis with Deep Reinforcement Learning | Jan 15, 2019 | Deep Reinforcement Learning, High-Level Synthesis | Code Available | 1 |
| An Application of Deep Reinforcement Learning to Algorithmic Trading | Apr 7, 2020 | Algorithmic Trading, Deep Reinforcement Learning | Code Available | 1 |
| Maximum a Posteriori Policy Optimisation | Jun 14, 2018 | continuous-control, Continuous Control | Code Available | 1 |
| AutoShard: Automated Embedding Table Sharding for Recommender Systems | Aug 12, 2022 | Deep Reinforcement Learning, Recommendation Systems | Code Available | 1 |
| Memory Gym: Towards Endless Tasks to Benchmark Memory Capabilities of Agents | Sep 29, 2023 | Decision Making, Deep Reinforcement Learning | Code Available | 1 |
| Meta-AAD: Active Anomaly Detection with Deep Reinforcement Learning | Sep 16, 2020 | Anomaly Detection, Deep Reinforcement Learning | Code Available | 1 |
| Meta-Learning-Based Deep Reinforcement Learning for Multiobjective Optimization Problems | May 6, 2021 | Combinatorial Optimization, Deep Reinforcement Learning | Code Available | 1 |
| Action Branching Architectures for Deep Reinforcement Learning | Nov 24, 2017 | continuous-control, Continuous Control | Code Available | 1 |
| An Efficient Asynchronous Method for Integrating Evolutionary and Gradient-based Policy Search | Dec 10, 2020 | continuous-control, Continuous Control | Code Available | 1 |
| A Platform-Agnostic Deep Reinforcement Learning Framework for Effective Sim2Real Transfer towards Autonomous Driving | Apr 14, 2023 | Autonomous Driving, Deep Reinforcement Learning | Code Available | 1 |
| Mitigating Adversarial Perturbations for Deep Reinforcement Learning via Vector Quantization | Oct 4, 2024 | Deep Reinforcement Learning, Quantization | Code Available | 1 |
| AADG: Automatic Augmentation for Domain Generalization on Retinal Image Segmentation | Jul 27, 2022 | Data Augmentation, Deep Reinforcement Learning | Code Available | 1 |
| Model-Based Transfer Learning for Contextual Reinforcement Learning | Aug 8, 2024 | Bayesian Optimization, continuous-control | Code Available | 1 |
| Latent Imagination Facilitates Zero-Shot Transfer in Autonomous Racing | Mar 8, 2021 | Autonomous Racing, continuous-control | Code Available | 1 |
| Model-free Deep Reinforcement Learning for Urban Autonomous Driving | Apr 20, 2019 | Autonomous Driving, Decision Making | Code Available | 1 |
| Model Predictive Actor-Critic: Accelerating Robot Skill Acquisition with Deep Reinforcement Learning | Mar 25, 2021 | Deep Reinforcement Learning, Model-based Reinforcement Learning | Code Available | 1 |
| Autonomous Driving using Residual Sensor Fusion and Deep Reinforcement Learning | Dec 27, 2023 | Autonomous Driving, Decision Making | Code Available | 1 |