| Deep Reinforcement Learning for Cost-Effective Medical Diagnosis | Feb 20, 2023 | Anomaly Detection, Deep Reinforcement Learning | Code Available | 1 | 5 |
| Deep Reinforcement Learning for Entity Alignment | Mar 7, 2022 | Decision Making, Deep Reinforcement Learning | Code Available | 1 | 5 |
| Actor Prioritized Experience Replay | Sep 1, 2022 | continuous-control, Continuous Control | Code Available | 1 | 5 |
| Automated Cloud Provisioning on AWS using Deep Reinforcement Learning | Sep 13, 2017 | Cloud Computing, Deep Reinforcement Learning | Code Available | 1 | 5 |
| Deep reinforcement learning for large-scale epidemic control | Mar 30, 2020 | Computational Efficiency, Deep Reinforcement Learning | Code Available | 1 | 5 |
| Deep Reinforcement Learning for List-wise Recommendations | Dec 30, 2017 | Deep Reinforcement Learning, Recommendation Systems | Code Available | 1 | 5 |
| A Comparative Study of Deep Reinforcement Learning-based Transferable Energy Management Strategies for Hybrid Electric Vehicles | Feb 22, 2022 | Deep Reinforcement Learning, energy management | Code Available | 1 | 5 |
| Automating DBSCAN via Deep Reinforcement Learning | Aug 9, 2022 | Clustering, Computational Efficiency | Code Available | 1 | 5 |
| A Comparative Study of Deep Reinforcement Learning Models: DQN vs PPO vs A2C | Jul 19, 2024 | Deep Reinforcement Learning | Code Available | 1 | 5 |
| Cross-Modal Contrastive Learning of Representations for Navigation using Lightweight, Low-Cost Millimeter Wave Radar for Adverse Environmental Conditions | Jan 10, 2021 | Autonomous Navigation, Contrastive Learning | Code Available | 1 | 5 |
| Deep Reinforcement Learning for Real-Time Optimization of Pumps in Water Distribution Systems | Oct 13, 2020 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 | 5 |
| Deep Reinforcement Learning for Resource Allocation in Business Processes | Mar 29, 2021 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 | 5 |
| Autonomous Driving using Residual Sensor Fusion and Deep Reinforcement Learning | Dec 27, 2023 | Autonomous Driving, Decision Making | Code Available | 1 | 5 |
| Autonomous Exploration Under Uncertainty via Deep Reinforcement Learning on Graphs | Jul 24, 2020 | Decision Making, Deep Reinforcement Learning | Code Available | 1 | 5 |
| Avalon: A Benchmark for RL Generalization Using Procedurally Generated Worlds | Oct 24, 2022 | Deep Reinforcement Learning, Navigate | Code Available | 1 | 5 |
| AutoPhase: Juggling HLS Phase Orderings in Random Forests with Deep Reinforcement Learning | Mar 2, 2020 | Deep Reinforcement Learning, High-Level Synthesis | Code Available | 1 | 5 |
| A Competition Winning Deep Reinforcement Learning Agent in microRTS | Feb 12, 2024 | Deep Reinforcement Learning, Imitation Learning | Code Available | 1 | 5 |
| AutoShard: Automated Embedding Table Sharding for Recommender Systems | Aug 12, 2022 | Deep Reinforcement Learning, Recommendation Systems | Code Available | 1 | 5 |
| Crowd-Robot Interaction: Crowd-aware Robot Navigation with Attention-based Deep Reinforcement Learning | Sep 24, 2018 | Deep Reinforcement Learning, Human Dynamics | Code Available | 1 | 5 |
| Data-Efficient Reinforcement Learning with Self-Predictive Representations | Jul 12, 2020 | Atari Games 100k, Data Augmentation | Code Available | 1 | 5 |
| Deep Active Inference for Partially Observable MDPs | Sep 8, 2020 | Deep Reinforcement Learning, Q-Learning | Code Available | 1 | 5 |
| Deep Reinforcement Learning Control of Quantum Cartpoles | Oct 21, 2019 | Deep Learning, Deep Reinforcement Learning | Code Available | 1 | 5 |
| A multi-agent reinforcement learning model of common-pool resource appropriation | Jul 20, 2017 | Deep Reinforcement Learning, Multi-agent Reinforcement Learning | Code Available | 1 | 5 |
| Balsa: Learning a Query Optimizer Without Expert Demonstrations | Jan 5, 2022 | Deep Reinforcement Learning | Code Available | 1 | 5 |
| A Consciousness-Inspired Planning Agent for Model-Based Reinforcement Learning | Jun 3, 2021 | Deep Reinforcement Learning, Model-based Reinforcement Learning | Code Available | 1 | 5 |
| Boosting Soft Actor-Critic: Emphasizing Recent Experience without Forgetting the Past | Jun 10, 2019 | Deep Reinforcement Learning, MuJoCo | Code Available | 1 | 5 |
| A2C is a special case of PPO | May 18, 2022 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 | 5 |
| Benchmarking Actor-Critic Deep Reinforcement Learning Algorithms for Robotics Control with Action Constraints | Apr 18, 2023 | Benchmarking, Deep Reinforcement Learning | Code Available | 1 | 5 |
| A Constraint Enforcement Deep Reinforcement Learning Framework for Optimal Energy Storage Systems Dispatch | Jul 26, 2023 | Deep Reinforcement Learning | Code Available | 1 | 5 |
| BeBold: Exploration Beyond the Boundary of Explored Regions | Dec 15, 2020 | Deep Reinforcement Learning, Efficient Exploration | Code Available | 1 | 5 |
| Amortizing intractable inference in diffusion models for vision, language, and control | May 31, 2024 | continuous-control, Continuous Control | Code Available | 1 | 5 |
| Learning Multi-Pursuit Evasion for Safe Targeted Navigation of Drones | Apr 7, 2023 | Deep Reinforcement Learning | Code Available | 1 | 5 |
| A Deep Reinforcement Learning Algorithm Using Dynamic Attention Model for Vehicle Routing Problems | Feb 9, 2020 | Combinatorial Optimization, Decoder | Code Available | 1 | 5 |
| Benchmarking Reinforcement Learning Techniques for Autonomous Navigation | Oct 10, 2022 | Autonomous Navigation, Benchmarking | Code Available | 1 | 5 |
| Beyond The Rainbow: High Performance Deep Reinforcement Learning on a Desktop PC | Nov 6, 2024 | Computational Efficiency, Deep Reinforcement Learning | Code Available | 1 | 5 |
| Defeating Proactive Jammers Using Deep Reinforcement Learning for Resource-Constrained IoT Networks | Jul 13, 2023 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 | 5 |
| An actor-critic algorithm with policy gradients to solve the job shop scheduling problem using deep double recurrent agents | Oct 18, 2021 | Deep Reinforcement Learning, Job Shop Scheduling | Code Available | 1 | 5 |
| A Deep Reinforcement Learning Approach for Solving the Traveling Salesman Problem with Drone | Dec 22, 2021 | Combinatorial Optimization, Computational Efficiency | Code Available | 1 | 5 |
| A Deep Reinforcement Learning Approach for Finding Non-Exploitable Strategies in Two-Player Atari Games | Jul 18, 2022 | Atari Games, Deep Reinforcement Learning | Code Available | 1 | 5 |
| Bridging Imagination and Reality for Model-Based Deep Reinforcement Learning | Oct 23, 2020 | Deep Reinforcement Learning, Model-based Reinforcement Learning | Code Available | 1 | 5 |
| CORE: Towards Scalable and Efficient Causal Discovery with Reinforcement Learning | Jan 30, 2024 | Causal Discovery, Deep Reinforcement Learning | Code Available | 1 | 5 |
| DHRL-FNMR: An Intelligent Multicast Routing Approach Based on Deep Hierarchical Reinforcement Learning in SDN | May 30, 2023 | Combinatorial Optimization, Deep Reinforcement Learning | Code Available | 1 | 5 |
| Bridging RL Theory and Practice with the Effective Horizon | Apr 19, 2023 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Code Available | 1 | 5 |
| Bayesian Soft Actor-Critic: A Directed Acyclic Strategy Graph Based Deep Reinforcement Learning | Aug 11, 2022 | continuous-control, Continuous Control | Code Available | 1 | 5 |
| Action Branching Architectures for Deep Reinforcement Learning | Nov 24, 2017 | continuous-control, Continuous Control | Code Available | 1 | 5 |
| Bridging State and History Representations: Understanding Self-Predictive RL | Jan 17, 2024 | Deep Reinforcement Learning, Reinforcement Learning (RL) | Code Available | 1 | 5 |
| AADG: Automatic Augmentation for Domain Generalization on Retinal Image Segmentation | Jul 27, 2022 | Data Augmentation, Deep Reinforcement Learning | Code Available | 1 | 5 |
| CADRE: A Cascade Deep Reinforcement Learning Framework for Vision-based Autonomous Urban Driving | Feb 17, 2022 | Deep Reinforcement Learning, reinforcement-learning | Code Available | 1 | 5 |
| Contrastive UCB: Provably Efficient Contrastive Self-Supervised Learning in Online Reinforcement Learning | Jul 29, 2022 | Contrastive Learning, Deep Reinforcement Learning | Code Available | 1 | 5 |
| Continuous-Time Fitted Value Iteration for Robust Policies | Oct 5, 2021 | continuous-control, Continuous Control | Code Available | 1 | 5 |