| DouZero+: Improving DouDizhu AI by Opponent Modeling and Coach-guided Learning | Apr 6, 2022 | Deep Reinforcement Learning | CodeCode Available | 1 |
| PAnDR: Fast Adaptation to New Environments from Offline Experiences via Decoupling Policy and Environment Representations | Apr 6, 2022 | Contrastive LearningDecision Making | CodeCode Available | 1 |
| Learning Pneumatic Non-Prehensile Manipulation with a Mobile Blower | Apr 5, 2022 | Deep Reinforcement Learning | CodeCode Available | 1 |
| Quantum Multi-Agent Reinforcement Learning via Variational Quantum Circuit Design | Mar 20, 2022 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Multi-Objective reward generalization: Improving performance of Deep Reinforcement Learning for applications in single-asset trading | Mar 9, 2022 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 1 |
| Deep Reinforcement Learning for Entity Alignment | Mar 7, 2022 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| Model-free Neural Lyapunov Control for Safe Robot Navigation | Mar 2, 2022 | Deep Reinforcement LearningRobot Navigation | CodeCode Available | 1 |
| Affordance Learning from Play for Sample-Efficient Policy Learning | Mar 1, 2022 | Deep Reinforcement LearningMotion Planning | CodeCode Available | 1 |
| Building a 3-Player Mahjong AI using Deep Reinforcement Learning | Feb 25, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Blockchain Framework for Artificial Intelligence Computation | Feb 23, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Using Deep Reinforcement Learning with Automatic Curriculum Learning for Mapless Navigation in Intralogistics | Feb 23, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| A Comparative Study of Deep Reinforcement Learning-based Transferable Energy Management Strategies for Hybrid Electric Vehicles | Feb 22, 2022 | Deep Reinforcement Learningenergy management | CodeCode Available | 1 |
| CADRE: A Cascade Deep Reinforcement Learning Framework for Vision-based Autonomous Urban Driving | Feb 17, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Soft Actor-Critic Deep Reinforcement Learning for Fault Tolerant Flight Control | Feb 16, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Exploring Deep Reinforcement Learning-Assisted Federated Learning for Online Resource Allocation in Privacy-Persevering EdgeIoT | Feb 15, 2022 | Deep Reinforcement LearningEdge-computing | CodeCode Available | 1 |
| Optimizing Sequential Experimental Design with Deep Reinforcement Learning | Feb 2, 2022 | Deep Reinforcement LearningExperimental Design | CodeCode Available | 1 |
| Accelerating Deep Reinforcement Learning for Digital Twin Network Optimization with Evolutionary Strategies | Feb 1, 2022 | Deep Reinforcement LearningManagement | CodeCode Available | 1 |
| CoTV: Cooperative Control for Traffic Light Signals and Connected Autonomous Vehicles using Deep Reinforcement Learning | Jan 31, 2022 | Autonomous VehiclesDeep Reinforcement Learning | CodeCode Available | 1 |
| Graph Convolution-Based Deep Reinforcement Learning for Multi-Agent Decision-Making in Mixed Traffic Environments | Jan 30, 2022 | Autonomous VehiclesDecision Making | CodeCode Available | 1 |
| Mask-based Latent Reconstruction for Reinforcement Learning | Jan 28, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| The First AI4TSP Competition: Learning to Solve Stochastic Routing Problems | Jan 25, 2022 | Combinatorial OptimizationDeep Reinforcement Learning | CodeCode Available | 1 |
| Solving Dynamic Graph Problems with Multi-Attention Deep Reinforcement Learning | Jan 13, 2022 | Combinatorial OptimizationDeep Reinforcement Learning | CodeCode Available | 1 |
| Verified Probabilistic Policies for Deep Reinforcement Learning | Jan 10, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Mirror Learning: A Unifying Framework of Policy Optimisation | Jan 7, 2022 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 1 |
| Balsa: Learning a Query Optimizer Without Expert Demonstrations | Jan 5, 2022 | Deep Reinforcement Learning | CodeCode Available | 1 |
| Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation | Jan 5, 2022 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Hybrid intelligence for dynamic job-shop scheduling with deep reinforcement learning and attention mechanism | Jan 3, 2022 | Deep Reinforcement LearningGraph Representation Learning | CodeCode Available | 1 |
| SimSR: Simple Distance-based State Representation for Deep Reinforcement Learning | Dec 31, 2021 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 |
| Lane Change Decision-Making through Deep Reinforcement Learning | Dec 24, 2021 | Autonomous DrivingDecision Making | CodeCode Available | 1 |
| Safety and Liveness Guarantees through Reach-Avoid Reinforcement Learning | Dec 23, 2021 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 1 |
| A Deep Reinforcement Learning Approach for Solving the Traveling Salesman Problem with Drone | Dec 22, 2021 | Combinatorial OptimizationComputational Efficiency | CodeCode Available | 1 |
| Adversarial Deep Reinforcement Learning for Improving the Robustness of Multi-agent Autonomous Driving Policies | Dec 22, 2021 | Autonomous DrivingDeep Reinforcement Learning | CodeCode Available | 1 |
| Space Non-cooperative Object Active Tracking with Deep Reinforcement Learning | Dec 18, 2021 | Deep Reinforcement LearningPose Estimation | CodeCode Available | 1 |
| ColO-RAN: Developing Machine Learning-based xApps for Open RAN Closed-loop Control on Programmable Experimental Platforms | Dec 17, 2021 | Deep Reinforcement LearningScheduling | CodeCode Available | 1 |
| Stochastic Actor-Executor-Critic for Image-to-Image Translation | Dec 14, 2021 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Faster Deep Reinforcement Learning with Slower Online Network | Dec 10, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Federated Deep Reinforcement Learning for the Distributed Control of NextG Wireless Networks | Dec 7, 2021 | Autonomous VehiclesDeep Reinforcement Learning | CodeCode Available | 1 |
| Functional Regularization for Reinforcement Learning via Learned Fourier Features | Dec 6, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| EDGE: Explaining Deep Reinforcement Learning Policies | Dec 1, 2021 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 |
| Symbolic Regression via Deep Reinforcement Learning Enhanced Genetic Programming Seeding | Dec 1, 2021 | Combinatorial OptimizationDeep Reinforcement Learning | CodeCode Available | 1 |
| Automatic Data Augmentation for Generalization in Reinforcement Learning | Dec 1, 2021 | Data AugmentationDeep Reinforcement Learning | CodeCode Available | 1 |
| NovelD: A Simple yet Effective Exploration Criterion | Dec 1, 2021 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 1 |
| User Allocation in Mobile Edge Computing: A Deep Reinforcement Learning Approach | Nov 11, 2021 | CPUDeep Reinforcement Learning | CodeCode Available | 1 |
| Data-Efficient Deep Reinforcement Learning for Attitude Control of Fixed-Wing UAVs: Field Experiments | Nov 7, 2021 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 1 |
| Robust Deep Reinforcement Learning for Quadcopter Control | Nov 6, 2021 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 |
| Learning Large Neighborhood Search Policy for Integer Programming | Nov 1, 2021 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 1 |
| URLB: Unsupervised Reinforcement Learning Benchmark | Oct 28, 2021 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Learning Domain Invariant Representations in Goal-conditioned Block MDPs | Oct 27, 2021 | Deep Reinforcement LearningDomain Generalization | CodeCode Available | 1 |
| Towards Robust Bisimulation Metric Learning | Oct 27, 2021 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Learning Collaborative Policies to Solve NP-hard Routing Problems | Oct 26, 2021 | Deep Reinforcement LearningTraveling Salesman Problem | CodeCode Available | 1 |