| DouZero+: Improving DouDizhu AI by Opponent Modeling and Coach-guided Learning | Apr 6, 2022 | Deep Reinforcement Learning | CodeCode Available | 1 |
| PAnDR: Fast Adaptation to New Environments from Offline Experiences via Decoupling Policy and Environment Representations | Apr 6, 2022 | Contrastive LearningDecision Making | CodeCode Available | 1 |
| Learning Pneumatic Non-Prehensile Manipulation with a Mobile Blower | Apr 5, 2022 | Deep Reinforcement Learning | CodeCode Available | 1 |
| Quantum Multi-Agent Reinforcement Learning via Variational Quantum Circuit Design | Mar 20, 2022 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Multi-Objective reward generalization: Improving performance of Deep Reinforcement Learning for applications in single-asset trading | Mar 9, 2022 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 1 |
| Deep Reinforcement Learning for Entity Alignment | Mar 7, 2022 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| Model-free Neural Lyapunov Control for Safe Robot Navigation | Mar 2, 2022 | Deep Reinforcement LearningRobot Navigation | CodeCode Available | 1 |
| Affordance Learning from Play for Sample-Efficient Policy Learning | Mar 1, 2022 | Deep Reinforcement LearningMotion Planning | CodeCode Available | 1 |
| Building a 3-Player Mahjong AI using Deep Reinforcement Learning | Feb 25, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Blockchain Framework for Artificial Intelligence Computation | Feb 23, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Using Deep Reinforcement Learning with Automatic Curriculum Learning for Mapless Navigation in Intralogistics | Feb 23, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| A Comparative Study of Deep Reinforcement Learning-based Transferable Energy Management Strategies for Hybrid Electric Vehicles | Feb 22, 2022 | Deep Reinforcement Learningenergy management | CodeCode Available | 1 |
| CADRE: A Cascade Deep Reinforcement Learning Framework for Vision-based Autonomous Urban Driving | Feb 17, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Soft Actor-Critic Deep Reinforcement Learning for Fault Tolerant Flight Control | Feb 16, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Exploring Deep Reinforcement Learning-Assisted Federated Learning for Online Resource Allocation in Privacy-Persevering EdgeIoT | Feb 15, 2022 | Deep Reinforcement LearningEdge-computing | CodeCode Available | 1 |
| Optimizing Sequential Experimental Design with Deep Reinforcement Learning | Feb 2, 2022 | Deep Reinforcement LearningExperimental Design | CodeCode Available | 1 |
| Accelerating Deep Reinforcement Learning for Digital Twin Network Optimization with Evolutionary Strategies | Feb 1, 2022 | Deep Reinforcement LearningManagement | CodeCode Available | 1 |
| CoTV: Cooperative Control for Traffic Light Signals and Connected Autonomous Vehicles using Deep Reinforcement Learning | Jan 31, 2022 | Autonomous VehiclesDeep Reinforcement Learning | CodeCode Available | 1 |
| Graph Convolution-Based Deep Reinforcement Learning for Multi-Agent Decision-Making in Mixed Traffic Environments | Jan 30, 2022 | Autonomous VehiclesDecision Making | CodeCode Available | 1 |
| Mask-based Latent Reconstruction for Reinforcement Learning | Jan 28, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| The First AI4TSP Competition: Learning to Solve Stochastic Routing Problems | Jan 25, 2022 | Combinatorial OptimizationDeep Reinforcement Learning | CodeCode Available | 1 |
| Solving Dynamic Graph Problems with Multi-Attention Deep Reinforcement Learning | Jan 13, 2022 | Combinatorial OptimizationDeep Reinforcement Learning | CodeCode Available | 1 |
| Verified Probabilistic Policies for Deep Reinforcement Learning | Jan 10, 2022 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Mirror Learning: A Unifying Framework of Policy Optimisation | Jan 7, 2022 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 1 |
| Sample Efficient Deep Reinforcement Learning via Uncertainty Estimation | Jan 5, 2022 | continuous-controlContinuous Control | CodeCode Available | 1 |