| Relation-Aware Transformer for Portfolio Policy Learning | Jul 1, 2020 | Deep Reinforcement LearningRelation | CodeCode Available | 1 |
| UAV Path Planning for Wireless Data Harvesting: A Deep Reinforcement Learning Approach | Jul 1, 2020 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| MDP Homomorphic Networks: Group Symmetries in Reinforcement Learning | Jun 30, 2020 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Online 3D Bin Packing with Constrained Deep Reinforcement Learning | Jun 26, 2020 | 3D Bin PackingCollision Avoidance | CodeCode Available | 1 |
| Widening the Pipeline in Human-Guided Reinforcement Learning with Explanation and Context-Aware Data Augmentation | Jun 26, 2020 | Atari GamesData Augmentation | CodeCode Available | 1 |
| A Closer Look at Invalid Action Masking in Policy Gradient Algorithms | Jun 25, 2020 | Deep Reinforcement LearningReal-Time Strategy Games | CodeCode Available | 1 |
| Automatic Data Augmentation for Generalization in Deep Reinforcement Learning | Jun 23, 2020 | Data AugmentationDeep Reinforcement Learning | CodeCode Available | 1 |
| Experience Replay with Likelihood-free Importance Weights | Jun 23, 2020 | Deep Reinforcement LearningOpenAI Gym | CodeCode Available | 1 |
| DREAM: Deep Regret minimization with Advantage baselines and Model-free learning | Jun 18, 2020 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Learning to Track Dynamic Targets in Partially Known Environments | Jun 17, 2020 | Deep Reinforcement LearningNavigate | CodeCode Available | 1 |