| NROWAN-DQN: A Stable Noisy Network with Noise Reduction and Online Weight Adjustment for Exploration | Jun 19, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Learn to Earn: Enabling Coordination within a Ride Hailing Fleet | Jun 19, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| A Reinforcement Learning Approach for Transient Control of Liquid Rocket Engines | Jun 19, 2020 | Deep Reinforcement LearningModel Predictive Control | —Unverified | 0 |
| WD3: Taming the Estimation Bias in Deep Reinforcement Learning | Jun 18, 2020 | continuous-controlContinuous Control | —Unverified | 0 |
| DREAM: Deep Regret minimization with Advantage baselines and Model-free learning | Jun 18, 2020 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Deep Reinforcement Learning amidst Lifelong Non-Stationarity | Jun 18, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Learning to Track Dynamic Targets in Partially Known Environments | Jun 17, 2020 | Deep Reinforcement LearningNavigate | CodeCode Available | 1 |
| Neural Ordinary Differential Equation Control of Dynamics on Graphs | Jun 17, 2020 | Deep Reinforcement LearningReinforcement Learning (RL) | CodeCode Available | 1 |
| Deep Reinforcement Learning Controller for 3D Path-following and Collision Avoidance by Autonomous Underwater Vehicles | Jun 17, 2020 | Collision AvoidanceDecision Making | —Unverified | 0 |
| Forgetful Experience Replay in Hierarchical Reinforcement Learning from Demonstrations | Jun 17, 2020 | Deep Reinforcement LearningHierarchical Reinforcement Learning | CodeCode Available | 1 |
| Learning What to Defer for Maximum Independent Sets | Jun 17, 2020 | Combinatorial OptimizationDeep Reinforcement Learning | CodeCode Available | 1 |
| Agent Modelling under Partial Observability for Deep Reinforcement Learning | Jun 16, 2020 | DecoderDeep Reinforcement Learning | CodeCode Available | 1 |
| Solving the Order Batching and Sequencing Problem using Deep Reinforcement Learning | Jun 16, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| COLREG-Compliant Collision Avoidance for Unmanned Surface Vehicle using Deep Reinforcement Learning | Jun 16, 2020 | Autonomous VehiclesCollision Avoidance | —Unverified | 0 |
| Index Selection for NoSQL Database with Deep Reinforcement Learning | Jun 16, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Designing high-fidelity multi-qubit gates for semiconductor quantum dots through deep reinforcement learning | Jun 15, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Pipeline PSRO: A Scalable Approach for Finding Approximate Nash Equilibria in Large Games | Jun 15, 2020 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms in Cooperative Tasks | Jun 14, 2020 | BenchmarkingDeep Reinforcement Learning | CodeCode Available | 1 |
| StarCraft II Build Order Optimization using Deep Reinforcement Learning and Monte-Carlo Tree Search | Jun 12, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Systematic Generalisation through Task Temporal Logic and Deep Reinforcement Learning | Jun 12, 2020 | Deep Reinforcement LearningNegation | —Unverified | 0 |
| Continuous Control for Searching and Planning with a Learned Model | Jun 12, 2020 | continuous-controlContinuous Control | —Unverified | 0 |
| Decorrelated Double Q-learning | Jun 12, 2020 | continuous-controlContinuous Control | —Unverified | 0 |
| Deep Reinforcement Learning for Neural Control | Jun 12, 2020 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Mutual Information Based Knowledge Transfer Under State-Action Dimension Mismatch | Jun 12, 2020 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| A Brief Look at Generalization in Visual Meta-Reinforcement Learning | Jun 12, 2020 | Deep Reinforcement LearningMeta Reinforcement Learning | —Unverified | 0 |