| Hindsight Reward Tweaking via Conditional Deep Reinforcement Learning | Sep 6, 2021 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| Hierarchical Object-to-Zone Graph for Object Navigation | Sep 5, 2021 | Deep Reinforcement LearningObject | CodeCode Available | 1 |
| Soft Hierarchical Graph Recurrent Networks for Many-Agent Partially Observable Environments | Sep 5, 2021 | Deep Reinforcement LearningGraph Attention | —Unverified | 0 |
| An Exploration of Deep Learning Methods in Hungry Geese | Sep 5, 2021 | Deep LearningDeep Reinforcement Learning | CodeCode Available | 0 |
| Temporal Shift Reinforcement Learning | Sep 5, 2021 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Reinforcement Learning for Battery Energy Storage Dispatch augmented with Model-based Optimizer | Sep 2, 2021 | Deep Reinforcement LearningImitation Learning | —Unverified | 0 |
| Catastrophic Interference in Reinforcement Learning: A Solution Based on Context Division and Knowledge Distillation | Sep 1, 2021 | Deep Reinforcement LearningGeneral Reinforcement Learning | CodeCode Available | 0 |
| Learning Practically Feasible Policies for Online 3D Bin Packing | Aug 31, 2021 | 3D Bin PackingCollision Avoidance | CodeCode Available | 2 |
| WarpDrive: Extremely Fast End-to-End Deep Multi-Agent Reinforcement Learning on a GPU | Aug 31, 2021 | CPUDecision Making | CodeCode Available | 1 |
| Learning to Synthesize Programs as Interpretable and Generalizable Policies | Aug 31, 2021 | Deep Reinforcement LearningProgram Synthesis | CodeCode Available | 1 |
| Deep Reinforcement Learning at the Edge of the Statistical Precipice | Aug 30, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Communication-Computation Efficient Device-Edge Co-Inference via AutoML | Aug 30, 2021 | AutoMLDecoder | —Unverified | 0 |
| Investigating Vulnerabilities of Deep Neural Policies | Aug 30, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Path Planning for Cellular-Connected UAV: A DRL Solution with Quantum-Inspired Experience Replay | Aug 30, 2021 | Deep Reinforcement LearningDiversity | —Unverified | 0 |
| Autonomous Curiosity for Real-Time Training Onboard Robotic Agents | Aug 29, 2021 | Deep Reinforcement Learningobject-detection | —Unverified | 0 |
| A Policy Efficient Reduction Approach to Convex Constrained Deep Reinforcement Learning | Aug 29, 2021 | Deep Reinforcement LearningGeneral Reinforcement Learning | —Unverified | 0 |
| DASHA: Decentralized Autofocusing System with Hierarchical Agents | Aug 29, 2021 | Deep Reinforcement Learningobject-detection | CodeCode Available | 0 |
| Accelerating Serverless Computing by Harvesting Idle Resources | Aug 28, 2021 | Deep Reinforcement Learning | —Unverified | 0 |
| Reinforcement Learning based Condition-oriented Maintenance Scheduling for Flow Line Systems | Aug 27, 2021 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| Deep Reinforcement Learning for Wireless Resource Allocation Using Buffer State Information | Aug 27, 2021 | Deep Reinforcement LearningFairness | —Unverified | 0 |
| WAD: A Deep Reinforcement Learning Agent for Urban Autonomous Driving | Aug 27, 2021 | Atari GamesAutonomous Driving | —Unverified | 0 |
| Deep Reinforcement Learning for Dynamic Band Switch in Cellular-Connected UAV | Aug 26, 2021 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Deep Reinforcement Learning in Computer Vision: A Comprehensive Survey | Aug 25, 2021 | Deep Reinforcement LearningImage Segmentation | —Unverified | 0 |
| Responsive Regulation of Dynamic UAV Communication Networks Based on Deep Reinforcement Learning | Aug 25, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Entropy-Aware Model Initialization for Effective Exploration in Deep Reinforcement Learning | Aug 24, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |