| Quick Learner Automated Vehicle Adapting its Roadmanship to Varying Traffic Cultures with Meta Reinforcement Learning | Apr 18, 2021 | Deep Reinforcement LearningMeta Reinforcement Learning | —Unverified | 0 |
| Learning on a Budget via Teacher Imitation | Apr 17, 2021 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Action Advising with Advice Imitation in Deep Reinforcement Learning | Apr 17, 2021 | Atari GamesBehavioural cloning | CodeCode Available | 0 |
| MT-Opt: Continuous Multi-Task Robotic Reinforcement Learning at Scale | Apr 16, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Joint Attention for Multi-Agent Coordination and Social Learning | Apr 15, 2021 | Deep Reinforcement LearningInductive Bias | —Unverified | 0 |
| Quantum Architecture Search via Deep Reinforcement Learning | Apr 15, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| GridToPix: Training Embodied Agents with Minimal Supervision | Apr 14, 2021 | Deep Reinforcement LearningPointGoal Navigation | —Unverified | 0 |
| Decomposed Soft Actor-Critic Method for Cooperative Multi-Agent Reinforcement Learning | Apr 14, 2021 | counterfactualDeep Reinforcement Learning | CodeCode Available | 1 |
| GAN-Based Interactive Reinforcement Learning from Demonstration and Human Evaluative Feedback | Apr 14, 2021 | Deep Reinforcement LearningImitation Learning | —Unverified | 0 |
| Visual Comfort Aware-Reinforcement Learning for Depth Adjustment of Stereoscopic 3D Images | Apr 14, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Optimizing the Long-Term Average Reward for Continuing MDPs: A Technical Report | Apr 13, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Thief, Beware of What Get You There: Towards Understanding Model Extraction Attack | Apr 13, 2021 | Deep Reinforcement LearningModel extraction | —Unverified | 0 |
| Dynamic Matching Markets in Power Grid: Concepts and Solution using Deep Reinforcement Learning | Apr 12, 2021 | Deep Reinforcement LearningDiversity | —Unverified | 0 |
| Deep Reinforcement Learning Based Controller for Active Heave Compensation | Apr 12, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Learn Goal-Conditioned Policy with Intrinsic Motivation for Deep Reinforcement Learning | Apr 11, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| The Atari Data Scraper | Apr 11, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Symmetry reduction for deep reinforcement learning active control of chaotic spatiotemporal dynamics | Apr 9, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Behavior-Guided Actor-Critic: Improving Exploration via Learning Policy Behavior Representation for Deep Reinforcement Learning | Apr 9, 2021 | Deep Reinforcement LearningEfficient Exploration | CodeCode Available | 0 |
| A Reinforcement-Learning-Based Energy-Efficient Framework for Multi-Task Video Analytics Pipeline | Apr 9, 2021 | Deep Reinforcement LearningInstance Segmentation | —Unverified | 0 |
| Jamming-Resilient Path Planning for Multiple UAVs via Deep Reinforcement Learning | Apr 9, 2021 | Collision AvoidanceDecision Making | —Unverified | 0 |
| Arena-Rosnav: Towards Deployment of Deep-Reinforcement-Learning-Based Obstacle Avoidance into Conventional Autonomous Navigation Systems | Apr 8, 2021 | Autonomous NavigationDeep Reinforcement Learning | CodeCode Available | 1 |
| Connecting Deep-Reinforcement-Learning-based Obstacle Avoidance with Conventional Global Planners using Waypoint Generators | Apr 8, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| A Reinforcement Learning Environment For Job-Shop Scheduling | Apr 8, 2021 | Combinatorial OptimizationDeep Reinforcement Learning | CodeCode Available | 1 |
| Improving Robustness of Deep Reinforcement Learning Agents: Environment Attack based on the Critic Network | Apr 7, 2021 | Adversarial AttackDeep Reinforcement Learning | CodeCode Available | 0 |
| Risk-Conditioned Distributional Soft Actor-Critic for Risk-Sensitive Navigation | Apr 7, 2021 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |