| On the Emergence of Whole-body Strategies from Humanoid Robot Push-recovery Learning | Apr 29, 2021 | Deep Reinforcement LearningHumanoid Control | —Unverified | 0 |
| Hypernetwork Dismantling via Deep Reinforcement Learning | Apr 29, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Deep Reinforcement Trading with Predictable Returns | Apr 29, 2021 | ClusteringDeep Reinforcement Learning | CodeCode Available | 1 |
| Adapting to Reward Progressivity via Spectral Reinforcement Learning | Apr 29, 2021 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| End-to-End Intersection Handling using Multi-Agent Deep Reinforcement Learning | Apr 28, 2021 | Deep Reinforcement LearningNavigate | —Unverified | 0 |
| SocialAI 0.1: Towards a Benchmark to Stimulate Research on Socio-Cognitive Abilities in Deep Reinforcement Learning Agents | Apr 27, 2021 | Deep Reinforcement Learning | —Unverified | 0 |
| A Scalable and Reproducible System-on-Chip Simulation for Reinforcement Learning | Apr 27, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Controlling earthquake-like instabilities using artificial intelligence | Apr 27, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| End-to-end grasping policies for human-in-the-loop robots via deep reinforcement learning | Apr 26, 2021 | Deep Reinforcement LearningElectromyography (EMG) | CodeCode Available | 0 |
| Computational Performance of Deep Reinforcement Learning to find Nash Equilibria | Apr 26, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Efficient Hyperparameter Optimization for Physics-based Character Animation | Apr 26, 2021 | Bayesian OptimizationDeep Reinforcement Learning | —Unverified | 0 |
| Development of a Soft Actor Critic Deep Reinforcement Learning Approach for Harnessing Energy Flexibility in a Large Office Building | Apr 25, 2021 | Deep Reinforcement Learning | —Unverified | 0 |
| A Deep Reinforcement Learning Approach for the Meal Delivery Problem | Apr 24, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Graph Neural Network Reinforcement Learning for Autonomous Mobility-on-Demand Systems | Apr 23, 2021 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| Formula RL: Deep Reinforcement Learning for Autonomous Racing using Telemetry Data | Apr 22, 2021 | Autonomous RacingDeep Reinforcement Learning | —Unverified | 0 |
| XAI-N: Sensor-based Robot Navigation using Expert Policies and Decision Trees | Apr 22, 2021 | Deep Reinforcement LearningExplainable Artificial Intelligence (XAI) | CodeCode Available | 0 |
| Model-aided Deep Reinforcement Learning for Sample-efficient UAV Trajectory Design in IoT Networks | Apr 21, 2021 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| Tackling Variabilities in Autonomous Driving | Apr 21, 2021 | Autonomous DrivingDeep Reinforcement Learning | —Unverified | 0 |
| DRL: Deep Reinforcement Learning for Intelligent Robot Control -- Concept, Literature, and Future | Apr 20, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Scalable Synthesis of Verified Controllers in Deep Reinforcement Learning | Apr 20, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Network-wide traffic signal control optimization using a multi-agent deep reinforcement learning | Apr 20, 2021 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | —Unverified | 0 |
| GDDR: GNN-based Data-Driven Routing | Apr 20, 2021 | Deep Reinforcement LearningGraph Neural Network | —Unverified | 0 |
| Adaptive learning for financial markets mixing model-based and model-free RL for volatility targeting | Apr 19, 2021 | Deep Reinforcement Learningmodel | —Unverified | 0 |
| Deep Reinforcement Learning in a Monetary Model | Apr 19, 2021 | Deep Reinforcement Learningmodel | —Unverified | 0 |
| Probabilistic Mixture-of-Experts for Efficient Deep Reinforcement Learning | Apr 19, 2021 | Deep Reinforcement LearningMixture-of-Experts | CodeCode Available | 0 |
| Quick Learner Automated Vehicle Adapting its Roadmanship to Varying Traffic Cultures with Meta Reinforcement Learning | Apr 18, 2021 | Deep Reinforcement LearningMeta Reinforcement Learning | —Unverified | 0 |
| Learning on a Budget via Teacher Imitation | Apr 17, 2021 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Action Advising with Advice Imitation in Deep Reinforcement Learning | Apr 17, 2021 | Atari GamesBehavioural cloning | CodeCode Available | 0 |
| MT-Opt: Continuous Multi-Task Robotic Reinforcement Learning at Scale | Apr 16, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Joint Attention for Multi-Agent Coordination and Social Learning | Apr 15, 2021 | Deep Reinforcement LearningInductive Bias | —Unverified | 0 |
| Quantum Architecture Search via Deep Reinforcement Learning | Apr 15, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| GridToPix: Training Embodied Agents with Minimal Supervision | Apr 14, 2021 | Deep Reinforcement LearningPointGoal Navigation | —Unverified | 0 |
| Decomposed Soft Actor-Critic Method for Cooperative Multi-Agent Reinforcement Learning | Apr 14, 2021 | counterfactualDeep Reinforcement Learning | CodeCode Available | 1 |
| GAN-Based Interactive Reinforcement Learning from Demonstration and Human Evaluative Feedback | Apr 14, 2021 | Deep Reinforcement LearningImitation Learning | —Unverified | 0 |
| Visual Comfort Aware-Reinforcement Learning for Depth Adjustment of Stereoscopic 3D Images | Apr 14, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Optimizing the Long-Term Average Reward for Continuing MDPs: A Technical Report | Apr 13, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Thief, Beware of What Get You There: Towards Understanding Model Extraction Attack | Apr 13, 2021 | Deep Reinforcement LearningModel extraction | —Unverified | 0 |
| Dynamic Matching Markets in Power Grid: Concepts and Solution using Deep Reinforcement Learning | Apr 12, 2021 | Deep Reinforcement LearningDiversity | —Unverified | 0 |
| Deep Reinforcement Learning Based Controller for Active Heave Compensation | Apr 12, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Learn Goal-Conditioned Policy with Intrinsic Motivation for Deep Reinforcement Learning | Apr 11, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| The Atari Data Scraper | Apr 11, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Symmetry reduction for deep reinforcement learning active control of chaotic spatiotemporal dynamics | Apr 9, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Behavior-Guided Actor-Critic: Improving Exploration via Learning Policy Behavior Representation for Deep Reinforcement Learning | Apr 9, 2021 | Deep Reinforcement LearningEfficient Exploration | CodeCode Available | 0 |
| A Reinforcement-Learning-Based Energy-Efficient Framework for Multi-Task Video Analytics Pipeline | Apr 9, 2021 | Deep Reinforcement LearningInstance Segmentation | —Unverified | 0 |
| Jamming-Resilient Path Planning for Multiple UAVs via Deep Reinforcement Learning | Apr 9, 2021 | Collision AvoidanceDecision Making | —Unverified | 0 |
| Arena-Rosnav: Towards Deployment of Deep-Reinforcement-Learning-Based Obstacle Avoidance into Conventional Autonomous Navigation Systems | Apr 8, 2021 | Autonomous NavigationDeep Reinforcement Learning | CodeCode Available | 1 |
| Connecting Deep-Reinforcement-Learning-based Obstacle Avoidance with Conventional Global Planners using Waypoint Generators | Apr 8, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| A Reinforcement Learning Environment For Job-Shop Scheduling | Apr 8, 2021 | Combinatorial OptimizationDeep Reinforcement Learning | CodeCode Available | 1 |
| Improving Robustness of Deep Reinforcement Learning Agents: Environment Attack based on the Critic Network | Apr 7, 2021 | Adversarial AttackDeep Reinforcement Learning | CodeCode Available | 0 |
| Risk-Conditioned Distributional Soft Actor-Critic for Risk-Sensitive Navigation | Apr 7, 2021 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |