| Principled Exploration via Optimistic Bootstrapping and Backward Induction | May 13, 2021 | Deep Reinforcement Learning, Efficient Exploration | Code Available | 0 |
| Adversarial Reinforcement Learning in Dynamic Channel Access and Power Control | May 12, 2021 | Deep Reinforcement Learning, Reinforcement Learning | Unverified | 0 |
| Spectral Normalisation for Deep Reinforcement Learning: an Optimisation Perspective | May 11, 2021 | Deep Reinforcement Learning, Reinforcement Learning | Code Available | 1 |
| Zero-Shot Reinforcement Learning on Graphs for Autonomous Exploration Under Uncertainty | May 11, 2021 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Value Iteration in Continuous Actions, States and Time | May 10, 2021 | Deep Reinforcement Learning | Code Available | 1 |
| A Deep Reinforcement Learning Approach to Audio-Based Navigation in a Multi-Speaker Environment | May 10, 2021 | Deep Reinforcement Learning, Navigate | Code Available | 0 |
| Age of Information Aware VNF Scheduling in Industrial IoT Using Deep Reinforcement Learning | May 10, 2021 | Deep Reinforcement Learning, Reinforcement Learning | Unverified | 0 |
| Generative Actor-Critic: An Off-policy Algorithm Using the Push-forward Model | May 8, 2021 | Continuous Control | Code Available | 0 |
| A parallel-network continuous quantitative trading model with GARCH and PPO | May 8, 2021 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Utilizing Skipped Frames in Action Repeats via Pseudo-Actions | May 7, 2021 | Continuous Control | Unverified | 0 |
| Deep reinforcement learning-designed radiofrequency waveform in MRI | May 7, 2021 | Deep Reinforcement Learning, Reinforcement Learning | Code Available | 1 |
| Context-Based Soft Actor Critic for Environments with Non-stationary Dynamics | May 7, 2021 | Continuous Control | Code Available | 0 |
| Meta-Learning-Based Deep Reinforcement Learning for Multiobjective Optimization Problems | May 6, 2021 | Combinatorial Optimization, Deep Reinforcement Learning | Code Available | 1 |
| Time-Aware Q-Networks: Resolving Temporal Irregularity for Deep Reinforcement Learning | May 6, 2021 | Deep Reinforcement Learning, Reinforcement Learning | Unverified | 0 |
| Solve routing problems with a residual edge-graph attention neural network | May 6, 2021 | Combinatorial Optimization, Deep Reinforcement Learning | Code Available | 1 |
| Learning Algorithms for Regenerative Stopping Problems with Applications to Shipping Consolidation in Logistics | May 5, 2021 | Deep Reinforcement Learning, Imitation Learning | Unverified | 0 |
| Safety Enhancement for Deep Reinforcement Learning in Autonomous Separation Assurance | May 5, 2021 | Data Augmentation, Deep Reinforcement Learning | Unverified | 0 |
| Deep Reinforcement Learning for Adaptive Exploration of Unknown Environments | May 4, 2021 | Deep Reinforcement Learning, Reinforcement Learning | Code Available | 1 |
| On Lottery Tickets and Minimal Task Representations in Deep Reinforcement Learning | May 4, 2021 | Behavioural cloning, Deep Reinforcement Learning | Unverified | 0 |
| Curious Exploration and Return-based Memory Restoration for Deep Reinforcement Learning | May 2, 2021 | Deep Reinforcement Learning, Reinforcement Learning | Code Available | 0 |
| Discovering Diverse Athletic Jumping Strategies | May 2, 2021 | Deep Reinforcement Learning, Diversity | Unverified | 0 |
| BACKDOORL: Backdoor Attack against Competitive Reinforcement Learning | May 2, 2021 | Atari Games, Backdoor Attack | Unverified | 0 |
| Pedestrian Collision Avoidance for Autonomous Vehicles at Unsignalized Intersection Using Deep Q-Network | May 1, 2021 | Autonomous Vehicles, Collision Avoidance | Unverified | 0 |
| Discrete-Time Mean Field Control with Environment States | Apr 30, 2021 | Deep Reinforcement Learning, Multi-agent Reinforcement Learning | Unverified | 0 |
| Emotional Contagion-Aware Deep Reinforcement Learning for Antagonistic Crowd Simulation | Apr 29, 2021 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |