| Goal Misgeneralization in Deep Reinforcement Learning | May 28, 2021 | Deep Reinforcement LearningNavigate | CodeCode Available | 1 |
| Transferable Deep Reinforcement Learning Framework for Autonomous Vehicles with Joint Radar-Data Communications | May 28, 2021 | Autonomous VehiclesDeep Reinforcement Learning | —Unverified | 0 |
| Branching Dueling Q-Network Based Online Scheduling of a Microgrid With Distributed Energy Storage Systems | May 27, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Context-aware taxi dispatching at city-scale using deep reinforcement learning | May 26, 2021 | Action GenerationDeep Reinforcement Learning | —Unverified | 0 |
| Deep Reinforcement Learning for Radio Resource Allocation and Management in Next Generation Heterogeneous Wireless Networks: A Survey | May 25, 2021 | Deep Reinforcement LearningManagement | —Unverified | 0 |
| Towards Scalable Verification of Deep Reinforcement Learning | May 25, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Robust Value Iteration for Continuous Control Tasks | May 25, 2021 | continuous-controlContinuous Control | CodeCode Available | 1 |
| KnowSR: Knowledge Sharing among Homogeneous Agents in Multi-agent Reinforcement Learning | May 25, 2021 | Deep Reinforcement LearningKnowledge Distillation | —Unverified | 0 |
| Interpretable UAV Collision Avoidance using Deep Reinforcement Learning | May 25, 2021 | Collision AvoidanceDeep Reinforcement Learning | —Unverified | 0 |
| IGO-QNN: Quantum Neural Network Architecture for Inductive Grover Oracularization | May 25, 2021 | Deep Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| Safe Model-based Off-policy Reinforcement Learning for Eco-Driving in Connected and Automated Hybrid Electric Vehicles | May 25, 2021 | Deep Reinforcement LearningModel-based Reinforcement Learning | —Unverified | 0 |
| Attention-based Reinforcement Learning for Real-Time UAV Semantic Communication | May 22, 2021 | Deep Reinforcement LearningGraph Attention | —Unverified | 0 |
| Gym-μRTS: Toward Affordable Full Game Real-time Strategy Games Research with Deep Reinforcement Learning | May 21, 2021 | Deep Reinforcement LearningGPU | CodeCode Available | 1 |
| Multi-Agent Deep Reinforcement Learning using Attentive Graph Neural Architectures for Real-Time Strategy Games | May 21, 2021 | Deep Reinforcement LearningGraph Attention | —Unverified | 0 |
| Towards a Sample Efficient Reinforcement Learning Pipeline for Vision Based Robotics | May 20, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| A Stochastic Composite Augmented Lagrangian Method For Reinforcement Learning | May 20, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Deep Reinforcement Learning for Optimal Stopping with Application in Financial Engineering | May 19, 2021 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 0 |
| Reinforcement Learning Assisted Oxygen Therapy for COVID-19 Patients Under Intensive Care | May 19, 2021 | Deep Reinforcement LearningManagement | —Unverified | 0 |
| Robo-Advising: Enhancing Investment with Inverse Optimization and Deep Reinforcement Learning | May 19, 2021 | Deep Reinforcement LearningManagement | —Unverified | 0 |
| Application of deep reinforcement learning for Indian stock trading automation | May 18, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Online Multimodal Transportation Planning using Deep Reinforcement Learning | May 18, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Mean Field Games Flock! The Reinforcement Learning Way | May 17, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| A Heuristically Assisted Deep Reinforcement Learning Approach for Network Slice Placement | May 14, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Reinforcement Learning Based Safe Decision Making for Highway Autonomous Driving | May 13, 2021 | Autonomous DrivingAutonomous Navigation | —Unverified | 0 |
| Adaptive Warm-Start MCTS in AlphaZero-like Deep Reinforcement Learning | May 13, 2021 | Board GamesDeep Reinforcement Learning | —Unverified | 0 |
| Principled Exploration via Optimistic Bootstrapping and Backward Induction | May 13, 2021 | Deep Reinforcement LearningEfficient Exploration | CodeCode Available | 0 |
| Adversarial Reinforcement Learning in Dynamic Channel Access and Power Control | May 12, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Spectral Normalisation for Deep Reinforcement Learning: an Optimisation Perspective | May 11, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Zero-Shot Reinforcement Learning on Graphs for Autonomous Exploration Under Uncertainty | May 11, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Value Iteration in Continuous Actions, States and Time | May 10, 2021 | Deep Reinforcement Learning | CodeCode Available | 1 |
| A Deep Reinforcement Learning Approach to Audio-Based Navigation in a Multi-Speaker Environment | May 10, 2021 | Deep Reinforcement LearningNavigate | CodeCode Available | 0 |
| Age of Information Aware VNF Scheduling in Industrial IoT Using Deep Reinforcement Learning | May 10, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Generative Actor-Critic: An Off-policy Algorithm Using the Push-forward Model | May 8, 2021 | continuous-controlContinuous Control | CodeCode Available | 0 |
| A parallel-network continuous quantitative trading model with GARCH and PPO | May 8, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |
| Utilizing Skipped Frames in Action Repeats via Pseudo-Actions | May 7, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Deep reinforcement learning-designed radiofrequency waveform in MRI | May 7, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| Context-Based Soft Actor Critic for Environments with Non-stationary Dynamics | May 7, 2021 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Meta-Learning-Based Deep Reinforcement Learning for Multiobjective Optimization Problems | May 6, 2021 | Combinatorial OptimizationDeep Reinforcement Learning | CodeCode Available | 1 |
| Time-Aware Q-Networks: Resolving Temporal Irregularity for Deep Reinforcement Learning | May 6, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Solve routing problems with a residual edge-graph attention neural network | May 6, 2021 | Combinatorial OptimizationDeep Reinforcement Learning | CodeCode Available | 1 |
| Learning Algorithms for Regenerative Stopping Problems with Applications to Shipping Consolidation in Logistics | May 5, 2021 | Deep Reinforcement LearningImitation Learning | —Unverified | 0 |
| Safety Enhancement for Deep Reinforcement Learning in Autonomous Separation Assurance | May 5, 2021 | Data AugmentationDeep Reinforcement Learning | —Unverified | 0 |
| Deep Reinforcement Learning for Adaptive Exploration of Unknown Environments | May 4, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| On Lottery Tickets and Minimal Task Representations in Deep Reinforcement Learning | May 4, 2021 | Behavioural cloningDeep Reinforcement Learning | —Unverified | 0 |
| Curious Exploration and Return-based Memory Restoration for Deep Reinforcement Learning | May 2, 2021 | Deep Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Discovering Diverse Athletic Jumping Strategies | May 2, 2021 | Deep Reinforcement LearningDiversity | —Unverified | 0 |
| BACKDOORL: Backdoor Attack against Competitive Reinforcement Learning | May 2, 2021 | Atari GamesBackdoor Attack | —Unverified | 0 |
| Pedestrian Collision Avoidance for Autonomous Vehicles at Unsignalized Intersection Using Deep Q-Network | May 1, 2021 | Autonomous VehiclesCollision Avoidance | —Unverified | 0 |
| Discrete-Time Mean Field Control with Environment States | Apr 30, 2021 | Deep Reinforcement LearningMulti-agent Reinforcement Learning | —Unverified | 0 |
| Emotional Contagion-Aware Deep Reinforcement Learning for Antagonistic Crowd Simulation | Apr 29, 2021 | Decision MakingDeep Reinforcement Learning | —Unverified | 0 |