| Optimistic Exploration with Backward Bootstrapped Bonus for Deep Reinforcement Learning | Jan 1, 2021 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| Practical Marginalized Importance Sampling with the Successor Representation | Jan 1, 2021 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| PGPS : Coupling Policy Gradient with Population-based Search | Jan 1, 2021 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| Compute- and Memory-Efficient Reinforcement Learning with Latent Experience Replay | Jan 1, 2021 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| Intrinsically Guided Exploration in Meta Reinforcement Learning | Jan 1, 2021 | Deep Reinforcement LearningEfficient Exploration | —Unverified | 0 |
| Coordinated Multi-Agent Exploration Using Shared Goals | Jan 1, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Learning Latent Landmarks for Generalizable Planning | Jan 1, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Genetic Soft Updates for Policy Evolution in Deep Reinforcement Learning | Jan 1, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Weighted Bellman Backups for Improved Signal-to-Noise in Q-Updates | Jan 1, 2021 | Deep Reinforcement LearningQ-Learning | —Unverified | 0 |
| A Robust Fuel Optimization Strategy For Hybrid Electric Vehicles: A Deep Reinforcement Learning Based Continuous Time Design Approach | Jan 1, 2021 | Deep Reinforcement LearningManagement | —Unverified | 0 |
| Deep Q-Learning with Low Switching Cost | Jan 1, 2021 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| Deep Q Learning from Dynamic Demonstration with Behavioral Cloning | Jan 1, 2021 | Deep Reinforcement LearningOpenAI Gym | —Unverified | 0 |
| ReaPER: Improving Sample Efficiency in Model-Based Latent Imagination | Jan 1, 2021 | Deep Reinforcement Learning | —Unverified | 0 |
| Bounded Myopic Adversaries for Deep Reinforcement Learning Agents | Jan 1, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Simple Augmentation Goes a Long Way: ADRL for DNN Quantization | Jan 1, 2021 | Deep Reinforcement LearningQuantization | —Unverified | 0 |
| Regularization Matters in Policy Optimization - An Empirical Study on Continuous Control | Jan 1, 2021 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Benchmarking Multi-Agent Deep Reinforcement Learning Algorithms | Jan 1, 2021 | BenchmarkingDeep Reinforcement Learning | —Unverified | 0 |
| Playing Atari with Capsule Networks: A systematic comparison of CNN and CapsNets-based agents. | Jan 1, 2021 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Using Deep Reinforcement Learning to Train and Evaluate Instructional Sequencing Policies for an Intelligent Tutoring System | Jan 1, 2021 | Deep Reinforcement LearningKnowledge Tracing | —Unverified | 0 |
| Interpretable Reinforcement Learning With Neural Symbolic Logic | Jan 1, 2021 | Deep Reinforcement LearningHierarchical Reinforcement Learning | —Unverified | 0 |
| Factored Action Spaces in Deep Reinforcement Learning | Jan 1, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Monotonic Robust Policy Optimization with Model Discrepancy | Jan 1, 2021 | Deep Reinforcement LearningDiversity | —Unverified | 0 |
| Autonomous Maintenance in IoT Networks via AoI-driven Deep Reinforcement Learning | Dec 31, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Curriculum-based Deep Reinforcement Learning for Quantum Control | Dec 31, 2020 | Deep Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Robotic Grasping of Fully-Occluded Objects using RF Perception | Dec 31, 2020 | Deep Reinforcement LearningEfficient Exploration | —Unverified | 0 |