| Q-learning with Language Model for Edit-based Unsupervised Summarization | Oct 9, 2020 | Abstractive Text SummarizationDecoder | CodeCode Available | 1 |
| EpidemiOptim: A Toolbox for the Optimization of Control Policies in Epidemiological Models | Oct 9, 2020 | Deep Reinforcement LearningEpidemiology | CodeCode Available | 1 |
| Energy-based Surprise Minimization for Multi-Agent Value Factorization | Sep 16, 2020 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 1 |
| Deep Active Inference for Partially Observable MDPs | Sep 8, 2020 | Deep Reinforcement LearningQ-Learning | CodeCode Available | 1 |
| Table2Charts: Recommending Charts by Learning Shared Table Representations | Aug 24, 2020 | Q-LearningRecommendation Systems | CodeCode Available | 1 |
| Robust Deep Reinforcement Learning through Adversarial Loss | Aug 5, 2020 | Adversarial AttackAtari Games | CodeCode Available | 1 |
| Deep Inverse Q-learning with Constraints | Aug 4, 2020 | Q-Learning | CodeCode Available | 1 |
| QPLEX: Duplex Dueling Multi-Agent Q-Learning | Aug 3, 2020 | Decision MakingMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| SUNRISE: A Simple Unified Framework for Ensemble Learning in Deep Reinforcement Learning | Jul 9, 2020 | Deep Reinforcement LearningDiversity | CodeCode Available | 1 |
| Neural Interactive Collaborative Filtering | Jul 4, 2020 | Collaborative FilteringMeta-Learning | CodeCode Available | 1 |
| Reward Machines for Cooperative Multi-Agent Reinforcement Learning | Jul 3, 2020 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 1 |
| Gradient Temporal-Difference Learning with Regularized Corrections | Jul 1, 2020 | Q-Learning | CodeCode Available | 1 |
| Image Classification by Reinforcement Learning with Two-State Q-Learning | Jun 28, 2020 | ClassificationGeneral Classification | CodeCode Available | 1 |
| Weighted QMIX: Expanding Monotonic Value Function Factorisation for Deep Multi-Agent Reinforcement Learning | Jun 18, 2020 | Multi-agent Reinforcement LearningQ-Learning | CodeCode Available | 1 |
| Semantic Visual Navigation by Watching YouTube Videos | Jun 17, 2020 | Q-LearningVisual Navigation | CodeCode Available | 1 |
| Conservative Q-Learning for Offline Reinforcement Learning | Jun 8, 2020 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Multi-Agent Determinantal Q-Learning | Jun 2, 2020 | Q-Learning | CodeCode Available | 1 |
| Modeling Penetration Testing with Reinforcement Learning Using Capture-the-Flag Challenges: Trade-offs between Model-free Learning and A Priori Knowledge | May 26, 2020 | Q-Learningreinforcement-learning | CodeCode Available | 1 |
| Spatial Action Maps for Mobile Manipulation | Apr 20, 2020 | Q-LearningValue prediction | CodeCode Available | 1 |
| Using Deep Reinforcement Learning Methods for Autonomous Vessels in 2D Environments | Mar 23, 2020 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| FlapAI Bird: Training an Agent to Play Flappy Bird Using Reinforcement Learning Techniques | Mar 21, 2020 | Q-Learningreinforcement-learning | CodeCode Available | 1 |
| DisCor: Corrective Feedback in Reinforcement Learning via Distribution Correction | Mar 16, 2020 | Deep Reinforcement LearningMeta-Learning | CodeCode Available | 1 |
| FACMAC: Factored Multi-Agent Centralised Policy Gradients | Mar 14, 2020 | MuJoCoMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Optimistic Exploration even with a Pessimistic Initialisation | Feb 26, 2020 | Efficient ExplorationQ-Learning | CodeCode Available | 1 |
| Maxmin Q-learning: Controlling the Estimation Bias of Q-learning | Feb 16, 2020 | Q-Learning | CodeCode Available | 1 |