| QKSA: Quantum Knowledge Seeking Agent | Jul 3, 2021 | Artificial LifeGeneral Reinforcement Learning | CodeCode Available | 0 |
| Nearest-Neighbor-based Collision Avoidance for Quadrotors via Reinforcement Learning | Apr 30, 2021 | Collision AvoidanceGeneral Reinforcement Learning | —Unverified | 0 |
| FaiR-IoT: Fairness-aware Human-in-the-Loop Reinforcement Learning for Harnessing Human Variability in Personalized IoT | Mar 30, 2021 | FairnessGeneral Reinforcement Learning | —Unverified | 0 |
| Interactive Learning from Activity Description | Feb 13, 2021 | General Reinforcement LearningGrounded language learning | CodeCode Available | 0 |
| A State Representation Dueling Network for Deep Reinforcement Learning | Dec 24, 2020 | Deep Reinforcement LearningGeneral Reinforcement Learning | —Unverified | 0 |
| Exact Reduction of Huge Action Spaces in General Reinforcement Learning | Dec 18, 2020 | BinarizationGeneral Reinforcement Learning | —Unverified | 0 |
| Reinforcement Learning of Causal Variables Using Mediation Analysis | Oct 29, 2020 | General Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Learning to Represent Action Values as a Hypergraph on the Action Vertices | Oct 28, 2020 | Atari GamesContinuous Control | CodeCode Available | 0 |
| The LoCA Regret: A Consistent Metric to Evaluate Model-Based Behavior in Reinforcement Learning | Jul 7, 2020 | General Reinforcement LearningModel-based Reinforcement Learning | CodeCode Available | 0 |
| Local and Global Explanations of Agent Behavior: Integrating Strategy Summaries with Saliency Maps | May 18, 2020 | Atari GamesDecision Making | CodeCode Available | 0 |
| Learning as Reinforcement: Applying Principles of Neuroscience for More General Reinforcement Learning Agents | Apr 20, 2020 | Decision MakingGeneral Reinforcement Learning | —Unverified | 0 |
| Student/Teacher Advising through Reward Augmentation | Feb 7, 2020 | General Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Model-Free Mean-Field Reinforcement Learning: Mean-Field MDP and Mean-Field Q-Learning | Oct 28, 2019 | General Reinforcement LearningQ-Learning | —Unverified | 0 |
| Is Deep Reinforcement Learning Really Superhuman on Atari? Leveling the playing field | Aug 13, 2019 | Atari GamesDeep Reinforcement Learning | CodeCode Available | 0 |
| Goal-Driven Sequential Data Abstraction | Jul 29, 2019 | BenchmarkingGeneral Reinforcement Learning | —Unverified | 0 |
| Compositional Transfer in Hierarchical Reinforcement Learning | Jun 26, 2019 | General Reinforcement LearningHierarchical Reinforcement Learning | —Unverified | 0 |
| Using a Logarithmic Mapping to Enable Lower Discount Factors in Reinforcement Learning | Jun 3, 2019 | General Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Variational Regret Bounds for Reinforcement Learning | May 14, 2019 | General Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Macro Action Reinforcement Learning with Sequence Disentanglement using Variational Autoencoder | Mar 22, 2019 | DisentanglementGeneral Reinforcement Learning | —Unverified | 0 |
| Differential Temporal Difference Learning | Dec 28, 2018 | General Reinforcement LearningReinforcement Learning | —Unverified | 0 |
| Integrating Reinforcement Learning to Self Training for Pulmonary Nodule Segmentation in Chest X-rays | Nov 21, 2018 | General Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Gibson Env: Real-World Perception for Embodied Agents | Aug 31, 2018 | Domain AdaptationGeneral Reinforcement Learning | CodeCode Available | 0 |
| Transferring Agent Behaviors from Videos via Motion GANs | Nov 21, 2017 | General Reinforcement LearningGenerative Adversarial Network | —Unverified | 0 |
| Reinforcement Learning of Speech Recognition System Based on Policy Gradient and Hypothesis Selection | Nov 10, 2017 | General Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Dex: Incremental Learning for Complex Environments in Deep Reinforcement Learning | Jun 19, 2017 | Continual LearningDeep Reinforcement Learning | CodeCode Available | 0 |