| Policy Mirror Descent Inherently Explores Action Space | Mar 8, 2023 | Efficient Exploration, General Reinforcement Learning | Unverified | 0 |
| Reducing Planning Complexity of General Reinforcement Learning with Non-Markovian Abstractions | Dec 26, 2021 | Decision Making, General Reinforcement Learning | Unverified | 0 |
| Compositional Transfer in Hierarchical Reinforcement Learning | Jun 26, 2019 | General Reinforcement Learning, Hierarchical Reinforcement Learning | Unverified | 0 |
| Reinforcement Learning of Speech Recognition System Based on Policy Gradient and Hypothesis Selection | Nov 10, 2017 | General Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Reinforcement Learning: Tutorial and Survey | Jul 18, 2024 | Deep Reinforcement Learning, General Reinforcement Learning | Unverified | 0 |
| Reinforcement Learning via AIXI Approximation | Jul 13, 2010 | General Reinforcement Learning, Open-Ended Question Answering | Unverified | 0 |
| ^2-exploration for Reinforcement Learning | Sep 29, 2021 | General Reinforcement Learning, Q-Learning | Unverified | 0 |
| Self-Modification of Policy and Utility Function in Rational Agents | May 10, 2016 | General Reinforcement Learning, Reinforcement Learning | Unverified | 0 |
| Student/Teacher Advising through Reward Augmentation | Feb 7, 2020 | General Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Superior Performance with Diversified Strategic Control in FPS Games Using General Reinforcement Learning | Sep 29, 2021 | FPS Games, General Reinforcement Learning | Unverified | 0 |
| The Problem of Social Cost in Multi-Agent General Reinforcement Learning: Survey and Synthesis | Dec 3, 2024 | General Reinforcement Learning, Multi-agent Reinforcement Learning | Unverified | 0 |
| The Sample-Complexity of General Reinforcement Learning | Aug 22, 2013 | General Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Towards More Efficient, Robust, Instance-adaptive, and Generalizable Sequential Decision making | Apr 12, 2025 | Decision Making, Decision Making Under Uncertainty | Unverified | 0 |
| Transferring Agent Behaviors from Videos via Motion GANs | Nov 21, 2017 | General Reinforcement Learning, Generative Adversarial Network | Unverified | 0 |
| Low-Resource Machine Translation based on Asynchronous Dynamic Programming | Aug 1, 2021 | General Reinforcement Learning, Low Resource Neural Machine Translation | Unverified | 0 |
| L-SA: Learning Under-Explored Targets in Multi-Target Reinforcement Learning | May 23, 2023 | General Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Macro Action Reinforcement Learning with Sequence Disentanglement using Variational Autoencoder | Mar 22, 2019 | Disentanglement, General Reinforcement Learning | Unverified | 0 |
| Model-Free Mean-Field Reinforcement Learning: Mean-Field MDP and Mean-Field Q-Learning | Oct 28, 2019 | General Reinforcement Learning, Q-Learning | Unverified | 0 |
| Learning to Backdoor Federated Learning | Mar 6, 2023 | Backdoor Attack, Federated Learning | Code Available | 0 |
| Is Deep Reinforcement Learning Really Superhuman on Atari? Leveling the playing field | Aug 13, 2019 | Atari Games, Deep Reinforcement Learning | Code Available | 0 |
| Learning to Represent Action Values as a Hypergraph on the Action Vertices | Oct 28, 2020 | Atari Games, Continuous Control | Code Available | 0 |
| Local and Global Explanations of Agent Behavior: Integrating Strategy Summaries with Saliency Maps | May 18, 2020 | Atari Games, Decision Making | Code Available | 0 |
| Doubly-Robust Estimation for Correcting Position-Bias in Click Feedback for Unbiased Learning to Rank | Mar 31, 2022 | counterfactual, General Reinforcement Learning | Code Available | 0 |
| Catastrophic Interference in Reinforcement Learning: A Solution Based on Context Division and Knowledge Distillation | Sep 1, 2021 | Deep Reinforcement Learning, General Reinforcement Learning | Code Available | 0 |
| Using a Logarithmic Mapping to Enable Lower Discount Factors in Reinforcement Learning | Jun 3, 2019 | General Reinforcement Learning, reinforcement-learning | Code Available | 0 |
| Interactive Learning from Activity Description | Feb 13, 2021 | General Reinforcement Learning, Grounded language learning | Code Available | 0 |
| The LoCA Regret: A Consistent Metric to Evaluate Model-Based Behavior in Reinforcement Learning | Jul 7, 2020 | General Reinforcement Learning, Model-based Reinforcement Learning | Code Available | 0 |
| A Monte Carlo AIXI Approximation | Sep 4, 2009 | General Reinforcement Learning, Open-Ended Question Answering | Code Available | 0 |
| Hypercube Policy Regularization Framework for Offline Reinforcement Learning | Nov 7, 2024 | D4RL, General Reinforcement Learning | Code Available | 0 |
| Gibson Env: Real-World Perception for Embodied Agents | Aug 31, 2018 | Domain Adaptation, General Reinforcement Learning | Code Available | 0 |
| AIXIjs: A Software Demo for General Reinforcement Learning | May 22, 2017 | General Reinforcement Learning, OpenAI Gym | Code Available | 0 |
| Dex: Incremental Learning for Complex Environments in Deep Reinforcement Learning | Jun 19, 2017 | Continual Learning, Deep Reinforcement Learning | Code Available | 0 |
| QKSA: Quantum Knowledge Seeking Agent | Jul 3, 2021 | Artificial Life, General Reinforcement Learning | Code Available | 0 |
| Generalised Discount Functions applied to a Monte-Carlo AImu Implementation | Mar 3, 2017 | General Reinforcement Learning, reinforcement-learning | Code Available | 0 |