| High-order Regularization for Machine Learning and Learning-based Control | May 13, 2025 | General Reinforcement Learning | Unverified | 0 |
| Towards More Efficient, Robust, Instance-adaptive, and Generalizable Sequential Decision making | Apr 12, 2025 | Decision Making, Decision Making Under Uncertainty | Unverified | 0 |
| The Problem of Social Cost in Multi-Agent General Reinforcement Learning: Survey and Synthesis | Dec 3, 2024 | General Reinforcement Learning, Multi-agent Reinforcement Learning | Unverified | 0 |
| Hypercube Policy Regularization Framework for Offline Reinforcement Learning | Nov 7, 2024 | D4RL, General Reinforcement Learning | Code Available | 0 |
| Reinforcement Learning: Tutorial and Survey | Jul 18, 2024 | Deep Reinforcement Learning, General Reinforcement Learning | Unverified | 0 |
| Dynamic Knowledge Injection for AIXI Agents | Dec 18, 2023 | General Reinforcement Learning | Unverified | 0 |
| Dropout Strategy in Reinforcement Learning: Limiting the Surrogate Objective Variance in Policy Optimization Methods | Oct 31, 2023 | General Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Image Transformation Sequence Retrieval with General Reinforcement Learning | Jul 13, 2023 | General Reinforcement Learning, Model-based Reinforcement Learning | Unverified | 0 |
| L-SA: Learning Under-Explored Targets in Multi-Target Reinforcement Learning | May 23, 2023 | General Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Computably Continuous Reinforcement-Learning Objectives are PAC-learnable | Mar 9, 2023 | General Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Policy Mirror Descent Inherently Explores Action Space | Mar 8, 2023 | Efficient Exploration, General Reinforcement Learning | Unverified | 0 |
| Learning to Backdoor Federated Learning | Mar 6, 2023 | Backdoor Attack, Federated Learning | Code Available | 0 |
| Computational Dualism and Objective Superintelligence | Feb 2, 2023 | General Reinforcement Learning | Unverified | 0 |
| Accuracy-Guaranteed Collaborative DNN Inference in Industrial IoT via Deep Reinforcement Learning | Dec 31, 2022 | Deep Reinforcement Learning, Edge-computing | Unverified | 0 |
| AcceRL: Policy Acceleration Framework for Deep Reinforcement Learning | Nov 28, 2022 | Decision Making, Deep Reinforcement Learning | Unverified | 0 |
| Computable Artificial General Intelligence | May 21, 2022 | General Reinforcement Learning, Philosophy | Unverified | 0 |
| D3PG: Dirichlet DDPG for Task Partitioning and Offloading With Constrained Hybrid Action Space in Mobile-Edge Computing | Apr 14, 2022 | Deep Reinforcement Learning, Edge-computing | Unverified | 0 |
| Doubly-Robust Estimation for Correcting Position-Bias in Click Feedback for Unbiased Learning to Rank | Mar 31, 2022 | counterfactual, General Reinforcement Learning | Code Available | 0 |
| Abstractions of General Reinforcement Learning | Dec 26, 2021 | General Reinforcement Learning, reinforcement-learning | Unverified | 0 |
| Reducing Planning Complexity of General Reinforcement Learning with Non-Markovian Abstractions | Dec 26, 2021 | Decision Making, General Reinforcement Learning | Unverified | 0 |
| Superior Performance with Diversified Strategic Control in FPS Games Using General Reinforcement Learning | Sep 29, 2021 | FPS Games, General Reinforcement Learning | Unverified | 0 |
| ^2-exploration for Reinforcement Learning | Sep 29, 2021 | General Reinforcement Learning, Q-Learning | Unverified | 0 |
| Catastrophic Interference in Reinforcement Learning: A Solution Based on Context Division and Knowledge Distillation | Sep 1, 2021 | Deep Reinforcement Learning, General Reinforcement Learning | Code Available | 0 |
| A Policy Efficient Reduction Approach to Convex Constrained Deep Reinforcement Learning | Aug 29, 2021 | Deep Reinforcement Learning, General Reinforcement Learning | Unverified | 0 |
| Low-Resource Machine Translation based on Asynchronous Dynamic Programming | Aug 1, 2021 | General Reinforcement Learning, Low Resource Neural Machine Translation | Unverified | 0 |