| Dropout Strategy in Reinforcement Learning: Limiting the Surrogate Objective Variance in Policy Optimization Methods | Oct 31, 2023 | General Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| ReMax: A Simple, Effective, and Efficient Reinforcement Learning Method for Aligning Large Language Models | Oct 16, 2023 | General Reinforcement LearningGPU | CodeCode Available | 2 |
| Discovering General Reinforcement Learning Algorithms with Adversarial Environment Design | Oct 4, 2023 | Deep Reinforcement LearningGeneral Reinforcement Learning | CodeCode Available | 1 |
| Image Transformation Sequence Retrieval with General Reinforcement Learning | Jul 13, 2023 | General Reinforcement LearningModel-based Reinforcement Learning | —Unverified | 0 |
| L-SA: Learning Under-Explored Targets in Multi-Target Reinforcement Learning | May 23, 2023 | General Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Computably Continuous Reinforcement-Learning Objectives are PAC-learnable | Mar 9, 2023 | General Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Policy Mirror Descent Inherently Explores Action Space | Mar 8, 2023 | Efficient ExplorationGeneral Reinforcement Learning | —Unverified | 0 |
| Learning to Backdoor Federated Learning | Mar 6, 2023 | Backdoor AttackFederated Learning | CodeCode Available | 0 |
| Computational Dualism and Objective Superintelligence | Feb 2, 2023 | General Reinforcement Learning | —Unverified | 0 |
| Accuracy-Guaranteed Collaborative DNN Inference in Industrial IoT via Deep Reinforcement Learning | Dec 31, 2022 | Deep Reinforcement LearningEdge-computing | —Unverified | 0 |