| Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution | Sep 29, 2020 | General Reinforcement LearningMinecraft | CodeCode Available | 1 | 5 |
| Intelligent Trading Systems: A Sentiment-Aware Reinforcement Learning Approach | Nov 14, 2021 | Algorithmic TradingGeneral Reinforcement Learning | CodeCode Available | 1 | 5 |
| NOVER: Incentive Training for Language Models via Verifier-Free Reinforcement Learning | May 21, 2025 | General Reinforcement LearningLogical Reasoning | CodeCode Available | 1 | 5 |
| Discovering General Reinforcement Learning Algorithms with Adversarial Environment Design | Oct 4, 2023 | Deep Reinforcement LearningGeneral Reinforcement Learning | CodeCode Available | 1 | 5 |
| Learning to Backdoor Federated Learning | Mar 6, 2023 | Backdoor AttackFederated Learning | CodeCode Available | 0 | 5 |
| AIXIjs: A Software Demo for General Reinforcement Learning | May 22, 2017 | General Reinforcement LearningOpenAI Gym | CodeCode Available | 0 | 5 |
| Learning to Represent Action Values as a Hypergraph on the Action Vertices | Oct 28, 2020 | Atari GamesContinuous Control | CodeCode Available | 0 | 5 |
| Catastrophic Interference in Reinforcement Learning: A Solution Based on Context Division and Knowledge Distillation | Sep 1, 2021 | Deep Reinforcement LearningGeneral Reinforcement Learning | CodeCode Available | 0 | 5 |
| Doubly-Robust Estimation for Correcting Position-Bias in Click Feedback for Unbiased Learning to Rank | Mar 31, 2022 | counterfactualGeneral Reinforcement Learning | CodeCode Available | 0 | 5 |
| Interactive Learning from Activity Description | Feb 13, 2021 | General Reinforcement LearningGrounded language learning | CodeCode Available | 0 | 5 |