| Dynamic Algorithm Configuration: Foundation of a New Meta-Algorithmic Framework | Jun 1, 2020 | General Reinforcement Learning | CodeCode Available | 1 |
| Learning Exploration Policies for Navigation | Mar 5, 2019 | Efficient ExplorationGeneral Reinforcement Learning | CodeCode Available | 1 |
| Counterfactual Data Augmentation using Locally Factored Dynamics | Jul 6, 2020 | counterfactualData Augmentation | CodeCode Available | 1 |
| NOVER: Incentive Training for Language Models via Verifier-Free Reinforcement Learning | May 21, 2025 | General Reinforcement LearningLogical Reasoning | CodeCode Available | 1 |
| Align-RUDDER: Learning From Few Demonstrations by Reward Redistribution | Sep 29, 2020 | General Reinforcement LearningMinecraft | CodeCode Available | 1 |
| DeFIX: Detecting and Fixing Failure Scenarios with Reinforcement Learning in Imitation Learning Based Autonomous Driving | Oct 29, 2022 | Autonomous DrivingCARLA MAP Leaderboard | CodeCode Available | 1 |
| Discovering General Reinforcement Learning Algorithms with Adversarial Environment Design | Oct 4, 2023 | Deep Reinforcement LearningGeneral Reinforcement Learning | CodeCode Available | 1 |
| Data-Efficient Reinforcement Learning with Self-Predictive Representations | Jul 12, 2020 | Atari Games 100kData Augmentation | CodeCode Available | 1 |
| End-to-End Egospheric Spatial Memory | Feb 15, 2021 | General Reinforcement LearningImitation Learning | CodeCode Available | 1 |
| Learning Deformable Object Manipulation from Expert Demonstrations | Jul 20, 2022 | Deformable Object ManipulationGeneral Reinforcement Learning | CodeCode Available | 1 |