| Efficient Model-Based Reinforcement Learning Through Optimistic Thompson Sampling | Oct 7, 2024 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| ELSIM: End-to-end learning of reusable skills through intrinsic motivation | Jun 23, 2020 | Developmental LearningMuJoCo | —Unverified | 0 | 0 |
| Enhanced DACER Algorithm with High Diffusion Efficiency | May 29, 2025 | DenoisingImitation Learning | —Unverified | 0 | 0 |
| EnsembleDAgger: A Bayesian Approach to Safe Imitation Learning | Jul 22, 2018 | Imitation LearningMuJoCo | —Unverified | 0 | 0 |
| Entropy Augmented Reinforcement Learning | Aug 19, 2022 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 | 0 |
| Episodic Reinforcement Learning with Expanded State-reward Space | Jan 19, 2024 | Autonomous DrivingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Estimating Disentangled Belief about Hidden State and Hidden Task for Meta-RL | May 14, 2021 | Inductive BiasMeta Reinforcement Learning | —Unverified | 0 | 0 |
| Evaluating Robustness of Cooperative MARL | Sep 29, 2021 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Evolutionary Strategy Guided Reinforcement Learning via MultiBuffer Communication | Jun 20, 2023 | Deep Reinforcement LearningEvolutionary Algorithms | —Unverified | 0 | 0 |
| Evolving Rewards to Automate Reinforcement Learning | May 18, 2019 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Expected Policy Gradients | Jun 15, 2017 | MuJoCoReinforcement Learning | —Unverified | 0 | 0 |
| A Tractable Inference Perspective of Offline RL | Oct 31, 2023 | MuJoCoOffline RL | —Unverified | 0 | 0 |
| Extrinsicaly Rewarded Soft Q Imitation Learning with Discriminator | Jan 30, 2024 | Imitation LearningMuJoCo | —Unverified | 0 | 0 |
| Fast Adaptation to New Environments via Policy-Dynamics Value Functions | Jan 1, 2020 | MuJoCo | —Unverified | 0 | 0 |
| Fast Convergence of Softmax Policy Mirror Ascent | Nov 18, 2024 | MuJoCo | —Unverified | 0 | 0 |
| FastTD3: Simple, Fast, and Capable Reinforcement Learning for Humanoid Control | May 28, 2025 | GPUHumanoid Control | —Unverified | 0 | 0 |
| Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments | Jul 19, 2022 | MuJoCoreinforcement-learning | —Unverified | 0 | 0 |
| Fight fire with fire: countering bad shortcuts in imitation learning with good shortcuts | Sep 29, 2021 | Autonomous Drivingcontinuous-control | —Unverified | 0 | 0 |
| Fighting Fire with Fire: Avoiding DNN Shortcuts through Priming | Jun 22, 2022 | Autonomous DrivingClassification | —Unverified | 0 | 0 |
| Fine-Tuning Offline Reinforcement Learning with Model-Based Policy Optimization | Jan 1, 2021 | D4RLMuJoCo | —Unverified | 0 | 0 |
| First Go, then Post-Explore: the Benefits of Post-Exploration in Intrinsic Motivation | Dec 6, 2022 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Follow the Object: Curriculum Learning for Manipulation Tasks with Imagined Goals | Aug 5, 2020 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 | 0 |
| Formal Language Constrained Markov Decision Processes | Jan 1, 2021 | MuJoCo | —Unverified | 0 | 0 |
| FP3O: Enabling Proximal Policy Optimization in Multi-Agent Cooperation with Parameter-Sharing Versatility | Oct 8, 2023 | MuJoCoMulti-agent Reinforcement Learning | —Unverified | 0 | 0 |
| From proprioception to long-horizon planning in novel environments: A hierarchical RL model | Jun 11, 2020 | Efficient ExplorationModel Predictive Control | —Unverified | 0 | 0 |