| Episodic Reinforcement Learning with Expanded State-reward Space | Jan 19, 2024 | Autonomous DrivingDeep Reinforcement Learning | —Unverified | 0 | 0 |
| Estimating Disentangled Belief about Hidden State and Hidden Task for Meta-RL | May 14, 2021 | Inductive BiasMeta Reinforcement Learning | —Unverified | 0 | 0 |
| Evaluating Robustness of Cooperative MARL | Sep 29, 2021 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Evolutionary Strategy Guided Reinforcement Learning via MultiBuffer Communication | Jun 20, 2023 | Deep Reinforcement LearningEvolutionary Algorithms | —Unverified | 0 | 0 |
| Evolving Rewards to Automate Reinforcement Learning | May 18, 2019 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Expected Policy Gradients | Jun 15, 2017 | MuJoCoReinforcement Learning | —Unverified | 0 | 0 |
| A Tractable Inference Perspective of Offline RL | Oct 31, 2023 | MuJoCoOffline RL | —Unverified | 0 | 0 |
| Extrinsicaly Rewarded Soft Q Imitation Learning with Discriminator | Jan 30, 2024 | Imitation LearningMuJoCo | —Unverified | 0 | 0 |
| Fast Adaptation to New Environments via Policy-Dynamics Value Functions | Jan 1, 2020 | MuJoCo | —Unverified | 0 | 0 |
| Fast Convergence of Softmax Policy Mirror Ascent | Nov 18, 2024 | MuJoCo | —Unverified | 0 | 0 |