| Extrinsicaly Rewarded Soft Q Imitation Learning with Discriminator | Jan 30, 2024 | Imitation LearningMuJoCo | —Unverified | 0 | 0 |
| CasIL: Cognizing and Imitating Skills via a Dual Cognition-Action Architecture | Sep 28, 2023 | Imitation LearningMuJoCo | —Unverified | 0 | 0 |
| A Tractable Inference Perspective of Offline RL | Oct 31, 2023 | MuJoCoOffline RL | —Unverified | 0 | 0 |
| Careful at Estimation and Bold at Exploration | Aug 22, 2023 | MuJoCo | —Unverified | 0 | 0 |
| Expected Policy Gradients | Jun 15, 2017 | MuJoCoReinforcement Learning | —Unverified | 0 | 0 |
| Can Reinforcement Learning for Continuous Control Generalize Across Physics Engines? | Oct 27, 2020 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| CAMEL: Continuous Action Masking Enabled by Large Language Models for Reinforcement Learning | Feb 17, 2025 | MuJoCoReinforcement Learning (RL) | —Unverified | 0 | 0 |
| A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning | Dec 12, 2023 | MuJoCoOffline RL | —Unverified | 0 | 0 |
| A Computational Theory of Learning Flexible Reward-Seeking Behavior with Place Cells | Apr 22, 2022 | MuJoCoOpen-Ended Question Answering | —Unverified | 0 | 0 |
| Evolving Rewards to Automate Reinforcement Learning | May 18, 2019 | continuous-controlContinuous Control | —Unverified | 0 | 0 |