| Entropy Augmented Reinforcement Learning | Aug 19, 2022 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| CasIL: Cognizing and Imitating Skills via a Dual Cognition-Action Architecture | Sep 28, 2023 | Imitation LearningMuJoCo | —Unverified | 0 |
| Careful at Estimation and Bold at Exploration | Aug 22, 2023 | MuJoCo | —Unverified | 0 |
| Can Reinforcement Learning for Continuous Control Generalize Across Physics Engines? | Oct 27, 2020 | continuous-controlContinuous Control | —Unverified | 0 |
| CAMEL: Continuous Action Masking Enabled by Large Language Models for Reinforcement Learning | Feb 17, 2025 | MuJoCoReinforcement Learning (RL) | —Unverified | 0 |
| A Computational Theory of Learning Flexible Reward-Seeking Behavior with Place Cells | Apr 22, 2022 | MuJoCoOpen-Ended Question Answering | —Unverified | 0 |
| A Tractable Inference Perspective of Offline RL | Oct 31, 2023 | MuJoCoOffline RL | —Unverified | 0 |
| CAE: Repurposing the Critic as an Explorer in Deep Reinforcement Learning | Mar 23, 2025 | Deep Reinforcement LearningEfficient Exploration | —Unverified | 0 |
| Multiagent Model-based Credit Assignment for Continuous Control | Dec 27, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Cache-Efficient Posterior Sampling for Reinforcement Learning with LLM-Derived Priors Across Discrete and Continuous Domains | May 12, 2025 | continuous-controlContinuous Control | —Unverified | 0 |
| Bridging Physics-Informed Neural Networks with Reinforcement Learning: Hamilton-Jacobi-Bellman Proximal Policy Optimization (HJBPPO) | Feb 1, 2023 | MuJoCoreinforcement-learning | —Unverified | 0 |
| Adaptive Q-Network: On-the-fly Target Selection for Deep Reinforcement Learning | May 25, 2024 | Atari GamesAutoML | —Unverified | 0 |
| ELSIM: End-to-end learning of reusable skills through intrinsic motivation | Jun 23, 2020 | Developmental LearningMuJoCo | —Unverified | 0 |
| An Intelligent Social Learning-based Optimization Strategy for Black-box Robotic Control with Reinforcement Learning | Nov 11, 2023 | continuous-controlContinuous Control | —Unverified | 0 |
| Efficiently Training On-Policy Actor-Critic Networks in Robotic Deep Reinforcement Learning with Demonstration-like Sampled Exploration | Sep 27, 2021 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| A Computational Model of Learning Flexible Navigation in a Maze by Layout-Conforming Replay of Place Cells | Sep 18, 2022 | MuJoCo | —Unverified | 0 |
| Efficient Model-Based Reinforcement Learning Through Optimistic Thompson Sampling | Oct 7, 2024 | continuous-controlContinuous Control | —Unverified | 0 |
| Boosting Exploration in Actor-Critic Algorithms by Incentivizing Plausible Novel States | Oct 1, 2022 | continuous-controlContinuous Control | —Unverified | 0 |
| BlockPuzzle - A Challenge in Physical Reasoning and Generalization for Robot Learning | Nov 30, 2018 | Imitation LearningMuJoCo | —Unverified | 0 |
| Sim2Sim Evaluation of a Novel Data-Efficient Differentiable Physics Engine for Tensegrity Robots | Nov 10, 2020 | MuJoCo | —Unverified | 0 |
| Adaptive N-step Bootstrapping with Off-policy Data | Jan 1, 2021 | Atari GamesMuJoCo | —Unverified | 0 |
| Biased Estimates of Advantages over Path Ensembles | Sep 15, 2019 | Atari Gamescontinuous-control | —Unverified | 0 |
| Accelerating Self-Imitation Learning from Demonstrations via Policy Constraints and Q-Ensemble | Dec 7, 2022 | continuous-controlContinuous Control | —Unverified | 0 |
| Effects of sparse rewards of different magnitudes in the speed of learning of model-based actor critic methods | Jan 18, 2020 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| An Empirical Analysis of Proximal Policy Optimization with Kronecker-factored Natural Gradients | Jan 17, 2018 | MuJoCoSensitivity | —Unverified | 0 |