| Genetic Imitation Learning by Reward Extrapolation | Jan 3, 2023 | Imitation LearningMuJoCo | —Unverified | 0 |
| Contextual Conservative Q-Learning for Offline Reinforcement Learning | Jan 3, 2023 | MuJoCoQ-Learning | —Unverified | 0 |
| Pontryagin Optimal Control via Neural Networks | Dec 30, 2022 | Model-based Reinforcement LearningMuJoCo | CodeCode Available | 0 |
| On the Geometry of Reinforcement Learning in Continuous State and Action Spaces | Dec 29, 2022 | MuJoCoreinforcement-learning | —Unverified | 0 |
| Offline Robot Reinforcement Learning with Uncertainty-Guided Human Expert Sampling | Dec 16, 2022 | MuJoCoQ-Learning | —Unverified | 0 |
| Off-Policy Deep Reinforcement Learning Algorithms for Handling Various Robotic Manipulator Tasks | Dec 11, 2022 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| Accelerating Self-Imitation Learning from Demonstrations via Policy Constraints and Q-Ensemble | Dec 7, 2022 | continuous-controlContinuous Control | —Unverified | 0 |
| First Go, then Post-Explore: the Benefits of Post-Exploration in Intrinsic Motivation | Dec 6, 2022 | continuous-controlContinuous Control | —Unverified | 0 |
| TD3 with Reverse KL Regularizer for Offline Reinforcement Learning from Mixed Datasets | Dec 5, 2022 | D4RLMuJoCo | CodeCode Available | 0 |
| Time-Efficient Reward Learning via Visually Assisted Cluster Ranking | Nov 30, 2022 | Dimensionality ReductionMuJoCo | —Unverified | 0 |