| Certifiably Robust Reinforcement Learning through Model-Based Abstract Interpretation | Jan 26, 2023 | Adversarial RobustnessMuJoCo | —Unverified | 0 |
| Partial advantage estimator for proximal policy optimization | Jan 26, 2023 | MuJoCoPolicy Gradient Methods | CodeCode Available | 1 |
| Which Experiences Are Influential for Your Agent? Policy Iteration with Turn-over Dropout | Jan 26, 2023 | MuJoCoreinforcement-learning | CodeCode Available | 0 |
| Joint action loss for proximal policy optimization | Jan 26, 2023 | Dota 2MuJoCo | CodeCode Available | 1 |
| Actor-Director-Critic: A Novel Deep Reinforcement Learning Framework | Jan 10, 2023 | Action ClassificationDecision Making | —Unverified | 0 |
| Genetic Imitation Learning by Reward Extrapolation | Jan 3, 2023 | Imitation LearningMuJoCo | —Unverified | 0 |
| Contextual Conservative Q-Learning for Offline Reinforcement Learning | Jan 3, 2023 | MuJoCoQ-Learning | —Unverified | 0 |
| Pontryagin Optimal Control via Neural Networks | Dec 30, 2022 | Model-based Reinforcement LearningMuJoCo | CodeCode Available | 0 |
| On the Geometry of Reinforcement Learning in Continuous State and Action Spaces | Dec 29, 2022 | MuJoCoreinforcement-learning | —Unverified | 0 |
| Offline Robot Reinforcement Learning with Uncertainty-Guided Human Expert Sampling | Dec 16, 2022 | MuJoCoQ-Learning | —Unverified | 0 |
| Off-Policy Deep Reinforcement Learning Algorithms for Handling Various Robotic Manipulator Tasks | Dec 11, 2022 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| Accelerating Self-Imitation Learning from Demonstrations via Policy Constraints and Q-Ensemble | Dec 7, 2022 | continuous-controlContinuous Control | —Unverified | 0 |
| First Go, then Post-Explore: the Benefits of Post-Exploration in Intrinsic Motivation | Dec 6, 2022 | continuous-controlContinuous Control | —Unverified | 0 |
| TD3 with Reverse KL Regularizer for Offline Reinforcement Learning from Mixed Datasets | Dec 5, 2022 | D4RLMuJoCo | CodeCode Available | 0 |
| Time-Efficient Reward Learning via Visually Assisted Cluster Ranking | Nov 30, 2022 | Dimensionality ReductionMuJoCo | —Unverified | 0 |
| Continuous Neural Algorithmic Planners | Nov 29, 2022 | continuous-controlContinuous Control | —Unverified | 0 |
| Learning from Good Trajectories in Offline Multi-Agent Reinforcement Learning | Nov 28, 2022 | continuous-controlContinuous Control | —Unverified | 0 |
| On the Effect of Pre-training for Transformer in Different Modality on Offline Reinforcement Learning | Nov 17, 2022 | MuJoCo | CodeCode Available | 0 |
| Contextual Transformer for Offline Meta Reinforcement Learning | Nov 15, 2022 | D4RLMeta Reinforcement Learning | —Unverified | 0 |
| Out-of-Dynamics Imitation Learning from Multimodal Demonstrations | Nov 13, 2022 | Imitation LearningMuJoCo | CodeCode Available | 0 |
| Control Transformer: Robot Navigation in Unknown Environments through PRM-Guided Return-Conditioned Sequence Modeling | Nov 11, 2022 | MuJoCoNavigate | —Unverified | 0 |
| Max-Min Off-Policy Actor-Critic Method Focusing on Worst-Case Robustness to Model Misspecification | Nov 7, 2022 | MuJoCo | CodeCode Available | 1 |
| Reward Shaping Using Convolutional Neural Network | Oct 30, 2022 | MuJoCo | —Unverified | 0 |
| Imitating Opponent to Win: Adversarial Policy Imitation Learning in Two-player Competitive Games | Oct 30, 2022 | Deep Reinforcement LearningImitation Learning | —Unverified | 0 |
| Group Distributionally Robust Reinforcement Learning with Hierarchical Latent Variables | Oct 21, 2022 | MuJoCoreinforcement-learning | —Unverified | 0 |