| Sim2Sim Evaluation of a Novel Data-Efficient Differentiable Physics Engine for Tensegrity Robots | Nov 10, 2020 | MuJoCo | —Unverified | 0 |
| Adaptive N-step Bootstrapping with Off-policy Data | Jan 1, 2021 | Atari GamesMuJoCo | —Unverified | 0 |
| Biased Estimates of Advantages over Path Ensembles | Sep 15, 2019 | Atari Gamescontinuous-control | —Unverified | 0 |
| Accelerating Self-Imitation Learning from Demonstrations via Policy Constraints and Q-Ensemble | Dec 7, 2022 | continuous-controlContinuous Control | —Unverified | 0 |
| Efficient Diversity-based Experience Replay for Deep Reinforcement Learning | Oct 27, 2024 | Atari GamesDecision Making | —Unverified | 0 |
| An Empirical Analysis of Proximal Policy Optimization with Kronecker-factored Natural Gradients | Jan 17, 2018 | MuJoCoSensitivity | —Unverified | 0 |
| Beyond Rewards: a Hierarchical Perspective on Offline Multiagent Behavioral Analysis | Jun 17, 2022 | MuJoCoStarcraft | —Unverified | 0 |
| A Model-Based Solution to the Offline Multi-Agent Reinforcement Learning Coordination Problem | May 26, 2023 | MuJoCoMulti-agent Reinforcement Learning | —Unverified | 0 |
| Beyond Non-Expert Demonstrations: Outcome-Driven Action Constraint for Offline Reinforcement Learning | Apr 2, 2025 | MuJoCoUncertainty Quantification | —Unverified | 0 |
| Beyond-Expert Performance with Limited Demonstrations: Efficient Imitation Learning with Double Exploration | Jun 25, 2025 | Imitation LearningMuJoCo | —Unverified | 0 |