| Locally Persistent Exploration in Continuous Control Tasks with Sparse Rewards | Dec 26, 2020 | continuous-controlContinuous Control | CodeCode Available | 0 | 5 |
| Bayesian Policy Gradients via Alpha Divergence Dropout Inference | Dec 6, 2017 | continuous-controlContinuous Control | CodeCode Available | 0 | 5 |
| Leveraging exploration in off-policy algorithms via normalizing flows | May 16, 2019 | continuous-controlContinuous Control | CodeCode Available | 0 | 5 |
| Live in the Moment: Learning Dynamics Model Adapted to Evolving Policy | Jul 25, 2022 | continuous-controlContinuous Control | CodeCode Available | 0 | 5 |
| Learning to Play Cup-and-Ball with Noisy Camera Observations | Jul 19, 2020 | MuJoCo | CodeCode Available | 0 | 5 |
| Bayes Adaptive Monte Carlo Tree Search for Offline Model-based Reinforcement Learning | Oct 15, 2024 | D4RLModel-based Reinforcement Learning | CodeCode Available | 0 | 5 |
| Learning What To Do by Simulating the Past | Apr 8, 2021 | MuJoCo | CodeCode Available | 0 | 5 |
| Balancing Value Underestimation and Overestimation with Realistic Actor-Critic | Oct 19, 2021 | continuous-controlContinuous Control | CodeCode Available | 0 | 5 |
| Learning Task Belief Similarity with Latent Dynamics for Meta-Reinforcement Learning | Jun 24, 2025 | Meta Reinforcement LearningMuJoCo | CodeCode Available | 0 | 5 |
| Learning non-Markovian Decision-Making from State-only Sequences | Jun 27, 2023 | Decision MakingImitation Learning | CodeCode Available | 0 | 5 |