| SPO: Sequential Monte Carlo Policy Optimisation | Feb 12, 2024 | Decision Making, Model-based Reinforcement Learning | Code Available | 3 |
| Planning with Diffusion for Flexible Behavior Synthesis | May 20, 2022 | Decision Making, Denoising | Code Available | 3 |
| iVideoGPT: Interactive VideoGPTs are Scalable World Models | May 24, 2024 | Decision Making, Model-based Reinforcement Learning | Code Available | 2 |
| SINDy-RL: Interpretable and Efficient Model-Based Reinforcement Learning | Mar 14, 2024 | Deep Reinforcement Learning, Dictionary Learning | Code Available | 2 |
| Mastering Memory Tasks with World Models | Mar 7, 2024 | Model-based Reinforcement Learning, State Space Models | Code Available | 2 |
| CoVO-MPC: Theoretical Analysis of Sampling-based MPC and Optimal Covariance Design | Jan 14, 2024 | Model-based Reinforcement Learning, Model Predictive Control | Code Available | 2 |
| TD-MPC2: Scalable, Robust World Models for Continuous Control | Oct 25, 2023 | continuous-control, Continuous Control | Code Available | 2 |
| Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models | Oct 6, 2023 | Code Generation, Decision Making | Code Available | 2 |
| MBRL-Lib: A Modular Library for Model-based Reinforcement Learning | Apr 20, 2021 | Model-based Reinforcement Learning, reinforcement-learning | Code Available | 2 |
| Learning Accurate Long-term Dynamics for Model-based Reinforcement Learning | Dec 16, 2020 | Model-based Reinforcement Learning, Prediction | Code Available | 2 |
| Learning to Predict Without Looking Ahead: World Models Without Forward Prediction | Oct 29, 2019 | Model-based Reinforcement Learning, reinforcement-learning | Code Available | 2 |
| Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models | May 30, 2018 | Deep Reinforcement Learning, Model-based Reinforcement Learning | Code Available | 2 |
| Probabilistically safe and efficient model-based Reinforcement Learning | Apr 1, 2025 | Model-based Reinforcement Learning, Model Predictive Control | Code Available | 1 |
| Dream to Drive with Predictive Individual World Model | Jan 28, 2025 | Autonomous Driving, model | Code Available | 1 |
| Diminishing Return of Value Expansion Methods | Dec 29, 2024 | Model-based Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Zero-shot Model-based Reinforcement Learning using Large Language Models | Oct 15, 2024 | In-Context Learning, Model-based Reinforcement Learning | Code Available | 1 |
| Learning to Walk from Three Minutes of Real-World Data with Semi-structured Dynamics Models | Oct 11, 2024 | Model-based Reinforcement Learning, reinforcement-learning | Code Available | 1 |
| Drama: Mamba-Enabled Model-Based Reinforcement Learning Is Sample and Parameter Efficient | Oct 11, 2024 | Mamba, Model-based Reinforcement Learning | Code Available | 1 |
| Learning Discrete World Models for Heuristic Search | Sep 14, 2024 | Deep Reinforcement Learning, Heuristic Search | Code Available | 1 |
| Traffic expertise meets residual RL: Knowledge-informed model-based residual reinforcement learning for CAV trajectory control | Aug 30, 2024 | Model-based Reinforcement Learning, Reinforcement Learning (RL) | Code Available | 1 |
| A Safe and Data-efficient Model-based Reinforcement Learning System for HVAC Control | Jul 16, 2024 | Model-based Reinforcement Learning | Code Available | 1 |
| Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search | May 24, 2024 | Code Generation, Language Modelling | Code Available | 1 |
| Efficient Multi-agent Reinforcement Learning by Planning | May 20, 2024 | Computational Efficiency, Model-based Reinforcement Learning | Code Available | 1 |
| Learning Latent Dynamic Robust Representations for World Models | May 10, 2024 | Model-based Reinforcement Learning | Code Available | 1 |
| CompilerDream: Learning a Compiler World Model for General Code Optimization | Apr 24, 2024 | Diversity, Model-based Reinforcement Learning | Code Available | 1 |