SOTAVerified

Model-based Reinforcement Learning

Papers

Showing 1–50 of 708 papers

Title | Status | Hype
SPO: Sequential Monte Carlo Policy Optimisation | Code | 3
Planning with Diffusion for Flexible Behavior Synthesis | Code | 3
iVideoGPT: Interactive VideoGPTs are Scalable World Models | Code | 2
SINDy-RL: Interpretable and Efficient Model-Based Reinforcement Learning | Code | 2
Mastering Memory Tasks with World Models | Code | 2
CoVO-MPC: Theoretical Analysis of Sampling-based MPC and Optimal Covariance Design | Code | 2
TD-MPC2: Scalable, Robust World Models for Continuous Control | Code | 2
Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models | Code | 2
MBRL-Lib: A Modular Library for Model-based Reinforcement Learning | Code | 2
Learning Accurate Long-term Dynamics for Model-based Reinforcement Learning | Code | 2
Learning to Predict Without Looking Ahead: World Models Without Forward Prediction | Code | 2
Deep Reinforcement Learning in a Handful of Trials using Probabilistic Dynamics Models | Code | 2
Probabilistically safe and efficient model-based Reinforcement Learning | Code | 1
Dream to Drive with Predictive Individual World Model | Code | 1
Diminishing Return of Value Expansion Methods | Code | 1
Zero-shot Model-based Reinforcement Learning using Large Language Models | Code | 1
Drama: Mamba-Enabled Model-Based Reinforcement Learning Is Sample and Parameter Efficient | Code | 1
Learning to Walk from Three Minutes of Real-World Data with Semi-structured Dynamics Models | Code | 1
Learning Discrete World Models for Heuristic Search | Code | 1
Traffic expertise meets residual RL: Knowledge-informed model-based residual reinforcement learning for CAV trajectory control | Code | 1
A Safe and Data-efficient Model-based Reinforcement Learning System for HVAC Control | Code | 1
Generating Code World Models with Large Language Models Guided by Monte Carlo Tree Search | Code | 1
Efficient Multi-agent Reinforcement Learning by Planning | Code | 1
Learning Latent Dynamic Robust Representations for World Models | Code | 1
CompilerDream: Learning a Compiler World Model for General Code Optimization | Code | 1
Model-based Reinforcement Learning for Parameterized Action Spaces | Code | 1
A Distributional Analogue to the Successor Representation | Code | 1
Sample-Efficient Learning to Solve a Real-World Labyrinth Game Using Data-Augmented Model-Based Reinforcement Learning | Code | 1
Reinforcement Learning with Model Predictive Control for Highway Ramp Metering | Code | 1
STORM: Efficient Stochastic Transformer based World Models for Reinforcement Learning | Code | 1
Consciousness-Inspired Spatio-Temporal Abstractions for Better Generalization in Reinforcement Learning | Code | 1
HarmonyDream: Task Harmonization Inside World Models | Code | 1
Practical Probabilistic Model-based Deep Reinforcement Learning by Integrating Dropout Uncertainty and Trajectory Sampling | Code | 1
RePo: Resilient Model-Based Reinforcement Learning by Regularizing Posterior Predictability | Code | 1
Curious Replay for Model-based Adaptation | Code | 1
Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning | Code | 1
Posterior Sampling for Deep Reinforcement Learning | Code | 1
Sample-efficient Model-based Reinforcement Learning for Quantum Control | Code | 1
Model-Based Reinforcement Learning with Isolated Imaginations | Code | 1
Transformer-based World Models Are Happy With 100k Interactions | Code | 1
Predictive Experience Replay for Continual Visual Control and Forecasting | Code | 1
Diminishing Return of Value Expansion Methods in Model-Based Reinforcement Learning | Code | 1
TrafficBots: Towards World Models for Autonomous Driving Simulation and Motion Prediction | Code | 1
Model-Based Uncertainty in Value Functions | Code | 1
Predictable MDP Abstraction for Unsupervised Model-Based RL | Code | 1
MoDem: Accelerating Visual Model-Based Reinforcement Learning with Demonstrations | Code | 1
Physics-Informed Model-Based Reinforcement Learning | Code | 1
The Effectiveness of World Models for Continual Reinforcement Learning | Code | 1
Learning safety in model-based Reinforcement Learning using MPC and Gaussian Processes | Code | 1
On the Feasibility of Cross-Task Transfer with Model-Based Reinforcement Learning | Code | 1

No leaderboard results yet.