SOTAVerified

Model-based Reinforcement Learning

Papers

Showing 150 of 708 papers

TitleStatusHype
TransDreamerV3: Implanting Transformer In DreamerV3Code0
On Quantum BSDE Solver for High-Dimensional Parabolic PDEs0
Relative Entropy Regularized Reinforcement Learning for Efficient Encrypted Policy Synthesis0
Accelerating Model-Based Reinforcement Learning using Non-Linear Trajectory Optimization0
Bregman Centroid Guided Cross-Entropy Method0
World Models for Cognitive Agents: Transforming Edge Intelligence in Future Networks0
Calibrated Value-Aware Model Learning with Stochastic Environment Models0
Deep Active Inference Agents for Delayed and Long-Horizon EnvironmentsCode0
MedDreamer: Model-Based Reinforcement Learning with Latent Imagination on Complex EHRs for Clinical Decision Support0
JEDI: Latent End-to-end Diffusion Mitigates Agent-Human Performance Asymmetry in Model-Based Reinforcement Learning0
Gaze Into the Abyss -- Planning to Seek Entropy When Reward is Scarce0
Raw2Drive: Reinforcement Learning with Aligned World Models for End-to-End Autonomous Driving (in CARLA v2)0
Improving planning and MBRL with temporally-extended actions0
Policy-Driven World Model Adaptation for Robust Offline Model-based Reinforcement Learning0
Temporal Distance-aware Transition Augmentation for Offline Model-based Reinforcement Learning0
Multi-Goal Dexterous Hand Manipulation using Probabilistic Model-based Reinforcement Learning0
Data-Assimilated Model-Based Reinforcement Learning for Partially Observed Chaotic Flows0
PIN-WM: Learning Physics-INformed World Models for Non-Prehensile Manipulation0
Learning global control of underactuated systems with Model-Based Reinforcement Learning0
Probabilistic Pontryagin's Maximum Principle for Continuous-Time Model-Based Reinforcement LearningCode0
Learning with Imperfect Models: When Multi-step Prediction Mitigates Compounding Error0
Probabilistically safe and efficient model-based Reinforcement LearningCode1
Look Before Leap: Look-Ahead Planning with Uncertainty in Reinforcement Learning0
Perspective-Shifted Neuro-Symbolic World Models: A Framework for Socially-Aware Robot NavigationCode0
Continual Reinforcement Learning for HVAC Systems Control: Integrating Hypernetworks and Transfer LearningCode0
Counterfactual experience augmented off-policy reinforcement learningCode0
Entropy-regularized Gradient Estimators for Approximate Bayesian Inference0
Towards Causal Model-Based Policy Optimization0
Enhancing Traffic Signal Control through Model-based Reinforcement Learning and Policy Reuse0
InDRiVE: Intrinsic Disagreement based Reinforcement for Vehicle Exploration through Curiosity Driven Generalized World Model0
Knowledge Retention for Continual Model-Based Reinforcement Learning0
Learning Transformer-based World Models with Contrastive Predictive Coding0
World Models for Anomaly Detection during Model-Based Reinforcement Learning Inference0
Differentiable Information Enhanced Model-Based Reinforcement Learning0
Spiking World Model with Multi-Compartment Neurons for Model-based Reinforcement LearningCode0
Multimodal Dreaming: A Global Workspace Approach to World Model-Based Reinforcement Learning0
Accelerating Model-Based Reinforcement Learning with State-Space World Models0
Enhancing Offline Model-Based RL via Active Model Selection: A Bayesian Optimization Perspective0
PrivilegedDreamer: Explicit Imagination of Privileged Information for Rapid Adaptation of Learned Policies0
Towards Empowerment Gain through Causal Structure Learning in Model-Based RL0
Pre-Trained Video Generative Models as World Simulators0
TD-M(PC)^2: Improving Temporal Difference MPC Through Policy Constraint0
Dream to Drive with Predictive Individual World ModelCode1
On Rollouts in Model-Based Reinforcement LearningCode0
Objects matter: object-centric world models improve reinforcement learning in visually complex environments0
Generative AI for Lyapunov Optimization Theory in UAV-based Low-Altitude Economy Networking0
AdaWM: Adaptive World Model based Planning for Autonomous Driving0
GLAM: Global-Local Variation Awareness in Mamba-based World Model0
Robotic World Model: A Neural Network Simulator for Robust Policy Optimization in Robotics0
EVaDE : Event-Based Variational Thompson Sampling for Model-Based Reinforcement Learning0
Show:102550
← PrevPage 1 of 15Next →

No leaderboard results yet.