SOTAVerified

Model-based Reinforcement Learning

Papers

Showing 251300 of 708 papers

TitleStatusHype
Value-Biased Maximum Likelihood Estimation for Model-based Reinforcement Learning in Discounted Linear MDPs0
MoConVQ: Unified Physics-Based Motion Control via Scalable Discrete Representations0
COPlanner: Plan to Roll Out Conservatively but to Explore Optimistically for Model-Based RL0
A Unified View on Solving Objective Mismatch in Model-Based Reinforcement LearningCode0
Multi-timestep models for Model-based Reinforcement Learning0
Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning0
Amortized Network Intervention to Steer the Excitatory Point Processes0
Probabilistic Reach-Avoid for Bayesian Neural NetworksCode0
MoDem-V2: Visuo-Motor World Models for Real-World Robot Manipulation0
DOMAIN: MilDly COnservative Model-BAsed OfflINe Reinforcement Learning0
Mind the Uncertainty: Risk-Aware and Actively Exploring Model-Based Reinforcement Learning0
Distributionally Robust Model-based Reinforcement Learning with Large State Spaces0
The Power of MEME: Adversarial Malware Creation with Model-Based Reinforcement LearningCode0
Value-Distributional Model-Based Reinforcement LearningCode0
Exploring the Potential of World Models for Anomaly Detection in Autonomous Driving0
Learning Disentangled Discrete RepresentationsCode0
Mode-constrained Model-based Reinforcement Learning via Gaussian ProcessesCode0
Theoretically Guaranteed Policy Improvement Distilled from Model-Based Planning0
Image Transformation Sequence Retrieval with General Reinforcement Learning0
Facing Off World Model Backbones: RNNs, Transformers, and S40
Surge Routing: Event-informed Multiagent Reinforcement Learning for Autonomous Rideshare0
λ-models: Effective Decision-Aware Reinforcement Learning with Latent Models0
Deep Generative Models for Decision-Making and Control0
How to Learn and Generalize From Three Minutes of Data: Physics-Constrained and Uncertainty-Aware Neural Stochastic Differential Equations0
Model-Based Reinforcement Learning with Multi-Task Offline PretrainingCode0
IQL-TD-MPC: Implicit Q-Learning for Hierarchical Model Predictive Control0
What model does MuZero learn?0
Digital Twin-Based 3D Map Management for Edge-Assisted Mobile Augmented Reality0
Adaptive PD Control using Deep Reinforcement Learning for Local-Remote Teleoperation with Stochastic Time DelaysCode0
TOM: Learning Policy-Aware Models for Model-Based Reinforcement Learning via Transition Occupancy Matching0
Bridging Active Exploration and Uncertainty-Aware Deployment Using Probabilistic Ensemble Neural Network Dynamics0
Sense, Imagine, Act: Multimodal Perception Improves Model-Based Reinforcement Learning for Head-to-Head Autonomous Racing0
A Survey on Offline Model-Based Reinforcement Learning0
Human Machine Co-adaption Interface via Cooperation Markov Decision Process System0
Semi-Infinitely Constrained Markov Decision Processes and Efficient Reinforcement LearningCode0
FLEX: an Adaptive Exploration Algorithm for Nonlinear SystemsCode0
Model Based Reinforcement Learning for Personalized Heparin Dosing0
Decision-Focused Model-based Reinforcement Learning for Reward Transfer0
State and Parameter Estimation for Affine Nonlinear Systems0
Risk-Sensitive and Robust Model-Based Reinforcement Learning and Planning0
EDGI: Equivariant Diffusion for Planning with Embodied Agents0
Dynamic Update-to-Data Ratio: Minimizing World Model OverfittingCode0
A New Policy Iteration Algorithm For Reinforcement Learning in Zero-Sum Markov Games0
On the Benefits of Leveraging Structural Information in Planning Over the Learned Model0
Replay Buffer with Local Forgetting for Adapting to Local Environment Changes in Deep Model-Based Reinforcement Learning0
Beware of Instantaneous Dependence in Reinforcement Learning0
Sample-efficient Real-time Planning with Curiosity Cross-Entropy Method and Contrastive LearningCode0
Approximating Energy Market Clearing and Bidding With Model-Based Reinforcement Learning0
The Virtues of Laziness in Model-based RL: A Unified Objective and AlgorithmsCode0
Understanding the effect of varying amounts of replay per step0
Show:102550
← PrevPage 6 of 15Next →

No leaderboard results yet.