SOTAVerified

Model-based Reinforcement Learning

Papers

Showing 601650 of 708 papers

TitleStatusHype
A Model-Based Reinforcement Learning with Adversarial Training for Online RecommendationCode0
Machine Learning and System Identification for Estimation in Physical SystemsCode0
Self-Correcting Models for Model-Based Reinforcement LearningCode0
Semi-Infinitely Constrained Markov Decision Processes and Efficient Reinforcement LearningCode0
TransDreamerV3: Implanting Transformer In DreamerV3Code0
Learning to Fly via Deep Model-Based Reinforcement LearningCode0
The LoCA Regret: A Consistent Metric to Evaluate Model-Based Behavior in Reinforcement LearningCode0
GLIB: Efficient Exploration for Relational Model-Based Reinforcement Learning via Goal-Literal BabblingCode0
Model-Based Reinforcement Learning with Adversarial Training for Online RecommendationCode0
Generative Adversarial User Model for Reinforcement Learning Based Recommendation SystemCode0
Learning the Reward Function for a Misspecified ModelCode0
The Power of MEME: Adversarial Malware Creation with Model-Based Reinforcement LearningCode0
Deep Gaussian Covariance Network with Trajectory Sampling for Data-Efficient Policy SearchCode0
SimuDICE: Offline Policy Optimization Through World Model Updates and DICE EstimationCode0
Bisimulation metric for Model Predictive ControlCode0
Algorithmic Framework for Model-based Deep Reinforcement Learning with Theoretical GuaranteesCode0
On Rollouts in Model-Based Reinforcement LearningCode0
Understanding and Mitigating the Limitations of Prioritized Experience ReplayCode0
Learning State Representations via Retracing in Reinforcement LearningCode0
FORESEE: Prediction with Expansion-Compression Unscented Transform for Online Policy OptimizationCode0
FLEX: an Adaptive Exploration Algorithm for Nonlinear SystemsCode0
Safety Augmented Value Estimation from Demonstrations (SAVED): Safe Deep Model-Based RL for Sparse Cost Robotic TasksCode0
Mode-constrained Model-based Reinforcement Learning via Gaussian ProcessesCode0
Model-Advantage and Value-Aware Models for Model-Based Reinforcement Learning: Bridging the Gap in Theory and PracticeCode0
Deep Active Inference Agents for Delayed and Long-Horizon EnvironmentsCode0
SOLAR: Deep Structured Representations for Model-Based Reinforcement LearningCode0
Model-based deep reinforcement learning for accelerated learning from flow simulationsCode0
A Simple Decentralized Cross-Entropy MethodCode0
Exponential Family Model-Based Reinforcement Learning via Score MatchingCode0
A Unified View on Solving Objective Mismatch in Model-Based Reinforcement LearningCode0
Learning Sequential Latent Variable Models from Multimodal Time Series DataCode0
A General Framework for Structured Learning of Mechanical SystemsCode0
Trust, but verify: model-based exploration in sparse reward environmentsCode0
The Virtues of Laziness in Model-based RL: A Unified Objective and AlgorithmsCode0
Tight Regret Bounds for Model-Based Reinforcement Learning with Greedy PoliciesCode0
Learning Powerful Policies by Using Consistent Dynamics ModelCode0
Adaptive Discretization for Model-Based Reinforcement LearningCode0
Tools for Data-driven Modeling of Within-Hand Manipulation with Underactuated Adaptive HandsCode0
Benchmarking Model-Based Reinforcement LearningCode0
PC-MLP: Model-based Reinforcement Learning with Policy Cover Guided ExplorationCode0
Trust the Model Where It Trusts Itself -- Model-Based Actor-Critic with Uncertainty-Aware Rollout AdaptionCode0
RLFlow: Optimising Neural Network Subgraph Transformation with World ModelsCode0
Perspective-Shifted Neuro-Symbolic World Models: A Framework for Socially-Aware Robot NavigationCode0
Accurate Uncertainties for Deep Learning Using Calibrated RegressionCode0
Towards biologically plausible Dreaming and Planning in recurrent spiking networksCode0
Exploring Model-based Planning with Policy NetworksCode0
Benchmark Generation Framework with Customizable Distortions for Image Classifier RobustnessCode0
PIPPS: Flexible Model-Based Policy Search Robust to the Curse of ChaosCode0
Adaptive PD Control using Deep Reinforcement Learning for Local-Remote Teleoperation with Stochastic Time DelaysCode0
Learning Multimodal Transition Dynamics for Model-Based Reinforcement LearningCode0
Show:102550
← PrevPage 13 of 15Next →

No leaderboard results yet.