| Reinforcement Twinning: from digital twins to model-based reinforcement learning | Nov 7, 2023 | Model-based Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| DreamSmooth: Improving Model-based Reinforcement Learning via Reward Smoothing | Nov 2, 2023 | Model-based Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| The Alignment Ceiling: Objective Mismatch in Reinforcement Learning from Human Feedback | Oct 31, 2023 | GSM8KMMLU | —Unverified | 0 |
| Efficient Exploration in Continuous-time Model-based Reinforcement Learning | Oct 30, 2023 | Efficient ExplorationGaussian Processes | —Unverified | 0 |
| Benchmark Generation Framework with Customizable Distortions for Image Classifier Robustness | Oct 28, 2023 | Benchmarkingimage-classification | CodeCode Available | 0 |
| Relational Object-Centric Actor-Critic | Oct 26, 2023 | Model-based Reinforcement LearningObject | —Unverified | 0 |
| TD-MPC2: Scalable, Robust World Models for Continuous Control | Oct 25, 2023 | continuous-controlContinuous Control | CodeCode Available | 2 |
| Mind the Model, Not the Agent: The Primacy Bias in Model-based RL | Oct 23, 2023 | continuous-controlContinuous Control | —Unverified | 0 |
| Tree Search in DAG Space with Model-based Reinforcement Learning for Causal Discovery | Oct 20, 2023 | Causal DiscoveryDecision Making | —Unverified | 0 |
| Value-Biased Maximum Likelihood Estimation for Model-based Reinforcement Learning in Discounted Linear MDPs | Oct 17, 2023 | Model-based Reinforcement Learning | —Unverified | 0 |
| MoConVQ: Unified Physics-Based Motion Control via Scalable Discrete Representations | Oct 16, 2023 | In-Context LearningModel-based Reinforcement Learning | —Unverified | 0 |
| STORM: Efficient Stochastic Transformer based World Models for Reinforcement Learning | Oct 14, 2023 | Atari Games 100kModel-based Reinforcement Learning | CodeCode Available | 1 |
| COPlanner: Plan to Roll Out Conservatively but to Explore Optimistically for Model-Based RL | Oct 11, 2023 | continuous-controlContinuous Control | —Unverified | 0 |
| A Unified View on Solving Objective Mismatch in Model-Based Reinforcement Learning | Oct 10, 2023 | Model-based Reinforcement Learning | CodeCode Available | 0 |
| Reward-Consistent Dynamics Models are Strongly Generalizable for Offline Reinforcement Learning | Oct 9, 2023 | D4RLModel-based Reinforcement Learning | —Unverified | 0 |
| Multi-timestep models for Model-based Reinforcement Learning | Oct 9, 2023 | modelModel-based Reinforcement Learning | —Unverified | 0 |
| Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models | Oct 6, 2023 | Code GenerationDecision Making | CodeCode Available | 2 |
| Amortized Network Intervention to Steer the Excitatory Point Processes | Oct 6, 2023 | Decision MakingModel-based Reinforcement Learning | —Unverified | 0 |
| Probabilistic Reach-Avoid for Bayesian Neural Networks | Oct 3, 2023 | Model-based Reinforcement Learning | CodeCode Available | 0 |
| HarmonyDream: Task Harmonization Inside World Models | Sep 30, 2023 | Atari Games 100kModel-based Reinforcement Learning | CodeCode Available | 1 |
| Consciousness-Inspired Spatio-Temporal Abstractions for Better Generalization in Reinforcement Learning | Sep 30, 2023 | Decision MakingModel-based Reinforcement Learning | CodeCode Available | 1 |
| MoDem-V2: Visuo-Motor World Models for Real-World Robot Manipulation | Sep 25, 2023 | Contact-rich ManipulationModel-based Reinforcement Learning | —Unverified | 0 |
| Practical Probabilistic Model-based Deep Reinforcement Learning by Integrating Dropout Uncertainty and Trajectory Sampling | Sep 20, 2023 | Deep Reinforcement LearningModel-based Reinforcement Learning | CodeCode Available | 1 |
| DOMAIN: MilDly COnservative Model-BAsed OfflINe Reinforcement Learning | Sep 16, 2023 | D4RLmodel | —Unverified | 0 |
| Mind the Uncertainty: Risk-Aware and Actively Exploring Model-Based Reinforcement Learning | Sep 11, 2023 | Model-based Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Distributionally Robust Model-based Reinforcement Learning with Large State Spaces | Sep 5, 2023 | Gaussian ProcessesModel-based Reinforcement Learning | —Unverified | 0 |
| RePo: Resilient Model-Based Reinforcement Learning by Regularizing Posterior Predictability | Aug 31, 2023 | Model-based Reinforcement Learningreinforcement-learning | CodeCode Available | 1 |
| The Power of MEME: Adversarial Malware Creation with Model-Based Reinforcement Learning | Aug 31, 2023 | Adversarial Attack | CodeCode Available | 0 |
| Value-Distributional Model-Based Reinforcement Learning | Aug 12, 2023 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Exploring the Potential of World Models for Anomaly Detection in Autonomous Driving | Aug 10, 2023 | Anomaly DetectionAutonomous Driving | —Unverified | 0 |
| Learning Disentangled Discrete Representations | Jul 26, 2023 | Image GenerationModel-based Reinforcement Learning | CodeCode Available | 0 |
| Mode-constrained Model-based Reinforcement Learning via Gaussian Processes | Jul 25, 2023 | Gaussian ProcessesModel-based Reinforcement Learning | CodeCode Available | 0 |
| Theoretically Guaranteed Policy Improvement Distilled from Model-Based Planning | Jul 24, 2023 | continuous-controlContinuous Control | —Unverified | 0 |
| Image Transformation Sequence Retrieval with General Reinforcement Learning | Jul 13, 2023 | General Reinforcement LearningModel-based Reinforcement Learning | —Unverified | 0 |
| Surge Routing: Event-informed Multiagent Reinforcement Learning for Autonomous Rideshare | Jul 5, 2023 | Model-based Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Facing Off World Model Backbones: RNNs, Transformers, and S4 | Jul 5, 2023 | Model-based Reinforcement Learning | —Unverified | 0 |
| λ-models: Effective Decision-Aware Reinforcement Learning with Latent Models | Jun 30, 2023 | continuous-controlContinuous Control | —Unverified | 0 |
| Curious Replay for Model-based Adaptation | Jun 28, 2023 | modelModel-based Reinforcement Learning | CodeCode Available | 1 |
| Deep Generative Models for Decision-Making and Control | Jun 15, 2023 | Decision MakingImage Inpainting | —Unverified | 0 |
| How to Learn and Generalize From Three Minutes of Data: Physics-Constrained and Uncertainty-Aware Neural Stochastic Differential Equations | Jun 10, 2023 | Inductive BiasModel-based Reinforcement Learning | —Unverified | 0 |
| Model-Based Reinforcement Learning with Multi-Task Offline Pretraining | Jun 6, 2023 | Knowledge DistillationModel-based Reinforcement Learning | CodeCode Available | 0 |
| IQL-TD-MPC: Implicit Q-Learning for Hierarchical Model Predictive Control | Jun 1, 2023 | D4RLModel-based Reinforcement Learning | —Unverified | 0 |
| What model does MuZero learn? | Jun 1, 2023 | modelModel-based Reinforcement Learning | —Unverified | 0 |
| Pre-training Contextualized World Models with In-the-wild Videos for Reinforcement Learning | May 29, 2023 | Autonomous DrivingDecoder | CodeCode Available | 1 |
| Digital Twin-Based 3D Map Management for Edge-Assisted Mobile Augmented Reality | May 26, 2023 | ManagementModel-based Reinforcement Learning | —Unverified | 0 |
| Adaptive PD Control using Deep Reinforcement Learning for Local-Remote Teleoperation with Stochastic Time Delays | May 26, 2023 | Deep Reinforcement LearningModel-based Reinforcement Learning | CodeCode Available | 0 |
| TOM: Learning Policy-Aware Models for Model-Based Reinforcement Learning via Transition Occupancy Matching | May 22, 2023 | Model-based Reinforcement LearningMuJoCo | —Unverified | 0 |
| Bridging Active Exploration and Uncertainty-Aware Deployment Using Probabilistic Ensemble Neural Network Dynamics | May 20, 2023 | Autonomous VehiclesModel-based Reinforcement Learning | —Unverified | 0 |
| Sense, Imagine, Act: Multimodal Perception Improves Model-Based Reinforcement Learning for Head-to-Head Autonomous Racing | May 8, 2023 | Autonomous Racingcontinuous-control | —Unverified | 0 |
| A Survey on Offline Model-Based Reinforcement Learning | May 5, 2023 | modelModel-based Reinforcement Learning | —Unverified | 0 |