| SAGE: Generating Symbolic Goals for Myopic Models in Deep Reinforcement Learning | Mar 9, 2022 | Deep Reinforcement LearningMinecraft | CodeCode Available | 0 |
| DreamingV2: Reinforcement Learning with Discrete World Models without Reconstruction | Mar 1, 2022 | Contrastive LearningModel-based Reinforcement Learning | —Unverified | 0 |
| TransDreamer: Reinforcement Learning with Transformer World Models | Feb 19, 2022 | Model-based Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Should I send this notification? Optimizing push notifications decision making by modeling the future | Feb 17, 2022 | Decision MakingModel-based Reinforcement Learning | —Unverified | 0 |
| Provably Efficient Causal Model-Based Reinforcement Learning for Systematic Generalization | Feb 14, 2022 | Decision MakingModel-based Reinforcement Learning | —Unverified | 0 |
| Neural NID Rules | Feb 12, 2022 | Common Sense ReasoningGraph Neural Network | —Unverified | 0 |
| Reward-Respecting Subtasks for Model-Based Reinforcement Learning | Feb 7, 2022 | Model-based Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Meta-Learning Hypothesis Spaces for Sequential Decision-making | Feb 1, 2022 | Bayesian OptimizationDecision Making | —Unverified | 0 |
| Bellman Meets Hawkes: Model-Based Reinforcement Learning via Temporal Point Processes | Jan 29, 2022 | Decision MakingModel-based Reinforcement Learning | CodeCode Available | 0 |
| Joint Differentiable Optimization and Verification for Certified Reinforcement Learning | Jan 28, 2022 | Bilevel OptimizationModel-based Reinforcement Learning | —Unverified | 0 |
| Physical Derivatives: Computing policy gradients by physical forward-propagation | Jan 15, 2022 | Model-based Reinforcement Learning | —Unverified | 0 |
| Opportunities of Hybrid Model-based Reinforcement Learning for Cell Therapy Manufacturing Process Control | Jan 10, 2022 | Decision MakingModel-based Reinforcement Learning | —Unverified | 0 |
| Assessing Policy, Loss and Planning Combinations in Reinforcement Learning using a New Modular Architecture | Jan 8, 2022 | Deep Reinforcement LearningModel-based Reinforcement Learning | —Unverified | 0 |
| Exponential Family Model-Based Reinforcement Learning via Score Matching | Dec 28, 2021 | Density EstimationModel-based Reinforcement Learning | CodeCode Available | 0 |
| Exploration-exploitation trade-off for continuous-time episodic reinforcement learning with linear-convex models | Dec 19, 2021 | Model-based Reinforcement LearningReinforcement Learning (RL) | —Unverified | 0 |
| CEM-GD: Cross-Entropy Method with Gradient Descent Planner for Model-Based Reinforcement Learning | Dec 14, 2021 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Model-Value Inconsistency as a Signal for Epistemic Uncertainty | Dec 8, 2021 | Model-based Reinforcement LearningRolling Shutter Correction | —Unverified | 0 |
| ED2: Environment Dynamics Decomposition World Models for Continuous Control | Dec 6, 2021 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Maximum Entropy Model-based Reinforcement Learning | Dec 2, 2021 | Dota 2model | —Unverified | 0 |
| Sample Complexity of Robust Reinforcement Learning with a Generative Model | Dec 2, 2021 | Model-based Reinforcement Learningreinforcement-learning | CodeCode Available | 0 |
| Weighted model estimation for offline model-based reinforcement learning | Dec 1, 2021 | Density Ratio Estimationmodel | —Unverified | 0 |
| Model-Based Reinforcement Learning via Imagination with Derived Memory | Dec 1, 2021 | Model-based Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Improving gearshift controllers for electric vehicles with reinforcement learning | Dec 1, 2021 | Experimental DesignModel-based Reinforcement Learning | —Unverified | 0 |
| Understanding End-to-End Model-Based Reinforcement Learning Methods as Implicit Parameterization | Dec 1, 2021 | Model-based Reinforcement Learningreinforcement-learning | —Unverified | 0 |
| Learning State Representations via Retracing in Reinforcement Learning | Nov 24, 2021 | continuous-controlContinuous Control | CodeCode Available | 0 |