| Policy Gradient in Partially Observable Environments: Approximation and Convergence | Oct 18, 2018 | Decision MakingPolicy Gradient Methods | —Unverified | 0 | 0 |
| Understanding Early Word Learning in Situated Artificial Agents | Oct 26, 2017 | Grounded language learningPolicy Gradient Methods | —Unverified | 0 | 0 |
| Understanding Grounded Language Learning Agents | Jan 1, 2018 | Grounded language learningPolicy Gradient Methods | —Unverified | 0 | 0 |
| Value-Based Reinforcement Learning for Continuous Control Robotic Manipulation in Multi-Task Sparse Reward Settings | Jul 28, 2021 | continuous-controlContinuous Control | —Unverified | 0 | 0 |
| Variance Reduced Domain Randomization for Policy Gradient | Sep 29, 2021 | Deep Reinforcement LearningPolicy Gradient Methods | —Unverified | 0 | 0 |
| Variance Reduction for Policy-Gradient Methods via Empirical Variance Minimization | Jun 14, 2022 | Policy Gradient MethodsReinforcement Learning (RL) | —Unverified | 0 | 0 |
| Variance Reduction for Policy Gradient with Action-Dependent Factorized Baselines | Mar 20, 2018 | Deep Reinforcement LearningPolicy Gradient Methods | —Unverified | 0 | 0 |
| Variance Reduction for Reinforcement Learning in Input-Driven Environments | Jul 6, 2018 | Meta-LearningMuJoCo | —Unverified | 0 | 0 |
| Variance Reduction in Actor Critic Methods (ACM) | Jul 23, 2019 | Policy Gradient Methods | —Unverified | 0 | 0 |
| When Do Off-Policy and On-Policy Policy Gradient Methods Align? | Feb 19, 2024 | Policy Gradient Methods | —Unverified | 0 | 0 |