| Policy Gradient Methods for Distortion Risk Measures | Jul 9, 2021 | Policy Gradient Methodsreinforcement-learning | —Unverified | 0 |
| Curious Explorer: a provable exploration strategy in Policy Learning | Jun 29, 2021 | Policy Gradient Methods | —Unverified | 0 |
| Modularity in Reinforcement Learning via Algorithmic Independence in Credit Assignment | Jun 28, 2021 | Decision MakingPolicy Gradient Methods | —Unverified | 0 |
| End-to-End Neuro-Symbolic Architecture for Image-to-Image Reasoning Tasks | Jun 6, 2021 | Image ReconstructionPolicy Gradient Methods | —Unverified | 0 |
| Ad Headline Generation using Self-Critical Masked Language Model | Jun 1, 2021 | Headline GenerationLanguage Modeling | —Unverified | 0 |
| Tesseract: Tensorised Actors for Multi-Agent Reinforcement Learning | May 31, 2021 | Learning TheoryMulti-agent Reinforcement Learning | —Unverified | 0 |
| Meta Learning the Step Size in Policy Gradient Methods | May 20, 2021 | Meta-LearningMeta Reinforcement Learning | —Unverified | 0 |
| Controlling an Inverted Pendulum with Policy Gradient Methods-A Tutorial | May 17, 2021 | OpenAI GymPolicy Gradient Methods | —Unverified | 0 |
| On the Linear convergence of Natural Policy Gradient Algorithm | May 4, 2021 | Policy Gradient Methodsreinforcement-learning | —Unverified | 0 |
| Semi-On-Policy Training for Sample Efficient Multi-Agent Policy Gradients | Apr 27, 2021 | Multi-agent Reinforcement LearningPolicy Gradient Methods | —Unverified | 0 |