| Tesseract: Tensorised Actors for Multi-Agent Reinforcement Learning | May 31, 2021 | Learning TheoryMulti-agent Reinforcement Learning | —Unverified | 0 |
| Meta Learning the Step Size in Policy Gradient Methods | May 20, 2021 | Meta-LearningMeta Reinforcement Learning | —Unverified | 0 |
| Controlling an Inverted Pendulum with Policy Gradient Methods-A Tutorial | May 17, 2021 | OpenAI GymPolicy Gradient Methods | —Unverified | 0 |
| On the Linear convergence of Natural Policy Gradient Algorithm | May 4, 2021 | Policy Gradient Methodsreinforcement-learning | —Unverified | 0 |
| Semi-On-Policy Training for Sample Efficient Multi-Agent Policy Gradients | Apr 27, 2021 | Multi-agent Reinforcement LearningPolicy Gradient Methods | —Unverified | 0 |
| Model-free Policy Learning with Reward Gradients | Mar 9, 2021 | Continuous Controlmodel | CodeCode Available | 1 |
| Softmax Policy Gradient Methods Can Take Exponential Time to Converge | Feb 22, 2021 | Policy Gradient Methods | —Unverified | 0 |
| Factored Policy Gradients: Leveraging Structure for Efficient Learning in MOMDPs | Feb 20, 2021 | Policy Gradient Methods | —Unverified | 0 |
| Strategic bidding in freight transport using deep reinforcement learning | Feb 18, 2021 | Deep Reinforcement LearningFairness | —Unverified | 0 |
| Provably Efficient Policy Optimization for Two-Player Zero-Sum Markov Games | Feb 17, 2021 | Policy Gradient MethodsVocal Bursts Valence Prediction | —Unverified | 0 |
| Independent Policy Gradient Methods for Competitive Reinforcement Learning | Jan 11, 2021 | Policy Gradient Methodsreinforcement-learning | —Unverified | 0 |
| Self-Supervised Continuous Control without Policy Gradient | Jan 1, 2021 | continuous-controlContinuous Control | —Unverified | 0 |
| Incremental Policy Gradients for Online Reinforcement Learning Control | Jan 1, 2021 | Policy Gradient Methodsreinforcement-learning | —Unverified | 0 |
| PGPS : Coupling Policy Gradient with Population-based Search | Jan 1, 2021 | Deep Reinforcement LearningMuJoCo | —Unverified | 0 |
| 2D or not 2D? Adaptive 3D Convolution Selection for Efficient Video Recognition | Dec 29, 2020 | Action RecognitionPolicy Gradient Methods | —Unverified | 0 |
| Difference Rewards Policy Gradients | Dec 21, 2020 | counterfactualMulti-agent Reinforcement Learning | —Unverified | 0 |
| Model-free and Bayesian Ensembling Model-based Deep Reinforcement Learning for Particle Accelerator Control Demonstrated on the FERMI FEL | Dec 17, 2020 | Deep Reinforcement Learningmodel | CodeCode Available | 0 |
| An Efficient Asynchronous Method for Integrating Evolutionary and Gradient-based Policy Search | Dec 10, 2020 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Sample Complexity of Policy Gradient Finding Second-Order Stationary Points | Dec 2, 2020 | Policy Gradient MethodsReinforcement Learning (RL) | —Unverified | 0 |
| Learning Multi-Agent Communication through Structured Attentive Reasoning | Dec 1, 2020 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 1 |
| Reinforcement Learning in Linear Quadratic Deep Structured Teams: Global Convergence of Policy Gradient Methods | Nov 29, 2020 | Policy Gradient Methods | —Unverified | 0 |
| Policy Optimization for Markovian Jump Linear Quadratic Control: Gradient-Based Methods and Global Convergence | Nov 24, 2020 | Policy Gradient Methods | —Unverified | 0 |
| Policy Gradient Methods for the Noisy Linear Quadratic Regulator over a Finite Horizon | Nov 20, 2020 | Policy Gradient Methods | —Unverified | 0 |
| Optimal Control-Based Baseline for Guided Exploration in Policy Gradient Methods | Nov 4, 2020 | Deep Reinforcement LearningPolicy Gradient Methods | —Unverified | 0 |
| A Study of Policy Gradient on a Class of Exactly Solvable Models | Nov 3, 2020 | Policy Gradient Methods | —Unverified | 0 |