| PGPS : Coupling Policy Gradient with Population-based Search | Jan 1, 2021 | Deep Reinforcement Learning, MuJoCo | Unverified | 0 |
| Incremental Policy Gradients for Online Reinforcement Learning Control | Jan 1, 2021 | Policy Gradient Methods, reinforcement-learning | Unverified | 0 |
| Self-Supervised Continuous Control without Policy Gradient | Jan 1, 2021 | continuous-control, Continuous Control | Unverified | 0 |
| 2D or not 2D? Adaptive 3D Convolution Selection for Efficient Video Recognition | Dec 29, 2020 | Action Recognition, Policy Gradient Methods | Unverified | 0 |
| Difference Rewards Policy Gradients | Dec 21, 2020 | counterfactual, Multi-agent Reinforcement Learning | Unverified | 0 |
| Model-free and Bayesian Ensembling Model-based Deep Reinforcement Learning for Particle Accelerator Control Demonstrated on the FERMI FEL | Dec 17, 2020 | Deep Reinforcement Learning, model | Code Available | 0 |
| Sample Complexity of Policy Gradient Finding Second-Order Stationary Points | Dec 2, 2020 | Policy Gradient Methods, Reinforcement Learning (RL) | Unverified | 0 |
| Reinforcement Learning in Linear Quadratic Deep Structured Teams: Global Convergence of Policy Gradient Methods | Nov 29, 2020 | Policy Gradient Methods | Unverified | 0 |
| Policy Optimization for Markovian Jump Linear Quadratic Control: Gradient-Based Methods and Global Convergence | Nov 24, 2020 | Policy Gradient Methods | Unverified | 0 |
| Policy Gradient Methods for the Noisy Linear Quadratic Regulator over a Finite Horizon | Nov 20, 2020 | Policy Gradient Methods | Unverified | 0 |
| Optimal Control-Based Baseline for Guided Exploration in Policy Gradient Methods | Nov 4, 2020 | Deep Reinforcement Learning, Policy Gradient Methods | Unverified | 0 |
| A Study of Policy Gradient on a Class of Exactly Solvable Models | Nov 3, 2020 | Policy Gradient Methods | Unverified | 0 |
| Batch Reinforcement Learning with a Nonparametric Off-Policy Policy Gradient | Oct 27, 2020 | Policy Gradient Methods, reinforcement-learning | Unverified | 0 |
| Sample Efficient Reinforcement Learning with REINFORCE | Oct 22, 2020 | Policy Gradient Methods, reinforcement-learning | Unverified | 0 |
| Rethinking Deep Policy Gradients via State-Wise Policy Improvement | Oct 19, 2020 | Policy Gradient Methods, Value prediction | Unverified | 0 |
| Evolutionary Selective Imitation: Interpretable Agents by Imitation Learning Without a Demonstrator | Sep 17, 2020 | Imitation Learning, OpenAI Gym | Unverified | 0 |
| Approximation Benefits of Policy Gradient Methods with Aggregated States | Jul 22, 2020 | Policy Gradient Methods | Unverified | 0 |
| On Linear Convergence of Policy Gradient Methods for Finite MDPs | Jul 21, 2020 | Policy Gradient Methods | Unverified | 0 |
| PC-PG: Policy Cover Directed Exploration for Provable Policy Gradient Learning | Jul 16, 2020 | Policy Gradient Methods, Q-Learning | Code Available | 0 |
| Fast Global Convergence of Natural Policy Gradient Methods with Entropy Regularization | Jul 13, 2020 | Policy Gradient Methods | Unverified | 0 |
| Momentum-Based Policy Gradient Methods | Jul 13, 2020 | Policy Gradient Methods | Code Available | 0 |
| Policy Gradient Optimization of Thompson Sampling Policies | Jun 30, 2020 | Policy Gradient Methods, Thompson Sampling | Unverified | 0 |
| An operator view of policy gradient methods | Jun 19, 2020 | Policy Gradient Methods | Unverified | 0 |
| Lifelong Learning of Factored Policies via Policy Gradients | Jun 12, 2020 | continuous-control, Continuous Control | Unverified | 0 |
| Zeroth-Order Supervised Policy Improvement | Jun 11, 2020 | continuous-control, Continuous Control | Unverified | 0 |