| Experimental design for MRI by greedy policy search | Oct 30, 2020 | Experimental DesignPolicy Gradient Methods | CodeCode Available | 1 |
| Batch Reinforcement Learning with a Nonparametric Off-Policy Policy Gradient | Oct 27, 2020 | Policy Gradient Methodsreinforcement-learning | —Unverified | 0 |
| Sample Efficient Reinforcement Learning with REINFORCE | Oct 22, 2020 | Policy Gradient Methodsreinforcement-learning | —Unverified | 0 |
| Rethinking Deep Policy Gradients via State-Wise Policy Improvement | Oct 19, 2020 | Policy Gradient MethodsValue prediction | —Unverified | 0 |
| Efficient Wasserstein Natural Gradients for Reinforcement Learning | Oct 12, 2020 | Policy Gradient Methodsreinforcement-learning | CodeCode Available | 1 |
| Evolutionary Selective Imitation: Interpretable Agents by Imitation Learning Without a Demonstrator | Sep 17, 2020 | Imitation LearningOpenAI Gym | —Unverified | 0 |
| Approximation Benefits of Policy Gradient Methods with Aggregated States | Jul 22, 2020 | Policy Gradient Methods | —Unverified | 0 |
| On Linear Convergence of Policy Gradient Methods for Finite MDPs | Jul 21, 2020 | Policy Gradient Methods | —Unverified | 0 |
| PC-PG: Policy Cover Directed Exploration for Provable Policy Gradient Learning | Jul 16, 2020 | Policy Gradient MethodsQ-Learning | CodeCode Available | 0 |
| Lifelong Policy Gradient Learning of Factored Policies for Faster Training Without Forgetting | Jul 14, 2020 | Lifelong learningPolicy Gradient Methods | CodeCode Available | 1 |
| Fast Global Convergence of Natural Policy Gradient Methods with Entropy Regularization | Jul 13, 2020 | Policy Gradient Methods | —Unverified | 0 |
| Momentum-Based Policy Gradient Methods | Jul 13, 2020 | Policy Gradient Methods | CodeCode Available | 0 |
| Policy Gradient Optimization of Thompson Sampling Policies | Jun 30, 2020 | Policy Gradient MethodsThompson Sampling | —Unverified | 0 |
| Deep Bayesian Quadrature Policy Optimization | Jun 28, 2020 | continuous-controlContinuous Control | CodeCode Available | 1 |
| An operator view of policy gradient methods | Jun 19, 2020 | Policy Gradient Methods | —Unverified | 0 |
| Competitive Policy Optimization | Jun 18, 2020 | Policy Gradient Methods | CodeCode Available | 1 |
| Lifelong Learning of Factored Policies via Policy Gradients | Jun 12, 2020 | continuous-controlContinuous Control | —Unverified | 0 |
| Zeroth-Order Supervised Policy Improvement | Jun 11, 2020 | continuous-controlContinuous Control | —Unverified | 0 |
| Jointly Learning Environments and Control Policies with Projected Stochastic Gradient Ascent | Jun 2, 2020 | Deep Reinforcement LearningPolicy Gradient Methods | CodeCode Available | 0 |
| Invariant Policy Optimization: Towards Stronger Generalization in Reinforcement Learning | Jun 1, 2020 | Policy Gradient Methodsreinforcement-learning | CodeCode Available | 1 |
| On the Global Convergence Rates of Softmax Policy Gradient Methods | May 13, 2020 | Open-Ended Question AnsweringPolicy Gradient Methods | —Unverified | 0 |
| Improving Sample Efficiency and Multi-Agent Communication in RL-based Train Rescheduling | Apr 28, 2020 | Policy Gradient Methodsreinforcement-learning | —Unverified | 0 |
| Safe Reinforcement Learning via Projection on a Safe Set: How to Achieve Optimality? | Apr 2, 2020 | Policy Gradient MethodsQ-Learning | —Unverified | 0 |
| Exchangeable Input Representations for Reinforcement Learning | Mar 19, 2020 | Deep Reinforcement LearningPolicy Gradient Methods | —Unverified | 0 |
| Stochastic Recursive Momentum for Policy Gradient Methods | Mar 9, 2020 | Policy Gradient Methods | —Unverified | 0 |