| Evolutionary Selective Imitation: Interpretable Agents by Imitation Learning Without a Demonstrator | Sep 17, 2020 | Imitation LearningOpenAI Gym | —Unverified | 0 |
| Approximation Benefits of Policy Gradient Methods with Aggregated States | Jul 22, 2020 | Policy Gradient Methods | —Unverified | 0 |
| On Linear Convergence of Policy Gradient Methods for Finite MDPs | Jul 21, 2020 | Policy Gradient Methods | —Unverified | 0 |
| PC-PG: Policy Cover Directed Exploration for Provable Policy Gradient Learning | Jul 16, 2020 | Policy Gradient MethodsQ-Learning | CodeCode Available | 0 |
| Fast Global Convergence of Natural Policy Gradient Methods with Entropy Regularization | Jul 13, 2020 | Policy Gradient Methods | —Unverified | 0 |
| Momentum-Based Policy Gradient Methods | Jul 13, 2020 | Policy Gradient Methods | CodeCode Available | 0 |
| Policy Gradient Optimization of Thompson Sampling Policies | Jun 30, 2020 | Policy Gradient MethodsThompson Sampling | —Unverified | 0 |
| An operator view of policy gradient methods | Jun 19, 2020 | Policy Gradient Methods | —Unverified | 0 |
| Lifelong Learning of Factored Policies via Policy Gradients | Jun 12, 2020 | continuous-controlContinuous Control | —Unverified | 0 |
| Zeroth-Order Supervised Policy Improvement | Jun 11, 2020 | continuous-controlContinuous Control | —Unverified | 0 |