| DeepGait: Planning and Control of Quadrupedal Gaits using Deep Reinforcement Learning | Sep 18, 2019 | Deep Reinforcement LearningMotion Planning | —Unverified | 0 |
| Deep Reinforcement Learning Algorithm for Dynamic Pricing of Express Lanes with Multiple Access Locations | Sep 10, 2019 | Deep Reinforcement LearningPolicy Gradient Methods | CodeCode Available | 0 |
| Transfer Reward Learning for Policy Gradient-Based Text Generation | Sep 9, 2019 | Conditional Text GenerationImage Captioning | —Unverified | 0 |
| Multi Pseudo Q-learning Based Deterministic Policy Gradient for Tracking Control of Autonomous Underwater Vehicles | Sep 7, 2019 | Policy Gradient MethodsQ-Learning | —Unverified | 0 |
| Neural Policy Gradient Methods: Global Optimality and Rates of Convergence | Aug 29, 2019 | Policy Gradient Methods | —Unverified | 0 |
| Trajectory-wise Control Variates for Variance Reduction in Policy Gradient Methods | Aug 8, 2019 | Policy Gradient MethodsReinforcement Learning | —Unverified | 0 |
| Health-Informed Policy Gradients for Multi-Agent Reinforcement Learning | Aug 2, 2019 | Multi-agent Reinforcement LearningPolicy Gradient Methods | CodeCode Available | 0 |
| On the Theory of Policy Gradient Methods: Optimality, Approximation, and Distribution Shift | Aug 1, 2019 | Policy Gradient MethodsReinforcement Learning | —Unverified | 0 |
| Hindsight Trust Region Policy Optimization | Jul 29, 2019 | Atari GamesPolicy Gradient Methods | CodeCode Available | 0 |
| Variance Reduction in Actor Critic Methods (ACM) | Jul 23, 2019 | Policy Gradient Methods | —Unverified | 0 |
| Shapley Q-value: A Local Reward Approach to Solve Global Reward Games | Jul 11, 2019 | Multi-agent Reinforcement LearningPolicy Gradient Methods | CodeCode Available | 0 |
| Policy Optimization with Stochastic Mirror Descent | Jun 25, 2019 | Continuous ControlPolicy Gradient Methods | —Unverified | 0 |
| Ranking Policy Gradient | Jun 24, 2019 | Policy Gradient MethodsReinforcement Learning | CodeCode Available | 0 |
| Entropic Risk Measure in Policy Search | Jun 21, 2019 | Policy Gradient Methods | —Unverified | 0 |
| Global Convergence of Policy Gradient Methods to (Almost) Locally Optimal Policies | Jun 19, 2019 | Autonomous DrivingPolicy Gradient Methods | —Unverified | 0 |
| Is the Policy Gradient a Gradient? | Jun 17, 2019 | Open-Ended Question AnsweringPolicy Gradient Methods | —Unverified | 0 |
| A Hybrid Approach Between Adversarial Generative Networks and Actor-Critic Policy Gradient for Low Rate High-Resolution Image Compression | Jun 11, 2019 | DecoderImage Compression | —Unverified | 0 |
| Global Optimality Guarantees For Policy Gradient Methods | Jun 5, 2019 | Policy Gradient MethodsReinforcement Learning | —Unverified | 0 |
| Neural Replicator Dynamics | Jun 1, 2019 | counterfactualDeep Reinforcement Learning | CodeCode Available | 0 |
| Diversity-Inducing Policy Gradient: Using Maximum Mean Discrepancy to Find a Set of Diverse Policies | May 31, 2019 | DiversityPolicy Gradient Methods | —Unverified | 0 |
| Policy Search by Target Distribution Learning for Continuous Control | May 27, 2019 | continuous-controlContinuous Control | —Unverified | 0 |
| Trajectory-Based Off-Policy Deep Reinforcement Learning | May 14, 2019 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Learning Novel Policies For Tasks | May 13, 2019 | Policy Gradient MethodsReinforcement Learning | —Unverified | 0 |
| Object Exchangeability in Reinforcement Learning: Extended Abstract | May 7, 2019 | Deep Reinforcement LearningObject | —Unverified | 0 |
| Neural Logic Reinforcement Learning | Apr 24, 2019 | Deep Reinforcement LearningInductive logic programming | CodeCode Available | 0 |