| Health-Informed Policy Gradients for Multi-Agent Reinforcement Learning | Aug 2, 2019 | Multi-agent Reinforcement Learning, Policy Gradient Methods | Code Available | 0 |
| On the Theory of Policy Gradient Methods: Optimality, Approximation, and Distribution Shift | Aug 1, 2019 | Policy Gradient Methods, Reinforcement Learning | Unverified | 0 |
| Hindsight Trust Region Policy Optimization | Jul 29, 2019 | Atari Games, Policy Gradient Methods | Code Available | 0 |
| Variance Reduction in Actor Critic Methods (ACM) | Jul 23, 2019 | Policy Gradient Methods | Unverified | 0 |
| Shapley Q-value: A Local Reward Approach to Solve Global Reward Games | Jul 11, 2019 | Multi-agent Reinforcement Learning, Policy Gradient Methods | Code Available | 0 |
| Policy Optimization with Stochastic Mirror Descent | Jun 25, 2019 | Continuous Control, Policy Gradient Methods | Unverified | 0 |
| Ranking Policy Gradient | Jun 24, 2019 | Policy Gradient Methods, Reinforcement Learning | Code Available | 0 |
| Ekar: An Explainable Method for Knowledge Aware Recommendation | Jun 22, 2019 | Knowledge-Aware Recommendation, Knowledge Graphs | Code Available | 2 |
| Entropic Risk Measure in Policy Search | Jun 21, 2019 | Policy Gradient Methods | Unverified | 0 |
| Global Convergence of Policy Gradient Methods to (Almost) Locally Optimal Policies | Jun 19, 2019 | Autonomous Driving, Policy Gradient Methods | Unverified | 0 |
| Is the Policy Gradient a Gradient? | Jun 17, 2019 | Open-Ended Question Answering, Policy Gradient Methods | Unverified | 0 |
| A Hybrid Approach Between Adversarial Generative Networks and Actor-Critic Policy Gradient for Low Rate High-Resolution Image Compression | Jun 11, 2019 | Decoder, Image Compression | Unverified | 0 |
| Global Optimality Guarantees For Policy Gradient Methods | Jun 5, 2019 | Policy Gradient Methods, Reinforcement Learning | Unverified | 0 |
| Neural Replicator Dynamics | Jun 1, 2019 | counterfactual, Deep Reinforcement Learning | Code Available | 0 |
| Diversity-Inducing Policy Gradient: Using Maximum Mean Discrepancy to Find a Set of Diverse Policies | May 31, 2019 | Diversity, Policy Gradient Methods | Unverified | 0 |
| Policy Search by Target Distribution Learning for Continuous Control | May 27, 2019 | Continuous Control | Unverified | 0 |
| Distributional Policy Optimization: An Alternative Approach for Continuous Control | May 23, 2019 | Continuous Control | Code Available | 1 |
| Trajectory-Based Off-Policy Deep Reinforcement Learning | May 14, 2019 | Continuous Control | Code Available | 0 |
| Learning Novel Policies For Tasks | May 13, 2019 | Policy Gradient Methods, Reinforcement Learning | Unverified | 0 |
| Object Exchangeability in Reinforcement Learning: Extended Abstract | May 7, 2019 | Deep Reinforcement Learning, Object | Unverified | 0 |
| Neural Logic Reinforcement Learning | Apr 24, 2019 | Deep Reinforcement Learning, Inductive logic programming | Code Available | 0 |
| Similarities between policy gradient methods (PGM) in Reinforcement learning (RL) and supervised learning (SL) | Apr 12, 2019 | Decision Making, Policy Gradient Methods | Unverified | 0 |
| Only Relevant Information Matters: Filtering Out Noisy Samples to Boost RL | Apr 8, 2019 | Continuous Control | Unverified | 0 |
| StartNet: Online Detection of Action Start in Untrimmed Videos | Mar 23, 2019 | Action Classification, Policy Gradient Methods | Unverified | 0 |
| Evaluating Rewards for Question Generation Models | Feb 28, 2019 | Machine Translation, Policy Gradient Methods | Code Available | 0 |