| Policy-Aware Model Learning for Policy Gradient Methods | Feb 28, 2020 | model, Model-based Reinforcement Learning | Code Available | 0 |
| GACEM: Generalized Autoregressive Cross Entropy Method for Multi-Modal Black Box Constraint Satisfaction | Feb 17, 2020 | Policy Gradient Methods, Reinforcement Learning | Unverified | 0 |
| On the Convergence Theory of Debiased Model-Agnostic Meta-Reinforcement Learning | Feb 12, 2020 | Meta-Learning, Meta Reinforcement Learning | Code Available | 0 |
| Statistically Efficient Off-Policy Policy Gradients | Feb 10, 2020 | Policy Gradient Methods, Reinforcement Learning | Unverified | 0 |
| Bayesian Residual Policy Optimization: Scalable Bayesian Reinforcement Learning with Clairvoyant Experts | Feb 7, 2020 | Decision Making, Policy Gradient Methods | Unverified | 0 |
| Neural MMO v1.3: A Massively Multiagent Game Environment for Training and Evaluating Neural Networks | Jan 31, 2020 | Policy Gradient Methods, Reinforcement Learning | Unverified | 0 |
| Deep Reinforcement Learning based Blind mmWave MIMO Beam Alignment | Jan 25, 2020 | Deep Reinforcement Learning, Policy Gradient Methods | Unverified | 0 |
| A Nonparametric Off-Policy Policy Gradient | Jan 8, 2020 | Density Estimation, Policy Gradient Methods | Code Available | 0 |
| Entropy Regularization with Discounted Future State Distribution in Policy Gradient Methods | Dec 11, 2019 | Policy Gradient Methods | Unverified | 0 |
| Fast Efficient Hyperparameter Tuning for Policy Gradient Methods | Dec 1, 2019 | Policy Gradient Methods | Code Available | 0 |
| Optimal Resource Allocation in Wireless Control Systems via Deep Policy Gradient | Oct 25, 2019 | Deep Reinforcement Learning, Policy Gradient Methods | Unverified | 0 |
| All-Action Policy Gradient Methods: A Numerical Integration Approach | Oct 21, 2019 | All, continuous-control | Unverified | 0 |
| Policy Optimization for H_2 Linear Control with H_∞ Robustness Guarantee: Implicit Regularization and Global Convergence | Oct 21, 2019 | Policy Gradient Methods, Reinforcement Learning | Unverified | 0 |
| Linear-Quadratic Mean-Field Reinforcement Learning: Convergence of Policy Gradient Methods | Oct 9, 2019 | Policy Gradient Methods, reinforcement-learning | Unverified | 0 |
| V-MPO: On-Policy Maximum a Posteriori Policy Optimization for Discrete and Continuous Control | Sep 26, 2019 | continuous-control, Continuous Control | Code Available | 0 |
| AUGMENTED POLICY GRADIENT METHODS FOR EFFICIENT REINFORCEMENT LEARNING | Sep 25, 2019 | Policy Gradient Methods, reinforcement-learning | Unverified | 0 |
| Guided Adaptive Credit Assignment for Sample Efficient Policy Optimization | Sep 25, 2019 | Instruction Following, Policy Gradient Methods | Unverified | 0 |
| Policy Tree Network | Sep 25, 2019 | Model-based Reinforcement Learning, MuJoCo | Unverified | 0 |
| DeepGait: Planning and Control of Quadrupedal Gaits using Deep Reinforcement Learning | Sep 18, 2019 | Deep Reinforcement Learning, Motion Planning | Unverified | 0 |
| Sample Efficient Policy Gradient Methods with Recursive Variance Reduction | Sep 18, 2019 | Policy Gradient Methods, reinforcement-learning | Code Available | 0 |
| Deep Reinforcement Learning Algorithm for Dynamic Pricing of Express Lanes with Multiple Access Locations | Sep 10, 2019 | Deep Reinforcement Learning, Policy Gradient Methods | Code Available | 0 |
| Transfer Reward Learning for Policy Gradient-Based Text Generation | Sep 9, 2019 | Conditional Text Generation, Image Captioning | Unverified | 0 |
| Multi Pseudo Q-learning Based Deterministic Policy Gradient for Tracking Control of Autonomous Underwater Vehicles | Sep 7, 2019 | Policy Gradient Methods, Q-Learning | Unverified | 0 |
| Neural Policy Gradient Methods: Global Optimality and Rates of Convergence | Aug 29, 2019 | Policy Gradient Methods | Unverified | 0 |
| Trajectory-wise Control Variates for Variance Reduction in Policy Gradient Methods | Aug 8, 2019 | Policy Gradient Methods, Reinforcement Learning | Unverified | 0 |
| Health-Informed Policy Gradients for Multi-Agent Reinforcement Learning | Aug 2, 2019 | Multi-agent Reinforcement Learning, Policy Gradient Methods | Code Available | 0 |
| On the Theory of Policy Gradient Methods: Optimality, Approximation, and Distribution Shift | Aug 1, 2019 | Policy Gradient Methods, Reinforcement Learning | Unverified | 0 |
| Hindsight Trust Region Policy Optimization | Jul 29, 2019 | Atari Games, Policy Gradient Methods | Code Available | 0 |
| Variance Reduction in Actor Critic Methods (ACM) | Jul 23, 2019 | Policy Gradient Methods | Unverified | 0 |
| Shapley Q-value: A Local Reward Approach to Solve Global Reward Games | Jul 11, 2019 | Multi-agent Reinforcement Learning, Policy Gradient Methods | Code Available | 0 |
| Policy Optimization with Stochastic Mirror Descent | Jun 25, 2019 | Continuous Control, Policy Gradient Methods | Unverified | 0 |
| Ranking Policy Gradient | Jun 24, 2019 | Policy Gradient Methods, Reinforcement Learning | Code Available | 0 |
| Ekar: An Explainable Method for Knowledge Aware Recommendation | Jun 22, 2019 | Knowledge-Aware Recommendation, Knowledge Graphs | Code Available | 2 |
| Entropic Risk Measure in Policy Search | Jun 21, 2019 | Policy Gradient Methods | Unverified | 0 |
| Global Convergence of Policy Gradient Methods to (Almost) Locally Optimal Policies | Jun 19, 2019 | Autonomous Driving, Policy Gradient Methods | Unverified | 0 |
| Is the Policy Gradient a Gradient? | Jun 17, 2019 | Open-Ended Question Answering, Policy Gradient Methods | Unverified | 0 |
| A Hybrid Approach Between Adversarial Generative Networks and Actor-Critic Policy Gradient for Low Rate High-Resolution Image Compression | Jun 11, 2019 | Decoder, Image Compression | Unverified | 0 |
| Global Optimality Guarantees For Policy Gradient Methods | Jun 5, 2019 | Policy Gradient Methods, Reinforcement Learning | Unverified | 0 |
| Neural Replicator Dynamics | Jun 1, 2019 | counterfactual, Deep Reinforcement Learning | Code Available | 0 |
| Diversity-Inducing Policy Gradient: Using Maximum Mean Discrepancy to Find a Set of Diverse Policies | May 31, 2019 | Diversity, Policy Gradient Methods | Unverified | 0 |
| Policy Search by Target Distribution Learning for Continuous Control | May 27, 2019 | continuous-control, Continuous Control | Unverified | 0 |
| Distributional Policy Optimization: An Alternative Approach for Continuous Control | May 23, 2019 | continuous-control, Continuous Control | Code Available | 1 |
| Trajectory-Based Off-Policy Deep Reinforcement Learning | May 14, 2019 | continuous-control, Continuous Control | Code Available | 0 |
| Learning Novel Policies For Tasks | May 13, 2019 | Policy Gradient Methods, Reinforcement Learning | Unverified | 0 |
| Object Exchangeability in Reinforcement Learning: Extended Abstract | May 7, 2019 | Deep Reinforcement Learning, Object | Unverified | 0 |
| Neural Logic Reinforcement Learning | Apr 24, 2019 | Deep Reinforcement Learning, Inductive logic programming | Code Available | 0 |
| Similarities between policy gradient methods (PGM) in Reinforcement learning (RL) and supervised learning (SL) | Apr 12, 2019 | Decision Making, Policy Gradient Methods | Unverified | 0 |
| Only Relevant Information Matters: Filtering Out Noisy Samples to Boost RL | Apr 8, 2019 | continuous-control, Continuous Control | Unverified | 0 |
| StartNet: Online Detection of Action Start in Untrimmed Videos | Mar 23, 2019 | Action Classification, Policy Gradient Methods | Unverified | 0 |
| Evaluating Rewards for Question Generation Models | Feb 28, 2019 | Machine Translation, Policy Gradient Methods | Code Available | 0 |