| Variance Reduction for Policy-Gradient Methods via Empirical Variance Minimization | Jun 14, 2022 | Policy Gradient Methods, Reinforcement Learning (RL) | —Unverified | 0 | 0 |
| Variance Reduction for Policy Gradient with Action-Dependent Factorized Baselines | Mar 20, 2018 | Deep Reinforcement Learning, Policy Gradient Methods | —Unverified | 0 | 0 |
| Variance Reduction for Reinforcement Learning in Input-Driven Environments | Jul 6, 2018 | Meta-Learning, MuJoCo | —Unverified | 0 | 0 |
| Variance Reduction in Actor Critic Methods (ACM) | Jul 23, 2019 | Policy Gradient Methods | —Unverified | 0 | 0 |
| When Do Off-Policy and On-Policy Policy Gradient Methods Align? | Feb 19, 2024 | Policy Gradient Methods | —Unverified | 0 | 0 |
| Diversity-Inducing Policy Gradient: Using Maximum Mean Discrepancy to Find a Set of Diverse Policies | May 31, 2019 | Diversity, Policy Gradient Methods | —Unverified | 0 | 0 |
| Zeroth-Order Supervised Policy Improvement | Jun 11, 2020 | continuous-control, Continuous Control | —Unverified | 0 | 0 |
| 2D or not 2D? Adaptive 3D Convolution Selection for Efficient Video Recognition | Dec 29, 2020 | Action Recognition, Policy Gradient Methods | —Unverified | 0 | 0 |
| Accelerated Reinforcement Learning | Oct 23, 2017 | Policy Gradient Methods, reinforcement-learning | —Unverified | 0 | 0 |
| Accelerating Policy Gradient by Estimating Value Function from Prior Computation in Deep Reinforcement Learning | Feb 2, 2023 | Deep Reinforcement Learning, Policy Gradient Methods | —Unverified | 0 | 0 |
| Action-dependent Control Variates for Policy Optimization via Stein Identity | Jan 1, 2018 | Policy Gradient Methods, reinforcement-learning | —Unverified | 0 | 0 |
| Actor-Critic Policy Optimization in a Large-Scale Imperfect-Information Game | Sep 29, 2021 | counterfactual, Deep Reinforcement Learning | —Unverified | 0 | 0 |
| Actor-Critic Reinforcement Learning with Phased Actor | Apr 18, 2024 | Policy Gradient Methods, reinforcement-learning | —Unverified | 0 | 0 |
| AdaFrame: Adaptive Frame Selection for Fast Video Recognition | Nov 29, 2018 | Policy Gradient Methods, Video Recognition | —Unverified | 0 | 0 |
| Confidence-Controlled Exploration: Efficient Sparse-Reward Policy Learning for Robot Navigation | Jun 9, 2023 | Policy Gradient Methods, reinforcement-learning | —Unverified | 0 | 0 |
| Adaptive Batch Size for Safe Policy Gradients | Dec 1, 2017 | Policy Gradient Methods, Reinforcement Learning | —Unverified | 0 | 0 |
| Momentum-Based Policy Gradient with Second-Order Information | May 17, 2022 | Policy Gradient Methods | —Unverified | 0 | 0 |
| Adaptive Policy Learning to Additional Tasks | May 24, 2023 | Policy Gradient Methods | —Unverified | 0 | 0 |
| Adaptive Step-Size for Policy Gradient Methods | Dec 1, 2013 | Policy Gradient Methods, Reinforcement Learning | —Unverified | 0 | 0 |
| Ad Headline Generation using Self-Critical Masked Language Model | Jun 1, 2021 | Headline Generation, Language Modeling | —Unverified | 0 | 0 |
| Adversarial Policy Gradient for Alternating Markov Games | Jan 1, 2018 | Policy Gradient Methods, Reinforcement Learning | —Unverified | 0 | 0 |
| A Hybrid Approach Between Adversarial Generative Networks and Actor-Critic Policy Gradient for Low Rate High-Resolution Image Compression | Jun 11, 2019 | Decoder, Image Compression | —Unverified | 0 | 0 |
| A K-fold Method for Baseline Estimation in Policy Gradient Algorithms | Jan 3, 2017 | MuJoCo, Policy Gradient Methods | —Unverified | 0 | 0 |
| A Large Deviations Perspective on Policy Gradient Algorithms | Nov 13, 2023 | Policy Gradient Methods, reinforcement-learning | —Unverified | 0 | 0 |
| All-Action Policy Gradient Methods: A Numerical Integration Approach | Oct 21, 2019 | All, continuous-control | —Unverified | 0 | 0 |