| Reinforcement Learning for Causal Discovery without Acyclicity Constraints | Aug 24, 2024 | Causal Discovery, Efficient Exploration | —Unverified | 0 | 0 |
| All-Action Policy Gradient Methods: A Numerical Integration Approach | Oct 21, 2019 | All, continuous-control | —Unverified | 0 | 0 |
| AdaFrame: Adaptive Frame Selection for Fast Video Recognition | Nov 29, 2018 | Policy Gradient Methods, Video Recognition | —Unverified | 0 | 0 |
| Accelerating Policy Gradient by Estimating Value Function from Prior Computation in Deep Reinforcement Learning | Feb 2, 2023 | Deep Reinforcement Learning, Policy Gradient Methods | —Unverified | 0 | 0 |
| 2D or not 2D? Adaptive 3D Convolution Selection for Efficient Video Recognition | Dec 29, 2020 | Action Recognition, Policy Gradient Methods | —Unverified | 0 | 0 |
| Efficient Baseline-free Sampling in Parameter Exploring Policy Gradients: Super Symmetric PGPE | Dec 13, 2013 | Policy Gradient Methods | —Unverified | 0 | 0 |
| A unified view of entropy-regularized Markov decision processes | May 22, 2017 | Policy Gradient Methods, reinforcement-learning | —Unverified | 0 | 0 |
| AUGMENTED POLICY GRADIENT METHODS FOR EFFICIENT REINFORCEMENT LEARNING | Sep 25, 2019 | Policy Gradient Methods, reinforcement-learning | —Unverified | 0 | 0 |
| A Large Deviations Perspective on Policy Gradient Algorithms | Nov 13, 2023 | Policy Gradient Methods, reinforcement-learning | —Unverified | 0 | 0 |
| Diverse Exploration via Conjugate Policies for Policy Gradient Methods | Feb 10, 2019 | Policy Gradient Methods | —Unverified | 0 | 0 |