| Reusing Historical Trajectories in Natural Policy Gradient via Importance Sampling: Convergence and Convergence Rate | Mar 1, 2024 | Policy Gradient Methods | —Unverified | 0 |
| When Do Off-Policy and On-Policy Policy Gradient Methods Align? | Feb 19, 2024 | Policy Gradient Methods | —Unverified | 0 |
| Identifying Policy Gradient Subspaces | Jan 12, 2024 | continuous-controlContinuous Control | —Unverified | 0 |
| Global Convergence of Natural Policy Gradient with Hessian-aided Momentum Variance Reduction | Jan 2, 2024 | MuJoCoPolicy Gradient Methods | —Unverified | 0 |
| Training Diffusion Models Towards Diverse Image Generation with Reinforcement Learning | Jan 1, 2024 | Decision MakingDiversity | —Unverified | 0 |
| Optimistic Policy Gradient in Multi-Player Markov Games with a Single Controller: Convergence Beyond the Minty Property | Dec 19, 2023 | Policy Gradient Methods | —Unverified | 0 |
| Privacy Preserving Multi-Agent Reinforcement Learning in Supply Chains | Dec 9, 2023 | Multi-agent Reinforcement LearningPolicy Gradient Methods | —Unverified | 0 |
| RL Dreams: Policy Gradient Optimization for Score Distillation based 3D Generation | Dec 8, 2023 | 3D GenerationDenoising | —Unverified | 0 |
| Score-Aware Policy-Gradient Methods and Performance Guarantees using Local Lyapunov Conditions: Applications to Product-Form Stochastic Networks and Queueing Systems | Dec 5, 2023 | FormModel-based Reinforcement Learning | —Unverified | 0 |
| Predictable Reinforcement Learning Dynamics through Entropy Rate Minimization | Nov 30, 2023 | Policy Gradient Methodsreinforcement-learning | CodeCode Available | 0 |
| A Large Deviations Perspective on Policy Gradient Algorithms | Nov 13, 2023 | Policy Gradient Methodsreinforcement-learning | —Unverified | 0 |
| Clipped-Objective Policy Gradients for Pessimistic Policy Optimization | Nov 10, 2023 | Deep Reinforcement LearningMulti-Task Learning | CodeCode Available | 0 |
| On the Second-Order Convergence of Biased Policy Gradient Algorithms | Nov 5, 2023 | Policy Gradient Methods | —Unverified | 0 |
| Riemannian stochastic optimization methods avoid strict saddle points | Nov 4, 2023 | Dictionary LearningPolicy Gradient Methods | —Unverified | 0 |
| Federated Natural Policy Gradient and Actor Critic Methods for Multi-task Reinforcement Learning | Nov 1, 2023 | Decision MakingPolicy Gradient Methods | —Unverified | 0 |
| Optimization Landscape of Policy Gradient Methods for Discrete-time Static Output Feedback | Oct 29, 2023 | Policy Gradient Methods | —Unverified | 0 |
| Accelerated Policy Gradient: On the Convergence Rates of the Nesterov Momentum for Reinforcement Learning | Oct 18, 2023 | Policy Gradient Methodsreinforcement-learning | CodeCode Available | 0 |
| f-Policy Gradients: A General Framework for Goal Conditioned RL using f-Divergences | Oct 10, 2023 | Efficient ExplorationPolicy Gradient Methods | —Unverified | 0 |
| Optimizing Solution-Samplers for Combinatorial Problems: The Landscape of Policy-Gradient Methods | Oct 8, 2023 | Policy Gradient MethodsTraveling Salesman Problem | —Unverified | 0 |
| Global Convergence of Policy Gradient Methods in Reinforcement Learning, Games and Control | Oct 8, 2023 | Decision MakingPolicy Gradient Methods | —Unverified | 0 |
| Beyond Stationarity: Convergence Analysis of Stochastic Softmax Policy Gradient Methods | Oct 4, 2023 | Decision MakingPolicy Gradient Methods | —Unverified | 0 |
| Sample Complexity of Neural Policy Mirror Descent for Policy Optimization on Low-Dimensional Manifolds | Sep 25, 2023 | Policy Gradient MethodsReinforcement Learning (RL) | —Unverified | 0 |
| Oracle Complexity Reduction for Model-free LQR: A Stochastic Variance-Reduced Policy Gradient Approach | Sep 19, 2023 | Policy Gradient Methods | CodeCode Available | 0 |
| Learning Zero-Sum Linear Quadratic Games with Improved Sample Complexity and Last-Iterate Convergence | Sep 8, 2023 | Multi-agent Reinforcement LearningPolicy Gradient Methods | CodeCode Available | 0 |
| Commodities Trading through Deep Policy Gradient Methods | Aug 10, 2023 | Algorithmic TradingDeep Reinforcement Learning | —Unverified | 0 |
| Hindsight-DICE: Stable Credit Assignment for Deep Reinforcement Learning | Jul 21, 2023 | Decision MakingDeep Reinforcement Learning | CodeCode Available | 0 |
| Enabling Efficient, Reliable Real-World Reinforcement Learning with Approximate Physics-Based Models | Jul 16, 2023 | Policy Gradient Methods | CodeCode Available | 0 |
| Learning Decentralized Partially Observable Mean Field Control for Artificial Collective Behavior | Jul 12, 2023 | Policy Gradient MethodsReinforcement Learning (RL) | —Unverified | 0 |
| Provably Convergent Policy Optimization via Metric-aware Trust Region Methods | Jun 25, 2023 | continuous-controlContinuous Control | —Unverified | 0 |
| Correcting discount-factor mismatch in on-policy policy gradient methods | Jun 23, 2023 | OpenAI GymPolicy Gradient Methods | —Unverified | 0 |
| Acceleration in Policy Optimization | Jun 18, 2023 | Meta-LearningPolicy Gradient Methods | —Unverified | 0 |
| Deep Policy Gradient Methods in Commodity Markets | Jun 14, 2023 | Deep Reinforcement LearningPolicy Gradient Methods | —Unverified | 0 |
| Stepsize Learning for Policy Gradient Methods in Contextual Markov Decision Processes | Jun 13, 2023 | Meta Reinforcement LearningPolicy Gradient Methods | —Unverified | 0 |
| Confidence-Controlled Exploration: Efficient Sparse-Reward Policy Learning for Robot Navigation | Jun 9, 2023 | Policy Gradient Methodsreinforcement-learning | —Unverified | 0 |
| Solving Robust MDPs through No-Regret Dynamics | May 30, 2023 | NavigatePolicy Gradient Methods | —Unverified | 0 |
| Adaptive Policy Learning to Additional Tasks | May 24, 2023 | Policy Gradient Methods | —Unverified | 0 |
| Shattering the Agent-Environment Interface for Fine-Tuning Inclusive Language Models | May 19, 2023 | Efficient ExplorationLanguage Modeling | —Unverified | 0 |
| Client Selection for Federated Policy Optimization with Environment Heterogeneity | May 18, 2023 | MuJoCoPolicy Gradient Methods | CodeCode Available | 0 |
| Policy Gradient Methods for Discrete Time Linear Quadratic Regulator With Random Parameters | Mar 29, 2023 | Policy Gradient Methodsreinforcement-learning | —Unverified | 0 |
| Policy Mirror Descent Inherently Explores Action Space | Mar 8, 2023 | Efficient ExplorationGeneral Reinforcement Learning | —Unverified | 0 |
| Policy gradient learning methods for stochastic control with exit time and applications to share repurchase pricing | Feb 14, 2023 | Policy Gradient Methods | —Unverified | 0 |
| A Policy Gradient Framework for Stochastic Optimal Control Problems with Global Convergence Guarantee | Feb 11, 2023 | Policy Gradient Methods | —Unverified | 0 |
| Distributional constrained reinforcement learning for supply chain optimization | Feb 3, 2023 | Distributional Reinforcement LearningPolicy Gradient Methods | CodeCode Available | 0 |
| Stochastic Policy Gradient Methods: Improved Sample Complexity for Fisher-non-degenerate Policies | Feb 3, 2023 | Policy Gradient Methods | —Unverified | 0 |
| Accelerating Policy Gradient by Estimating Value Function from Prior Computation in Deep Reinforcement Learning | Feb 2, 2023 | Deep Reinforcement LearningPolicy Gradient Methods | —Unverified | 0 |
| Policy Gradient for Rectangular Robust Markov Decision Processes | Jan 31, 2023 | FormPolicy Gradient Methods | —Unverified | 0 |
| SoftTreeMax: Exponential Variance Reduction in Policy Gradient via Tree Search | Jan 30, 2023 | GPUPolicy Gradient Methods | —Unverified | 0 |
| Stochastic Dimension-reduced Second-order Methods for Policy Optimization | Jan 28, 2023 | Policy Gradient MethodsSecond-order methods | —Unverified | 0 |
| On the Global Convergence of Risk-Averse Policy Gradient Methods with Expected Conditional Risk Measures | Jan 26, 2023 | Decision MakingPolicy Gradient Methods | —Unverified | 0 |
| On the Convergence of Discounted Policy Gradient Methods | Dec 28, 2022 | Policy Gradient Methodsreinforcement-learning | —Unverified | 0 |