| Entropy annealing for policy mirror descent in continuous time and space | May 30, 2024 | Policy Gradient Methods | —Unverified | 0 |
| Mollification Effects of Policy Gradient Methods | May 28, 2024 | continuous-controlContinuous Control | —Unverified | 0 |
| Linear Function Approximation as a Computationally Efficient Method to Solve Classical Reinforcement Learning Challenges | May 27, 2024 | AcrobotPolicy Gradient Methods | —Unverified | 0 |
| Matrix Low-Rank Approximation For Policy Gradient Methods | May 27, 2024 | Matrix CompletionPolicy Gradient Methods | CodeCode Available | 0 |
| Policy Gradient Methods for Risk-Sensitive Distributional Reinforcement Learning with Provable Convergence | May 23, 2024 | Distributional Reinforcement LearningPolicy Gradient Methods | —Unverified | 0 |
| Almost sure convergence rates of stochastic gradient methods under gradient domination | May 22, 2024 | Policy Gradient Methodsreinforcement-learning | —Unverified | 0 |
| An Initial Introduction to Cooperative Multi-Agent Reinforcement Learning | May 10, 2024 | MisconceptionsMulti-agent Reinforcement Learning | —Unverified | 0 |
| Federated Reinforcement Learning with Constraint Heterogeneity | May 6, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Off-OAB: Off-Policy Policy Gradient Method with Optimal Action-Dependent Baseline | May 4, 2024 | Computational EfficiencyMuJoCo | —Unverified | 0 |
| Information-Theoretic Opacity-Enforcement in Markov Decision Processes | Apr 30, 2024 | Policy Gradient Methods | —Unverified | 0 |
| Control randomisation approach for policy gradient and application to reinforcement learning in optimal switching | Apr 27, 2024 | Policy Gradient Methods | —Unverified | 0 |
| Actor-Critic Reinforcement Learning with Phased Actor | Apr 18, 2024 | Policy Gradient Methodsreinforcement-learning | —Unverified | 0 |
| Intervention-Assisted Policy Gradient Methods for Online Stochastic Queuing Network Optimization: Technical Report | Apr 5, 2024 | Deep Reinforcement LearningPolicy Gradient Methods | —Unverified | 0 |
| Elementary Analysis of Policy Gradient Methods | Apr 4, 2024 | Policy Gradient Methods | —Unverified | 0 |
| Self-Improvement for Neural Combinatorial Optimization: Sample without Replacement, but Improvement | Mar 22, 2024 | Combinatorial OptimizationImitation Learning | CodeCode Available | 1 |
| ReAct Meets ActRe: When Language Agents Enjoy Training Data Autonomy | Mar 21, 2024 | Policy Gradient Methods | —Unverified | 0 |
| Towards Global Optimality for Practical Average Reward Reinforcement Learning without Mixing Time Oracles | Mar 18, 2024 | Policy Gradient Methods | —Unverified | 0 |
| Global Convergence Guarantees for Federated Policy Gradient Methods with Adversaries | Mar 15, 2024 | Decision MakingPolicy Gradient Methods | —Unverified | 0 |
| Towards Efficient Risk-Sensitive Policy Gradient: An Iteration Complexity Analysis | Mar 13, 2024 | Policy Gradient MethodsReinforcement Learning (RL) | —Unverified | 0 |
| Provable Policy Gradient Methods for Average-Reward Markov Potential Games | Mar 9, 2024 | Policy Gradient Methods | —Unverified | 0 |
| Stabilizing Policy Gradients for Stochastic Differential Equations via Consistency with Perturbation Process | Mar 7, 2024 | Drug DesignPolicy Gradient Methods | —Unverified | 0 |
| Fill-and-Spill: Deep Reinforcement Learning Policy Gradient Methods for Reservoir Operation Decision and Control | Mar 7, 2024 | Deep Reinforcement LearningPolicy Gradient Methods | —Unverified | 0 |
| Towards Provable Log Density Policy Gradient | Mar 3, 2024 | Policy Gradient Methodsreinforcement-learning | —Unverified | 0 |
| Reusing Historical Trajectories in Natural Policy Gradient via Importance Sampling: Convergence and Convergence Rate | Mar 1, 2024 | Policy Gradient Methods | —Unverified | 0 |
| When Do Off-Policy and On-Policy Policy Gradient Methods Align? | Feb 19, 2024 | Policy Gradient Methods | —Unverified | 0 |