| Improving Reward-Conditioned Policies for Multi-Armed Bandits using Normalized Weight Functions | Jun 16, 2024 | Multi-Armed BanditsPolicy Gradient Methods | —Unverified | 0 |
| Current applications and potential future directions of reinforcement learning-based Digital Twins in agriculture | Jun 13, 2024 | Decision MakingManagement | —Unverified | 0 |
| Optimal Rates of Convergence for Entropy Regularization in Discounted Markov Decision Processes | Jun 6, 2024 | Policy Gradient Methods | —Unverified | 0 |
| Entropy annealing for policy mirror descent in continuous time and space | May 30, 2024 | Policy Gradient Methods | —Unverified | 0 |
| Mollification Effects of Policy Gradient Methods | May 28, 2024 | continuous-controlContinuous Control | —Unverified | 0 |
| Matrix Low-Rank Approximation For Policy Gradient Methods | May 27, 2024 | Matrix CompletionPolicy Gradient Methods | CodeCode Available | 0 |
| Linear Function Approximation as a Computationally Efficient Method to Solve Classical Reinforcement Learning Challenges | May 27, 2024 | AcrobotPolicy Gradient Methods | —Unverified | 0 |
| Policy Gradient Methods for Risk-Sensitive Distributional Reinforcement Learning with Provable Convergence | May 23, 2024 | Distributional Reinforcement LearningPolicy Gradient Methods | —Unverified | 0 |
| Almost sure convergence rates of stochastic gradient methods under gradient domination | May 22, 2024 | Policy Gradient Methodsreinforcement-learning | —Unverified | 0 |
| An Initial Introduction to Cooperative Multi-Agent Reinforcement Learning | May 10, 2024 | MisconceptionsMulti-agent Reinforcement Learning | —Unverified | 0 |
| Federated Reinforcement Learning with Constraint Heterogeneity | May 6, 2024 | Language ModelingLanguage Modelling | —Unverified | 0 |
| Off-OAB: Off-Policy Policy Gradient Method with Optimal Action-Dependent Baseline | May 4, 2024 | Computational EfficiencyMuJoCo | —Unverified | 0 |
| Information-Theoretic Opacity-Enforcement in Markov Decision Processes | Apr 30, 2024 | Policy Gradient Methods | —Unverified | 0 |
| Control randomisation approach for policy gradient and application to reinforcement learning in optimal switching | Apr 27, 2024 | Policy Gradient Methods | —Unverified | 0 |
| Actor-Critic Reinforcement Learning with Phased Actor | Apr 18, 2024 | Policy Gradient Methodsreinforcement-learning | —Unverified | 0 |
| Intervention-Assisted Policy Gradient Methods for Online Stochastic Queuing Network Optimization: Technical Report | Apr 5, 2024 | Deep Reinforcement LearningPolicy Gradient Methods | —Unverified | 0 |
| Elementary Analysis of Policy Gradient Methods | Apr 4, 2024 | Policy Gradient Methods | —Unverified | 0 |
| ReAct Meets ActRe: When Language Agents Enjoy Training Data Autonomy | Mar 21, 2024 | Policy Gradient Methods | —Unverified | 0 |
| Towards Global Optimality for Practical Average Reward Reinforcement Learning without Mixing Time Oracles | Mar 18, 2024 | Policy Gradient Methods | —Unverified | 0 |
| Global Convergence Guarantees for Federated Policy Gradient Methods with Adversaries | Mar 15, 2024 | Decision MakingPolicy Gradient Methods | —Unverified | 0 |
| Towards Efficient Risk-Sensitive Policy Gradient: An Iteration Complexity Analysis | Mar 13, 2024 | Policy Gradient MethodsReinforcement Learning (RL) | —Unverified | 0 |
| Provable Policy Gradient Methods for Average-Reward Markov Potential Games | Mar 9, 2024 | Policy Gradient Methods | —Unverified | 0 |
| Fill-and-Spill: Deep Reinforcement Learning Policy Gradient Methods for Reservoir Operation Decision and Control | Mar 7, 2024 | Deep Reinforcement LearningPolicy Gradient Methods | —Unverified | 0 |
| Stabilizing Policy Gradients for Stochastic Differential Equations via Consistency with Perturbation Process | Mar 7, 2024 | Drug DesignPolicy Gradient Methods | —Unverified | 0 |
| Towards Provable Log Density Policy Gradient | Mar 3, 2024 | Policy Gradient Methodsreinforcement-learning | —Unverified | 0 |