| Information-Theoretic Opacity-Enforcement in Markov Decision Processes | Apr 30, 2024 | Policy Gradient Methods | —Unverified | 0 |
| Intervention-Assisted Policy Gradient Methods for Online Stochastic Queuing Network Optimization: Technical Report | Apr 5, 2024 | Deep Reinforcement Learning, Policy Gradient Methods | —Unverified | 0 |
| Is the Policy Gradient a Gradient? | Jun 17, 2019 | Open-Ended Question Answering, Policy Gradient Methods | —Unverified | 0 |
| KIPPO: Koopman-Inspired Proximal Policy Optimization | May 20, 2025 | Computational Efficiency, continuous-control | —Unverified | 0 |
| Landscape of Policy Optimization for Finite Horizon MDPs with General State and Action | Sep 25, 2024 | Policy Gradient Methods | —Unverified | 0 |
| Learning Decentralized Partially Observable Mean Field Control for Artificial Collective Behavior | Jul 12, 2023 | Policy Gradient Methods, Reinforcement Learning (RL) | —Unverified | 0 |
| Learning Dynamics and Generalization in Reinforcement Learning | Jun 5, 2022 | Policy Gradient Methods, reinforcement-learning | —Unverified | 0 |
| Learning from Algorithm Feedback: One-Shot SAT Solver Guidance with GNNs | May 21, 2025 | Combinatorial Optimization, Policy Gradient Methods | —Unverified | 0 |
| Learning in complex action spaces without policy gradients | Oct 8, 2024 | Policy Gradient Methods, Q-Learning | —Unverified | 0 |
| Learning Novel Policies For Tasks | May 13, 2019 | Policy Gradient Methods, Reinforcement Learning | —Unverified | 0 |