| A Study of Policy Gradient on a Class of Exactly Solvable Models | Nov 3, 2020 | Policy Gradient Methods | —Unverified | 0 |
| Assumption Questioning: Latent Copying and Reward Exploitation in Question Generation | Sep 27, 2018 | Inductive Bias, Machine Translation | —Unverified | 0 |
| Actor-Critic Reinforcement Learning with Phased Actor | Apr 18, 2024 | Policy Gradient Methods, reinforcement-learning | —Unverified | 0 |
| Deterministic Policy Gradient Primal-Dual Methods for Continuous-Space Constrained MDPs | Aug 19, 2024 | continuous-control, Continuous Control | —Unverified | 0 |
| Difference Rewards Policy Gradients | Dec 21, 2020 | counterfactual, Multi-agent Reinforcement Learning | —Unverified | 0 |
| DeepGait: Planning and Control of Quadrupedal Gaits using Deep Reinforcement Learning | Sep 18, 2019 | Deep Reinforcement Learning, Motion Planning | —Unverified | 0 |
| A Self-Supervised Reinforcement Learning Approach for Fine-Tuning Large Language Models Using Cross-Attention Signals | Feb 14, 2025 | Policy Gradient Methods | —Unverified | 0 |
| Current applications and potential future directions of reinforcement learning-based Digital Twins in agriculture | Jun 13, 2024 | Decision Making, Management | —Unverified | 0 |
| Deep Policy Gradient Methods in Commodity Markets | Jun 14, 2023 | Deep Reinforcement Learning, Policy Gradient Methods | —Unverified | 0 |
| Curious Explorer: a provable exploration strategy in Policy Learning | Jun 29, 2021 | Policy Gradient Methods | —Unverified | 0 |