| Learning Decentralized Partially Observable Mean Field Control for Artificial Collective Behavior | Jul 12, 2023 | Policy Gradient MethodsReinforcement Learning (RL) | —Unverified | 0 |
| Provably Convergent Policy Optimization via Metric-aware Trust Region Methods | Jun 25, 2023 | continuous-controlContinuous Control | —Unverified | 0 |
| Correcting discount-factor mismatch in on-policy policy gradient methods | Jun 23, 2023 | OpenAI GymPolicy Gradient Methods | —Unverified | 0 |
| Neural Inventory Control in Networks via Hindsight Differentiable Policy Optimization | Jun 20, 2023 | Deep Reinforcement LearningManagement | CodeCode Available | 1 |
| Acceleration in Policy Optimization | Jun 18, 2023 | Meta-LearningPolicy Gradient Methods | —Unverified | 0 |
| Deep Policy Gradient Methods in Commodity Markets | Jun 14, 2023 | Deep Reinforcement LearningPolicy Gradient Methods | —Unverified | 0 |
| Stepsize Learning for Policy Gradient Methods in Contextual Markov Decision Processes | Jun 13, 2023 | Meta Reinforcement LearningPolicy Gradient Methods | —Unverified | 0 |
| Confidence-Controlled Exploration: Efficient Sparse-Reward Policy Learning for Robot Navigation | Jun 9, 2023 | Policy Gradient Methodsreinforcement-learning | —Unverified | 0 |
| Efficient Diffusion Policies for Offline Reinforcement Learning | May 31, 2023 | D4RLOffline RL | CodeCode Available | 1 |
| Solving Robust MDPs through No-Regret Dynamics | May 30, 2023 | NavigatePolicy Gradient Methods | —Unverified | 0 |