| Augmented Bayesian Policy Search | Jul 5, 2024 | Bayesian Optimization, LEMMA | —Unverified | 0 | 0 |
| Asynchronous stochastic approximations with asymptotically biased errors and deep multi-agent learning | Feb 22, 2018 | Multi-agent Reinforcement Learning, Policy Gradient Methods | —Unverified | 0 | 0 |
| A K-fold Method for Baseline Estimation in Policy Gradient Algorithms | Jan 3, 2017 | MuJoCo, Policy Gradient Methods | —Unverified | 0 | 0 |
| Difference Rewards Policy Gradients | Dec 21, 2020 | counterfactual, Multi-agent Reinforcement Learning | —Unverified | 0 | 0 |
| Asynchronous Multi-Agent Actor-Critic with Macro-Actions | Sep 29, 2021 | Decision Making, Policy Gradient Methods | —Unverified | 0 | 0 |
| Deterministic Policy Gradient Primal-Dual Methods for Continuous-Space Constrained MDPs | Aug 19, 2024 | continuous-control, Continuous Control | —Unverified | 0 | 0 |
| Asynchronous Actor-Critic for Multi-Agent Reinforcement Learning | Sep 20, 2022 | Decision Making, Multi-agent Reinforcement Learning | —Unverified | 0 | 0 |
| Deep Reinforcement Learning based Blind mmWave MIMO Beam Alignment | Jan 25, 2020 | Deep Reinforcement Learning, Policy Gradient Methods | —Unverified | 0 | 0 |
| A Study of Policy Gradient on a Class of Exactly Solvable Models | Nov 3, 2020 | Policy Gradient Methods | —Unverified | 0 | 0 |
| Improving Sample Efficiency and Multi-Agent Communication in RL-based Train Rescheduling | Apr 28, 2020 | Policy Gradient Methods, reinforcement-learning | —Unverified | 0 | 0 |