| Simple Policy Optimization | Jan 29, 2024 | MuJoCo | CodeCode Available | 2 |
| Episodic Reinforcement Learning with Expanded State-reward Space | Jan 19, 2024 | Autonomous DrivingDeep Reinforcement Learning | —Unverified | 0 |
| AgentMixer: Multi-Agent Correlated Policy Factorization | Jan 16, 2024 | Imitation LearningMuJoCo | —Unverified | 0 |
| Neural Population Learning beyond Symmetric Zero-sum Games | Jan 10, 2024 | MuJoCoTransfer Learning | —Unverified | 0 |
| An Invariant Information Geometric Method for High-Dimensional Online Optimization | Jan 3, 2024 | Bayesian OptimizationMuJoCo | CodeCode Available | 0 |
| Global Convergence of Natural Policy Gradient with Hessian-aided Momentum Variance Reduction | Jan 2, 2024 | MuJoCoPolicy Gradient Methods | —Unverified | 0 |
| Adaptive trajectory-constrained exploration strategy for deep reinforcement learning | Dec 27, 2023 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 0 |
| Efficient Reinforcement Learning via Decoupling Exploration and Utilization | Dec 26, 2023 | Autonomous VehiclesMuJoCo | CodeCode Available | 1 |
| XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library | Dec 25, 2023 | CPUDeep Reinforcement Learning | CodeCode Available | 3 |
| DexDLO: Learning Goal-Conditioned Dexterous Policy for Dynamic Manipulation of Deformable Linear Objects | Dec 23, 2023 | MuJoCoPosition | —Unverified | 0 |