| FastTD3: Simple, Fast, and Capable Reinforcement Learning for Humanoid Control | May 28, 2025 | GPUHumanoid Control | —Unverified | 0 |
| Collision- and Reachability-Aware Multi-Robot Control with Grounded LLM Planners | May 26, 2025 | MuJoCovalid | —Unverified | 0 |
| Surrogate-Assisted Evolutionary Reinforcement Learning Based on Autoencoder and Hyperbolic Neural Network | May 26, 2025 | Evolutionary AlgorithmsMuJoCo | —Unverified | 0 |
| Reinforcement Learning for Ballbot Navigation in Uneven Terrain | May 23, 2025 | MuJoCoreinforcement-learning | CodeCode Available | 1 |
| LLM-Explorer: A Plug-in Reinforcement Learning Policy Exploration Enhancement Driven by Large Language Models | May 21, 2025 | MuJoCoReinforcement Learning (RL) | —Unverified | 0 |
| Policy-Driven World Model Adaptation for Robust Offline Model-based Reinforcement Learning | May 19, 2025 | D4RLmodel | —Unverified | 0 |
| Cache-Efficient Posterior Sampling for Reinforcement Learning with LLM-Derived Priors Across Discrete and Continuous Domains | May 12, 2025 | continuous-controlContinuous Control | —Unverified | 0 |
| Offline Multi-agent Reinforcement Learning via Score Decomposition | May 9, 2025 | continuous-controlContinuous Control | —Unverified | 0 |
| Model Tensor Planning | May 2, 2025 | modelModel Predictive Control | CodeCode Available | 1 |
| Directly Forecasting Belief for Reinforcement Learning with Delays | May 1, 2025 | D4RLMuJoCo | CodeCode Available | 0 |