| Markov flow policy -- deep MC | May 1, 2024 | MuJoCo | —Unverified | 0 |
| No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPO | May 1, 2024 | MuJoCoReinforcement Learning (RL) | CodeCode Available | 1 |
| UCB-driven Utility Function Search for Multi-objective Reinforcement Learning | May 1, 2024 | Decision MakingMuJoCo | CodeCode Available | 1 |
| Closed Loop Interactive Embodied Reasoning for Robot Manipulation | Apr 23, 2024 | MuJoCoRobot Manipulation | —Unverified | 0 |
| Asynchronous Federated Reinforcement Learning with Policy Gradient Updates: Algorithm Design and Convergence Analysis | Apr 9, 2024 | MuJoCoReinforcement Learning (RL) | —Unverified | 0 |
| Humanoid-Gym: Reinforcement Learning for Humanoid Robot with Zero-Shot Sim2Real Transfer | Apr 8, 2024 | MuJoCoPhysical Simulations | CodeCode Available | 5 |
| DIDA: Denoised Imitation Learning based on Domain Adaptation | Apr 4, 2024 | Domain AdaptationImitation Learning | —Unverified | 0 |
| Active Learning of Dynamics Using Prior Domain Knowledge in the Sampling Process | Mar 25, 2024 | Active LearningMuJoCo | —Unverified | 0 |
| Robust Model Based Reinforcement Learning Using L_1 Adaptive Control | Mar 21, 2024 | Model-based Reinforcement LearningMuJoCo | —Unverified | 0 |
| A Simple Mixture Policy Parameterization for Improving Sample Efficiency of CVaR Optimization | Mar 17, 2024 | MuJoCo | —Unverified | 0 |