| Application of linear regression method to the deep reinforcement learning in continuous action cases | Mar 19, 2025 | Deep Reinforcement Learning, MuJoCo | Code Available | 0 |
| Residual Policy Gradient: A Reward View of KL-regularized Objective | Mar 14, 2025 | Imitation Learning, MuJoCo | Unverified | 0 |
| An Real-Sim-Real (RSR) Loop Framework for Generalizable Robotic Policy Transfer with Differentiable Simulation | Mar 13, 2025 | MuJoCo | Code Available | 1 |
| AVG-DICE: Stationary Distribution Correction by Regression | Mar 3, 2025 | Avg, MuJoCo | Unverified | 0 |
| SrSv: Integrating Sequential Rollouts with Sequential Value Estimation for Multi-agent Reinforcement Learning | Mar 3, 2025 | MuJoCo, Multi-agent Reinforcement Learning | Unverified | 0 |
| IL-SOAR : Imitation Learning with Soft Optimistic Actor cRitic | Feb 27, 2025 | Imitation Learning, MuJoCo | Unverified | 0 |
| Offline Reinforcement Learning via Inverse Optimization | Feb 27, 2025 | Model Predictive Control, MuJoCo | Code Available | 0 |
| RIZE: Regularized Imitation Learning via Distributional Reinforcement Learning | Feb 27, 2025 | Distributional Reinforcement Learning, Imitation Learning | Code Available | 0 |
| Yes, Q-learning Helps Offline In-Context RL | Feb 24, 2025 | In-Context Reinforcement Learning, MuJoCo | Unverified | 0 |
| PMAT: Optimizing Action Generation Order in Multi-Agent Reinforcement Learning | Feb 23, 2025 | Action Generation, Decision Making | Code Available | 0 |