| Variational OOD State Correction for Offline Reinforcement Learning | May 1, 2025 | Decision MakingMuJoCo | —Unverified | 0 |
| Text-to-Decision Agent: Learning Generalist Policies from Natural Language Supervision | Apr 21, 2025 | MuJoCoZero-shot Generalization | —Unverified | 0 |
| Learning Transferable Friction Models and LuGre Identification via Physics Informed Neural Networks | Apr 16, 2025 | Computational EfficiencyFriction | —Unverified | 0 |
| Adapting World Models with Latent-State Dynamics Residuals | Apr 3, 2025 | MuJoCoReinforcement Learning (RL) | —Unverified | 0 |
| Beyond Non-Expert Demonstrations: Outcome-Driven Action Constraint for Offline Reinforcement Learning | Apr 2, 2025 | MuJoCoUncertainty Quantification | —Unverified | 0 |
| Handling Delay in Real-Time Reinforcement Learning | Mar 30, 2025 | MuJoCoreinforcement-learning | CodeCode Available | 0 |
| Learning Generalizable Skills from Offline Multi-Task Data for Multi-Agent Cooperation | Mar 27, 2025 | MuJoCoSMAC | CodeCode Available | 0 |
| Adventurer: Exploration with BiGAN for Deep Reinforcement Learning | Mar 24, 2025 | Atari GamesDeep Reinforcement Learning | —Unverified | 0 |
| CAE: Repurposing the Critic as an Explorer in Deep Reinforcement Learning | Mar 23, 2025 | Deep Reinforcement LearningEfficient Exploration | —Unverified | 0 |
| Likelihood Reward Redistribution | Mar 20, 2025 | MuJoCo | —Unverified | 0 |