| Mimicking Better by Matching the Approximate Action Distribution | Jun 16, 2023 | Imitation LearningMuJoCo | CodeCode Available | 0 |
| Recurrent Action Transformer with Memory | Jun 15, 2023 | Atari GamesMuJoCo | CodeCode Available | 0 |
| Language to Rewards for Robotic Skill Synthesis | Jun 14, 2023 | In-Context LearningLogical Reasoning | —Unverified | 0 |
| Robust Reinforcement Learning through Efficient Adversarial Herding | Jun 12, 2023 | MuJoCoreinforcement-learning | —Unverified | 0 |
| Mildly Constrained Evaluation Policy for Offline Reinforcement Learning | Jun 6, 2023 | D4RLMuJoCo | CodeCode Available | 0 |
| ReLU to the Rescue: Improve Your On-Policy Actor-Critic with Positive Advantages | Jun 2, 2023 | Bayesian Inferencecontinuous-control | CodeCode Available | 0 |
| MetaDiffuser: Diffusion Model as Conditional Planner for Offline Meta-RL | May 31, 2023 | MuJoCoReinforcement Learning (RL) | —Unverified | 0 |
| A Model-Based Solution to the Offline Multi-Agent Reinforcement Learning Coordination Problem | May 26, 2023 | MuJoCoMulti-agent Reinforcement Learning | —Unverified | 0 |
| Inverse Reinforcement Learning with the Average Reward Criterion | May 24, 2023 | MuJoCoreinforcement-learning | —Unverified | 0 |
| OER: Offline Experience Replay for Continual Offline Reinforcement Learning | May 23, 2023 | Continual LearningMuJoCo | —Unverified | 0 |