| Natural Actor-Critic for Robust Reinforcement Learning with Function Approximation | Jul 17, 2023 | MuJoCo, reinforcement-learning | Code Available | 1 |
| Beyond Conservatism: Diffusion Policies in Offline Multi-agent Reinforcement Learning | Jul 4, 2023 | Data Augmentation, Diversity | Unverified | 0 |
| Learning non-Markovian Decision-Making from State-only Sequences | Jun 27, 2023 | Decision Making, Imitation Learning | Code Available | 0 |
| CEIL: Generalized Contextual Imitation Learning | Jun 26, 2023 | D4RL, Imitation Learning | Unverified | 0 |
| Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot Policy Imitation | Jun 23, 2023 | Few-Shot Image Classification, Few-Shot Imitation Learning | Code Available | 0 |
| Adaptive Ensemble Q-learning: Minimizing Estimation Bias via Error Feedback | Jun 20, 2023 | MuJoCo, Q-Learning | Unverified | 0 |
| Surfer: Progressive Reasoning with World Models for Robotic Manipulation | Jun 20, 2023 | Decision Making, MuJoCo | Unverified | 0 |
| Evolutionary Strategy Guided Reinforcement Learning via MultiBuffer Communication | Jun 20, 2023 | Deep Reinforcement Learning, Evolutionary Algorithms | Unverified | 0 |
| AdaStop: adaptive statistical testing for sound comparisons of Deep RL agents | Jun 19, 2023 | Deep Reinforcement Learning, MuJoCo | Code Available | 0 |
| Maximum Entropy Heterogeneous-Agent Reinforcement Learning | Jun 19, 2023 | MuJoCo, Multi-agent Reinforcement Learning | Code Available | 2 |