| Entropy Regularized Task Representation Learning for Offline Meta-Reinforcement Learning | Dec 19, 2024 | Meta Reinforcement Learning, MuJoCo | Code Available | 0 |
| Hierarchical Subspaces of Policies for Continual Offline Reinforcement Learning | Dec 19, 2024 | Continual Learning, MuJoCo | Unverified | 0 |
| SMOSE: Sparse Mixture of Shallow Experts for Interpretable Reinforcement Learning in Continuous Control Tasks | Dec 17, 2024 | continuous-control, Continuous Control | Code Available | 0 |
| Active Reinforcement Learning Strategies for Offline Policy Improvement | Dec 17, 2024 | Active Learning, continuous-control | Unverified | 0 |
| RAT: Adversarial Attacks on Deep Reinforcement Agents for Targeted Behaviors | Dec 14, 2024 | Adversarial Attack, Deep Reinforcement Learning | Unverified | 0 |
| Inverse Delayed Reinforcement Learning | Dec 4, 2024 | MuJoCo, reinforcement-learning | Unverified | 0 |
| Hierarchical Prompt Decision Transformer: Improving Few-Shot Policy Generalization with Global and Adaptive Guidance | Dec 1, 2024 | MuJoCo | Unverified | 0 |
| Fast Convergence of Softmax Policy Mirror Ascent | Nov 18, 2024 | MuJoCo | Unverified | 0 |
| Multi-Objective Algorithms for Learning Open-Ended Robotic Problems | Nov 11, 2024 | Diversity, Evolutionary Algorithms | Unverified | 0 |
| Imitation from Diverse Behaviors: Wasserstein Quality Diversity Imitation Learning with Single-Step Archive Exploration | Nov 11, 2024 | continuous-control, Continuous Control | Unverified | 0 |