| Merging Decision Transformers: Weight Averaging for Forming Multi-Task Policies | Mar 14, 2023 | Decision Making, MuJoCo | Code Available | 0 |
| Bayes Adaptive Monte Carlo Tree Search for Offline Model-based Reinforcement Learning | Oct 15, 2024 | D4RL, Model-based Reinforcement Learning | Code Available | 0 |
| Variance Penalized On-Policy and Off-Policy Actor-Critic | Feb 3, 2021 | MuJoCo | Code Available | 0 |
| Learning Goal Embeddings via Self-Play for Hierarchical Reinforcement Learning | Nov 22, 2018 | Hierarchical Reinforcement Learning, MuJoCo | Code Available | 0 |
| RIZE: Regularized Imitation Learning via Distributional Reinforcement Learning | Feb 27, 2025 | Distributional Reinforcement Learning, Imitation Learning | Code Available | 0 |
| Structured Control Nets for Deep Reinforcement Learning | Feb 22, 2018 | Decision Making, Deep Reinforcement Learning | Code Available | 0 |
| Cyclic Policy Distillation: Sample-Efficient Sim-to-Real Reinforcement Learning with Domain Randomization | Jul 29, 2022 | Deep Reinforcement Learning, MuJoCo | Code Available | 0 |
| Learning Generalizable Skills from Offline Multi-Task Data for Multi-Agent Cooperation | Mar 27, 2025 | MuJoCo, SMAC | Code Available | 0 |
| Mildly Constrained Evaluation Policy for Offline Reinforcement Learning | Jun 6, 2023 | D4RL, MuJoCo | Code Available | 0 |
| SUPERVISED POLICY UPDATE | May 1, 2019 | Deep Reinforcement Learning, MuJoCo | Code Available | 0 |
| Learning Calibratable Policies using Programmatic Style-Consistency | Oct 2, 2019 | Imitation Learning, MuJoCo | Code Available | 0 |
| Language as an Abstraction for Hierarchical Deep Reinforcement Learning | Jun 18, 2019 | Deep Reinforcement Learning, Instruction Following | Code Available | 0 |
| Supervised Policy Update for Deep Reinforcement Learning | May 29, 2018 | Deep Reinforcement Learning, MuJoCo | Code Available | 0 |
| Controlled Diversity with Preference : Towards Learning a Diverse Set of Desired Skills | Mar 7, 2023 | Diversity, MuJoCo | Code Available | 0 |
| Is Mamba Compatible with Trajectory Optimization in Offline Reinforcement Learning? | May 20, 2024 | Atari Games, Mamba | Code Available | 0 |
| Continuous Transition: Improving Sample Efficiency for Continuous Control Problems via MixUp | Nov 30, 2020 | continuous-control, Continuous Control | Code Available | 0 |
| Imitation Learning from Purified Demonstrations | Oct 11, 2023 | Decision Making, Imitation Learning | Code Available | 0 |
| Imitation Learning from Observations under Transition Model Disparity | Apr 25, 2022 | Imitation Learning, model | Code Available | 0 |
| Robust Deep Reinforcement Learning with Adaptive Adversarial Perturbations in Action Space | May 20, 2024 | Deep Reinforcement Learning, MuJoCo | Code Available | 0 |
| MuJoCo: A physics engine for model-based control | Oct 7, 2012 | model, MuJoCo | Code Available | 0 |
| Weak Human Preference Supervision For Deep Reinforcement Learning | Jul 25, 2020 | Deep Reinforcement Learning, MuJoCo | Code Available | 0 |
| Human-guided Robot Behavior Learning: A GAN-assisted Preference-based Reinforcement Learning Approach | Oct 15, 2020 | Generative Adversarial Network, MuJoCo | Code Available | 0 |
| Hierarchical Reinforcement Learning with Advantage-Based Auxiliary Rewards | Oct 10, 2019 | Hierarchical Reinforcement Learning, MuJoCo | Code Available | 0 |
| Heterogeneous Multi-Agent Reinforcement Learning via Mirror Descent Policy Optimization | Aug 13, 2023 | LEMMA, MuJoCo | Code Available | 0 |
| Adaptive Exploration for Data-Efficient General Value Function Evaluations | May 13, 2024 | MuJoCo | Code Available | 0 |