| Periodic Intra-Ensemble Knowledge Distillation for Reinforcement Learning | Feb 1, 2020 | Knowledge Distillation, MuJoCo | Code Available | 0 |
| A novel DDPG method with prioritized experience replay | Oct 1, 2017 | continuous-control, Continuous Control | Code Available | 0 |
| Client Selection for Federated Policy Optimization with Environment Heterogeneity | May 18, 2023 | MuJoCo, Policy Gradient Methods | Code Available | 0 |
| ADDQ: Adaptive Distributional Double Q-Learning | Jun 24, 2025 | Distributional Reinforcement Learning, MuJoCo | Code Available | 0 |
| PMAT: Optimizing Action Generation Order in Multi-Agent Reinforcement Learning | Feb 23, 2025 | Action Generation, Decision Making | Code Available | 0 |
| MOBODY: Model Based Off-Dynamics Offline Reinforcement Learning | Jun 10, 2025 | Data Augmentation, model | Code Available | 0 |
| Sequence Modeling of Temporal Credit Assignment for Episodic Reinforcement Learning | May 31, 2019 | Deep Reinforcement Learning, MuJoCo | Code Available | 0 |
| Unifying Variational Inference and PAC-Bayes for Supervised Learning that Scales | Oct 23, 2019 | MuJoCo, Variational Inference | Code Available | 0 |
| CGAR: Critic Guided Action Redistribution in Reinforcement Leaning | Jun 23, 2022 | MuJoCo, Reinforcement Learning (RL) | Code Available | 0 |
| CEM-GD: Cross-Entropy Method with Gradient Descent Planner for Model-Based Reinforcement Learning | Dec 14, 2021 | continuous-control, Continuous Control | Code Available | 0 |