| Learning non-Markovian Decision-Making from State-only Sequences | Jun 27, 2023 | Decision MakingImitation Learning | CodeCode Available | 0 | 5 |
| Live in the Moment: Learning Dynamics Model Adapted to Evolving Policy | Jul 25, 2022 | continuous-controlContinuous Control | CodeCode Available | 0 | 5 |
| Context-Based Soft Actor Critic for Environments with Non-stationary Dynamics | May 7, 2021 | continuous-controlContinuous Control | CodeCode Available | 0 | 5 |
| TEAC: Intergrating Trust Region and Max Entropy Actor Critic for Continuous Control | Jan 1, 2021 | continuous-controlContinuous Control | CodeCode Available | 0 | 5 |
| Imitating from auxiliary imperfect demonstrations via Adversarial Density Weighted Regression | May 28, 2024 | Imitation LearningMuJoCo | CodeCode Available | 0 | 5 |
| Learning Calibratable Policies using Programmatic Style-Consistency | Oct 2, 2019 | Imitation LearningMuJoCo | CodeCode Available | 0 | 5 |
| Constrained Intrinsic Motivation for Reinforcement Learning | Jul 12, 2024 | MuJoCoreinforcement-learning | CodeCode Available | 0 | 5 |
| Asynchronous Episodic Deep Deterministic Policy Gradient: Towards Continuous Control in Computationally Complex Environments | Mar 3, 2019 | continuous-controlContinuous Control | CodeCode Available | 0 | 5 |
| Learning Generalizable Skills from Offline Multi-Task Data for Multi-Agent Cooperation | Mar 27, 2025 | MuJoCoSMAC | CodeCode Available | 0 | 5 |
| Language as an Abstraction for Hierarchical Deep Reinforcement Learning | Jun 18, 2019 | Deep Reinforcement LearningInstruction Following | CodeCode Available | 0 | 5 |