| Energy-Guided Diffusion Sampling for Offline-to-Online Reinforcement Learning | Jul 17, 2024 | MuJoCoreinforcement-learning | CodeCode Available | 1 |
| A Game-Theoretic Approach to Multi-Agent Trust Region Optimization | Jun 12, 2021 | Atari GamesMuJoCo | CodeCode Available | 1 |
| DeepMind Control Suite | Jan 2, 2018 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Cross-Modal Domain Adaptation for Reinforcement Learning | Jan 1, 2021 | Domain AdaptationMuJoCo | CodeCode Available | 1 |
| Generalizable Episodic Memory for Deep Reinforcement Learning | Mar 11, 2021 | Atari Gamescontinuous-control | CodeCode Available | 1 |
| How to Learn a Useful Critic? Model-based Action-Gradient-Estimator Policy Optimization | Apr 29, 2020 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Converting Biomechanical Models from OpenSim to MuJoCo | Jun 17, 2020 | MuJoCoreinforcement-learning | CodeCode Available | 1 |
| Improving Sample Efficiency in Model-Free Reinforcement Learning from Images | Oct 2, 2019 | Image ReconstructionMuJoCo | CodeCode Available | 1 |
| Balanced Neural ODEs: nonlinear model order reduction and Koopman operator approximations | Oct 14, 2024 | Dimensionality ReductionMuJoCo | CodeCode Available | 1 |
| Cross-modal Domain Adaptation for Cost-Efficient Visual Reinforcement Learning | Dec 1, 2021 | Domain AdaptationMuJoCo | CodeCode Available | 1 |
| FACMAC: Factored Multi-Agent Centralised Policy Gradients | Mar 14, 2020 | MuJoCoMulti-agent Reinforcement Learning | CodeCode Available | 1 |
| Learning Invariant Representations for Reinforcement Learning without Reconstruction | Jun 18, 2020 | Causal InferenceMuJoCo | CodeCode Available | 1 |
| Learnings Options End-to-End for Continuous Action Tasks | Nov 30, 2017 | MuJoCo | CodeCode Available | 1 |
| MetaCURE: Meta Reinforcement Learning with Empowerment-Driven Exploration | Jun 15, 2020 | Efficient ExplorationMeta Reinforcement Learning | CodeCode Available | 1 |
| Lipschitz-constrained Unsupervised Skill Discovery | Feb 2, 2022 | DiversityMuJoCo | CodeCode Available | 1 |
| ARLO: A Framework for Automated Reinforcement Learning | May 20, 2022 | feature selectionMuJoCo | CodeCode Available | 1 |
| Maximum Entropy Reinforcement Learning via Energy-Based Normalizing Flow | May 22, 2024 | IngenuityMuJoCo | CodeCode Available | 1 |
| Max-Min Off-Policy Actor-Critic Method Focusing on Worst-Case Robustness to Model Misspecification | Nov 7, 2022 | MuJoCo | CodeCode Available | 1 |
| A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation | Jun 12, 2021 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 1 |
| Conservative Offline Distributional Reinforcement Learning | Jul 12, 2021 | D4RLDistributional Reinforcement Learning | CodeCode Available | 1 |
| Monte Carlo Tree Search based Variable Selection for High Dimensional Bayesian Optimization | Oct 4, 2022 | Bayesian OptimizationMuJoCo | CodeCode Available | 1 |
| Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors | Jan 9, 2020 | continuous-controlContinuous Control | CodeCode Available | 1 |
| Multi-Agent Trust Region Learning | Jan 1, 2021 | Atari GamesMuJoCo | CodeCode Available | 1 |
| Natural Actor-Critic for Robust Reinforcement Learning with Function Approximation | Jul 17, 2023 | MuJoCoreinforcement-learning | CodeCode Available | 1 |
| Conditioning Sparse Variational Gaussian Processes for Online Decision-making | Oct 28, 2021 | Active LearningDecision Making | CodeCode Available | 1 |