| On the Expressivity of Neural Networks for Deep Reinforcement Learning | Oct 14, 2019 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 0 |
| Primal Wasserstein Imitation Learning | Jun 8, 2020 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Which Experiences Are Influential for Your Agent? Policy Iteration with Turn-over Dropout | Jan 26, 2023 | MuJoCoreinforcement-learning | CodeCode Available | 0 |
| Probabilistic Mixture-of-Experts for Efficient Deep Reinforcement Learning | Apr 19, 2021 | Deep Reinforcement LearningMixture-of-Experts | CodeCode Available | 0 |
| BiERL: A Meta Evolutionary Reinforcement Learning Framework via Bilevel Optimization | Aug 1, 2023 | Bilevel OptimizationDiversity | CodeCode Available | 0 |
| Proximal Policy Distillation | Jul 21, 2024 | continuous-controlContinuous Control | CodeCode Available | 0 |
| BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning | Oct 27, 2019 | Deep Reinforcement LearningImitation Learning | CodeCode Available | 0 |
| Back to Basics: Benchmarking Canonical Evolution Strategies for Playing Atari | Feb 24, 2018 | Atari GamesBenchmarking | CodeCode Available | 0 |
| Q-Prop: Sample-Efficient Policy Gradient with An Off-Policy Critic | Nov 7, 2016 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Time Discretization-Invariant Safe Action Repetition for Policy Gradient Methods | Nov 6, 2021 | MuJoCoPolicy Gradient Methods | CodeCode Available | 0 |
| SMOSE: Sparse Mixture of Shallow Experts for Interpretable Reinforcement Learning in Continuous Control Tasks | Dec 17, 2024 | continuous-controlContinuous Control | CodeCode Available | 0 |
| Q-Value Weighted Regression: Reinforcement Learning with Limited Data | Feb 12, 2021 | Atari Gamescontinuous-control | CodeCode Available | 0 |
| Snapshot Reinforcement Learning: Leveraging Prior Trajectories for Efficiency | Mar 1, 2024 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 0 |
| Episodic Curiosity through Reachability | Oct 4, 2018 | MuJoCoReinforcement Learning | CodeCode Available | 0 |
| An Empirical Study of Deep Reinforcement Learning in Continuing Tasks | Jan 12, 2025 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 0 |
| Entropy Regularized Task Representation Learning for Offline Meta-Reinforcement Learning | Dec 19, 2024 | Meta Reinforcement LearningMuJoCo | CodeCode Available | 0 |
| SOAP-RL: Sequential Option Advantage Propagation for Reinforcement Learning in POMDP Environments | Jul 26, 2024 | MuJoCo | CodeCode Available | 0 |
| Enhancing Online Reinforcement Learning with Meta-Learned Objective from Offline Data | Jan 13, 2025 | Imitation LearningMuJoCo | CodeCode Available | 0 |
| AdaStop: adaptive statistical testing for sound comparisons of Deep RL agents | Jun 19, 2023 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 0 |
| Recurrent Action Transformer with Memory | Jun 15, 2023 | Atari GamesMuJoCo | CodeCode Available | 0 |
| A general class of surrogate functions for stable and efficient reinforcement learning | Aug 12, 2021 | MuJoCoPolicy Gradient Methods | CodeCode Available | 0 |
| Regret Minimization Experience Replay in Off-Policy Reinforcement Learning | May 15, 2021 | MuJoCoreinforcement-learning | CodeCode Available | 0 |
| Regularized Anderson Acceleration for Off-Policy Deep Reinforcement Learning | Sep 7, 2019 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 0 |
| Adaptive trajectory-constrained exploration strategy for deep reinforcement learning | Dec 27, 2023 | Deep Reinforcement LearningMuJoCo | CodeCode Available | 0 |
| ToriLLE: Learning Environment for Hand-to-Hand Combat | Jul 26, 2018 | BIG-bench Machine LearningMuJoCo | CodeCode Available | 0 |