| A novel DDPG method with prioritized experience replay | Oct 1, 2017 | continuous-control, Continuous Control | Code Available | 0 | 5 |
| MOBODY: Model Based Off-Dynamics Offline Reinforcement Learning | Jun 10, 2025 | Data Augmentation, model | Code Available | 0 | 5 |
| Out-of-Dynamics Imitation Learning from Multimodal Demonstrations | Nov 13, 2022 | Imitation Learning, MuJoCo | Code Available | 0 | 5 |
| Pontryagin Optimal Control via Neural Networks | Dec 30, 2022 | Model-based Reinforcement Learning, MuJoCo | Code Available | 0 | 5 |
| Calibrated Model-Based Deep Reinforcement Learning | Jun 19, 2019 | Deep Reinforcement Learning, model | Code Available | 0 | 5 |
| On Rollouts in Model-Based Reinforcement Learning | Jan 28, 2025 | model, Model-based Reinforcement Learning | Code Available | 0 | 5 |
| Off-Policy Average Reward Actor-Critic with Deterministic Policy Search | May 20, 2023 | MuJoCo | Code Available | 0 | 5 |
| An Invariant Information Geometric Method for High-Dimensional Online Optimization | Jan 3, 2024 | Bayesian Optimization, MuJoCo | Code Available | 0 | 5 |
| On Learning Intrinsic Rewards for Policy Gradient Methods | Apr 17, 2018 | Atari Games, Decision Making | Code Available | 0 | 5 |
| On the Design of Safe Continual RL Methods for Control of Nonlinear Systems | Feb 21, 2025 | Continual Learning, MuJoCo | Code Available | 0 | 5 |
| Bootstrapping the Expressivity with Model-based Planning | Sep 25, 2019 | model, MuJoCo | Code Available | 0 | 5 |
| A dynamical clipping approach with task feedback for Proximal Policy Optimization | Dec 12, 2023 | Language Modelling, Large Language Model | Code Available | 0 | 5 |
| On the Expressivity of Neural Networks for Deep Reinforcement Learning | Oct 14, 2019 | Deep Reinforcement Learning, MuJoCo | Code Available | 0 | 5 |
| Offline Reinforcement Learning via Inverse Optimization | Feb 27, 2025 | Model Predictive Control, MuJoCo | Code Available | 0 | 5 |
| On the Effect of Pre-training for Transformer in Different Modality on Offline Reinforcement Learning | Nov 17, 2022 | MuJoCo | Code Available | 0 | 5 |
| Adaptive Exploration for Data-Efficient General Value Function Evaluations | May 13, 2024 | MuJoCo | Code Available | 0 | 5 |
| Novel Policy Seeking with Constrained Optimization | May 21, 2020 | Diversity, MuJoCo | Code Available | 0 | 5 |
| BiERL: A Meta Evolutionary Reinforcement Learning Framework via Bilevel Optimization | Aug 1, 2023 | Bilevel Optimization, Diversity | Code Available | 0 | 5 |
| An Empirical Study of Deep Reinforcement Learning in Continuing Tasks | Jan 12, 2025 | Deep Reinforcement Learning, MuJoCo | Code Available | 0 | 5 |
| Neural Network Dynamics for Model-Based Deep Reinforcement Learning with Model-Free Fine-Tuning | Aug 8, 2017 | Deep Reinforcement Learning, model | Code Available | 0 | 5 |
| No Need for Interactions: Robust Model-Based Imitation Learning using Neural ODE | Apr 3, 2021 | Imitation Learning, MuJoCo | Code Available | 0 | 5 |
| Beyond Worst-case Attacks: Robust RL with Adaptive Defense via Non-dominated Policies | Feb 20, 2024 | Adversarial Attack, MuJoCo | Code Available | 0 | 5 |
| DNS: Determinantal Point Process Based Neural Network Sampler for Ensemble Reinforcement Learning | Jan 31, 2022 | continuous-control, Continuous Control | Code Available | 0 | 5 |
| dm_control: Software and Tasks for Continuous Control | Jun 22, 2020 | continuous-control, Continuous Control | Code Available | 0 | 5 |
| MuJoCo: A physics engine for model-based control | Oct 7, 2012 | model, MuJoCo | Code Available | 0 | 5 |
| NerveNet: Learning Structured Policy with Graph Neural Networks | Jan 1, 2018 | Benchmarking, continuous-control | Code Available | 0 | 5 |
| On the Perturbed States for Transformed Input-robust Reinforcement Learning | Jul 31, 2024 | Denoising, MuJoCo | Code Available | 0 | 5 |
| Merging Decision Transformers: Weight Averaging for Forming Multi-Task Policies | Mar 14, 2023 | Decision Making, MuJoCo | Code Available | 0 | 5 |
| MDP Playground: An Analysis and Debug Testbed for Reinforcement Learning | Sep 17, 2019 | MuJoCo, OpenAI Gym | Code Available | 0 | 5 |
| Mildly Constrained Evaluation Policy for Offline Reinforcement Learning | Jun 6, 2023 | D4RL, MuJoCo | Code Available | 0 | 5 |
| Lyapunov-based Safe Policy Optimization for Continuous Control | Jan 28, 2019 | continuous-control, Continuous Control | Code Available | 0 | 5 |
| Bayesian Policy Gradients via Alpha Divergence Dropout Inference | Dec 6, 2017 | continuous-control, Continuous Control | Code Available | 0 | 5 |
| LLMs for sensory-motor control: Combining in-context and iterative learning | Jun 5, 2025 | MuJoCo | Code Available | 0 | 5 |
| Live in the Moment: Learning Dynamics Model Adapted to Evolving Policy | Jul 25, 2022 | continuous-control, Continuous Control | Code Available | 0 | 5 |
| Online Reinforcement Learning in Non-Stationary Context-Driven Environments | Feb 4, 2023 | MuJoCo, reinforcement-learning | Code Available | 0 | 5 |
| Bayes Adaptive Monte Carlo Tree Search for Offline Model-based Reinforcement Learning | Oct 15, 2024 | D4RL, Model-based Reinforcement Learning | Code Available | 0 | 5 |
| Directly Forecasting Belief for Reinforcement Learning with Delays | May 1, 2025 | D4RL, MuJoCo | Code Available | 0 | 5 |
| Fuzzy Tiling Activations: A Simple Approach to Learning Sparse Representations Online | Nov 19, 2019 | Continual Learning, continuous-control | Code Available | 0 | 5 |
| Leveraging exploration in off-policy algorithms via normalizing flows | May 16, 2019 | continuous-control, Continuous Control | Code Available | 0 | 5 |
| Balancing Value Underestimation and Overestimation with Realistic Actor-Critic | Oct 19, 2021 | continuous-control, Continuous Control | Code Available | 0 | 5 |
| LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios | Oct 12, 2023 | Board Games, Decision Making | Code Available | 0 | 5 |
| Locally Persistent Exploration in Continuous Control Tasks with Sparse Rewards | Dec 26, 2020 | continuous-control, Continuous Control | Code Available | 0 | 5 |
| Learning Powerful Policies by Using Consistent Dynamics Model | Jun 11, 2019 | Atari Games, model | Code Available | 0 | 5 |
| Learning Task Belief Similarity with Latent Dynamics for Meta-Reinforcement Learning | Jun 24, 2025 | Meta Reinforcement Learning, MuJoCo | Code Available | 0 | 5 |
| Decision Transformer under Random Frame Dropping | Mar 3, 2023 | Deep Reinforcement Learning, MuJoCo | Code Available | 0 | 5 |
| Debiased Offline Representation Learning for Fast Online Adaptation in Non-stationary Dynamics | Feb 17, 2024 | MuJoCo, Representation Learning | Code Available | 0 | 5 |
| BAIL: Best-Action Imitation Learning for Batch Deep Reinforcement Learning | Oct 27, 2019 | Deep Reinforcement Learning, Imitation Learning | Code Available | 0 | 5 |
| Learning non-Markovian Decision-Making from State-only Sequences | Jun 27, 2023 | Decision Making, Imitation Learning | Code Available | 0 | 5 |
| Learning to Play Cup-and-Ball with Noisy Camera Observations | Jul 19, 2020 | MuJoCo | Code Available | 0 | 5 |
| A Generalized Training Approach for Multiagent Learning | Sep 27, 2019 | MuJoCo | Code Available | 0 | 5 |