| Simple random search provides a competitive approach to reinforcement learning | Mar 19, 2018 | Computational Efficiency, continuous-control | Code Available | 1 |
| DeepMind Control Suite | Jan 2, 2018 | continuous-control, Continuous Control | Code Available | 1 |
| Learnings Options End-to-End for Continuous Action Tasks | Nov 30, 2017 | MuJoCo | Code Available | 1 |
| Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation | Aug 17, 2017 | Atari Games, continuous-control | Code Available | 1 |
| DART: Noise Injection for Robust Imitation Learning | Mar 27, 2017 | Imitation Learning, MuJoCo | Code Available | 1 |
| Evolution Strategies as a Scalable Alternative to Reinforcement Learning | Mar 10, 2017 | Atari Games, MuJoCo | Code Available | 1 |
| Aligning Humans and Robots via Reinforcement Learning from Implicit Human Feedback | Jul 17, 2025 | EEG, MuJoCo | Unverified | 0 |
| Turning Sand to Gold: Recycling Data to Bridge On-Policy and Off-Policy Learning via Causal Bound | Jul 15, 2025 | counterfactual, Decision Making | Unverified | 0 |
| Safe Domain Randomization via Uncertainty-Aware Out-of-Distribution Detection and Policy Adaptation | Jul 8, 2025 | MuJoCo, Out-of-Distribution Detection | Unverified | 0 |
| Detecting and Mitigating Reward Hacking in Reinforcement Learning Systems: A Comprehensive Empirical Study | Jul 8, 2025 | MuJoCo, Recommendation Systems | Unverified | 0 |
| Generalized Adaptive Transfer Network: Enhancing Transfer Learning in Reinforcement Learning Across Domains | Jul 2, 2025 | Atari Games, Chatbot | Code Available | 0 |
| rQdia: Regularizing Q-Value Distributions With Image Augmentation | Jun 26, 2025 | continuous-control, Continuous Control | Unverified | 0 |
| Beyond-Expert Performance with Limited Demonstrations: Efficient Imitation Learning with Double Exploration | Jun 25, 2025 | Imitation Learning, MuJoCo | Unverified | 0 |
| ADDQ: Adaptive Distributional Double Q-Learning | Jun 24, 2025 | Distributional Reinforcement Learning, MuJoCo | Code Available | 0 |
| Learning Task Belief Similarity with Latent Dynamics for Meta-Reinforcement Learning | Jun 24, 2025 | Meta Reinforcement Learning, MuJoCo | Code Available | 0 |
| Hard Contacts with Soft Gradients: Refining Differentiable Simulators for Learning and Control | Jun 17, 2025 | MuJoCo | Unverified | 0 |
| The Courage to Stop: Overcoming Sunk Cost Fallacy in Deep Reinforcement Learning | Jun 16, 2025 | Deep Reinforcement Learning, MuJoCo | Unverified | 0 |
| Wasserstein Barycenter Soft Actor-Critic | Jun 11, 2025 | continuous-control, Continuous Control | Unverified | 0 |
| Modular Recurrence in Contextual MDPs for Universal Morphology Control | Jun 10, 2025 | Deep Reinforcement Learning, MuJoCo | Unverified | 0 |
| MOBODY: Model Based Off-Dynamics Offline Reinforcement Learning | Jun 10, 2025 | Data Augmentation, model | Code Available | 0 |
| Accelerating Diffusion Models in Offline RL via Reward-Aware Consistency Trajectory Distillation | Jun 9, 2025 | Decision Making, MuJoCo | Unverified | 0 |
| LLMs for sensory-motor control: Combining in-context and iterative learning | Jun 5, 2025 | MuJoCo | Code Available | 0 |
| Measure gradients, not activations! Enhancing neuronal activity in deep reinforcement learning | May 29, 2025 | Deep Reinforcement Learning, MuJoCo | Unverified | 0 |
| Enhanced DACER Algorithm with High Diffusion Efficiency | May 29, 2025 | Denoising, Imitation Learning | Unverified | 0 |
| ADG: Ambient Diffusion-Guided Dataset Recovery for Corruption-Robust Offline Reinforcement Learning | May 29, 2025 | Denoising, MuJoCo | Unverified | 0 |