SOTAVerified

MuJoCo

Papers

Showing 126150 of 677 papers

TitleStatusHype
Simple random search provides a competitive approach to reinforcement learningCode1
DeepMind Control SuiteCode1
Learnings Options End-to-End for Continuous Action TasksCode1
Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximationCode1
DART: Noise Injection for Robust Imitation LearningCode1
Evolution Strategies as a Scalable Alternative to Reinforcement LearningCode1
Aligning Humans and Robots via Reinforcement Learning from Implicit Human Feedback0
Turning Sand to Gold: Recycling Data to Bridge On-Policy and Off-Policy Learning via Causal Bound0
Safe Domain Randomization via Uncertainty-Aware Out-of-Distribution Detection and Policy Adaptation0
Detecting and Mitigating Reward Hacking in Reinforcement Learning Systems: A Comprehensive Empirical Study0
Generalized Adaptive Transfer Network: Enhancing Transfer Learning in Reinforcement Learning Across DomainsCode0
rQdia: Regularizing Q-Value Distributions With Image Augmentation0
Beyond-Expert Performance with Limited Demonstrations: Efficient Imitation Learning with Double Exploration0
ADDQ: Adaptive Distributional Double Q-LearningCode0
Learning Task Belief Similarity with Latent Dynamics for Meta-Reinforcement LearningCode0
Hard Contacts with Soft Gradients: Refining Differentiable Simulators for Learning and Control0
The Courage to Stop: Overcoming Sunk Cost Fallacy in Deep Reinforcement Learning0
Wasserstein Barycenter Soft Actor-Critic0
Modular Recurrence in Contextual MDPs for Universal Morphology Control0
MOBODY: Model Based Off-Dynamics Offline Reinforcement LearningCode0
Accelerating Diffusion Models in Offline RL via Reward-Aware Consistency Trajectory Distillation0
LLMs for sensory-motor control: Combining in-context and iterative learningCode0
Measure gradients, not activations! Enhancing neuronal activity in deep reinforcement learning0
Enhanced DACER Algorithm with High Diffusion Efficiency0
ADG: Ambient Diffusion-Guided Dataset Recovery for Corruption-Robust Offline Reinforcement Learning0
Show:102550
← PrevPage 6 of 28Next →

No leaderboard results yet.