SOTAVerified

MuJoCo

Papers

Showing 101150 of 677 papers

TitleStatusHype
FORK: A Forward-Looking Actor For Model-Free Reinforcement LearningCode1
Revisiting Design Choices in Proximal Policy OptimizationCode1
Sample-Efficient Automated Deep Reinforcement LearningCode1
Imitation Learning with Sinkhorn DistancesCode1
Contrastive Variational Reinforcement Learning for Complex ObservationsCode1
Robust Deep Reinforcement Learning through Adversarial LossCode1
Nengo and low-power AI hardware for robust, embedded neuroroboticsCode1
An Equivalence between Loss Functions and Non-Uniform Sampling in Experience ReplayCode1
Fast Adaptation via Policy-Dynamics Value FunctionsCode1
Meta-SAC: Auto-tune the Entropy Temperature of Soft Actor-Critic via MetagradientCode1
Learning Invariant Representations for Reinforcement Learning without ReconstructionCode1
Converting Biomechanical Models from OpenSim to MuJoCoCode1
MetaCURE: Meta Reinforcement Learning with Empowerment-Driven ExplorationCode1
Wasserstein Distance guided Adversarial Imitation Learning with Reward Shape ExplorationCode1
Delay-Aware Model-Based Reinforcement Learning for Continuous ControlCode1
How to Learn a Useful Critic? Model-based Action-Gradient-Estimator Policy OptimizationCode1
FACMAC: Factored Multi-Agent Centralised Policy GradientsCode1
State-only Imitation with Transition Dynamics MismatchCode1
Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation ErrorsCode1
VariBAD: A Very Good Method for Bayes-Adaptive Deep RL via Meta-LearningCode1
Improving Sample Efficiency in Model-Free Reinforcement Learning from ImagesCode1
Self-Supervised Exploration via DisagreementCode1
Boosting Soft Actor-Critic: Emphasizing Recent Experience without Forgetting the PastCode1
SQIL: Imitation Learning via Reinforcement Learning with Sparse RewardsCode1
The StarCraft Multi-Agent ChallengeCode1
Simple random search provides a competitive approach to reinforcement learningCode1
DeepMind Control SuiteCode1
Learnings Options End-to-End for Continuous Action TasksCode1
Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximationCode1
DART: Noise Injection for Robust Imitation LearningCode1
Evolution Strategies as a Scalable Alternative to Reinforcement LearningCode1
Aligning Humans and Robots via Reinforcement Learning from Implicit Human Feedback0
Turning Sand to Gold: Recycling Data to Bridge On-Policy and Off-Policy Learning via Causal Bound0
Safe Domain Randomization via Uncertainty-Aware Out-of-Distribution Detection and Policy Adaptation0
Detecting and Mitigating Reward Hacking in Reinforcement Learning Systems: A Comprehensive Empirical Study0
Generalized Adaptive Transfer Network: Enhancing Transfer Learning in Reinforcement Learning Across DomainsCode0
rQdia: Regularizing Q-Value Distributions With Image Augmentation0
Beyond-Expert Performance with Limited Demonstrations: Efficient Imitation Learning with Double Exploration0
ADDQ: Adaptive Distributional Double Q-LearningCode0
Learning Task Belief Similarity with Latent Dynamics for Meta-Reinforcement LearningCode0
Hard Contacts with Soft Gradients: Refining Differentiable Simulators for Learning and Control0
The Courage to Stop: Overcoming Sunk Cost Fallacy in Deep Reinforcement Learning0
Wasserstein Barycenter Soft Actor-Critic0
Modular Recurrence in Contextual MDPs for Universal Morphology Control0
MOBODY: Model Based Off-Dynamics Offline Reinforcement LearningCode0
Accelerating Diffusion Models in Offline RL via Reward-Aware Consistency Trajectory Distillation0
LLMs for sensory-motor control: Combining in-context and iterative learningCode0
Measure gradients, not activations! Enhancing neuronal activity in deep reinforcement learning0
Enhanced DACER Algorithm with High Diffusion Efficiency0
ADG: Ambient Diffusion-Guided Dataset Recovery for Corruption-Robust Offline Reinforcement Learning0
Show:102550
← PrevPage 3 of 14Next →

No leaderboard results yet.