SOTAVerified

MuJoCo

Papers

Showing 101125 of 677 papers

TitleStatusHype
Reset-Free Lifelong Learning with Skill-Space PlanningCode1
Generalized Decision Transformer for Offline Hindsight Information MatchingCode1
Boosting Soft Actor-Critic: Emphasizing Recent Experience without Forgetting the PastCode1
An Equivalence between Loss Functions and Non-Uniform Sampling in Experience ReplayCode1
Robust Deep Reinforcement Learning through Adversarial LossCode1
ARLO: A Framework for Automated Reinforcement LearningCode1
How to Learn a Useful Critic? Model-based Action-Gradient-Estimator Policy OptimizationCode1
A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor RepresentationCode1
An Open-Source Multi-Goal Reinforcement Learning Environment for Robotic Manipulation with PybulletCode1
Scalable trust-region method for deep reinforcement learning using Kronecker-factored approximationCode1
FACMAC: Factored Multi-Agent Centralised Policy GradientsCode1
Conditioning Sparse Variational Gaussian Processes for Online Decision-makingCode1
An Real-Sim-Real (RSR) Loop Framework for Generalizable Robotic Policy Transfer with Differentiable SimulationCode1
Imitation Learning with Sinkhorn DistancesCode1
Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation ErrorsCode1
Contrastive Variational Reinforcement Learning for Complex ObservationsCode1
Joint action loss for proximal policy optimizationCode1
Skill-aware Mutual Information Optimisation for Generalisation in Reinforcement LearningCode1
Learning Successor Features the Simple WayCode1
Maximum Entropy Reinforcement Learning with Diffusion PolicyCode1
Model Tensor PlanningCode1
Latent Plan Transformer for Trajectory Abstraction: Planning as Latent Space InferenceCode1
Learning Invariant Representations for Reinforcement Learning without ReconstructionCode1
Trust Region Policy Optimisation in Multi-Agent Reinforcement LearningCode1
Practical Probabilistic Model-based Deep Reinforcement Learning by Integrating Dropout Uncertainty and Trajectory SamplingCode1
Show:102550
← PrevPage 5 of 28Next →

No leaderboard results yet.