SOTAVerified

MuJoCo

Papers

Showing 501550 of 677 papers

TitleStatusHype
Adversarial Imitation Learning via Random Search0
Imitation Learning with Sinkhorn DistancesCode1
Forward and inverse reinforcement learning sharing network weights and hyperparameters0
Overcoming Model Bias for Robust Offline Deep Reinforcement Learning0
Contrastive Variational Reinforcement Learning for Complex ObservationsCode1
Follow the Object: Curriculum Learning for Manipulation Tasks with Imagined Goals0
Robust Deep Reinforcement Learning through Adversarial LossCode1
Weak Human Preference Supervision For Deep Reinforcement LearningCode0
Nengo and low-power AI hardware for robust, embedded neuroroboticsCode1
Learning to Play Cup-and-Ball with Noisy Camera ObservationsCode0
CoNES: Convex Natural Evolutionary Strategies0
Inverse Reinforcement Learning from a Gradient-based Learner0
An Equivalence between Loss Functions and Non-Uniform Sampling in Experience ReplayCode1
Fast Adaptation via Policy-Dynamics Value FunctionsCode1
Meta-SAC: Auto-tune the Entropy Temperature of Soft Actor-Critic via MetagradientCode1
Regularly Updated Deterministic Policy Gradient Algorithm0
DDPG++: Striving for Simplicity in Continuous-control Off-Policy Reinforcement Learning0
SOAC: The Soft Option Actor-Critic Architecture0
ELSIM: End-to-end learning of reusable skills through intrinsic motivation0
dm_control: Software and Tasks for Continuous Control0
Learning Invariant Representations for Reinforcement Learning without ReconstructionCode1
Converting Biomechanical Models from OpenSim to MuJoCoCode1
MetaCURE: Meta Reinforcement Learning with Empowerment-Driven ExplorationCode1
Non-local Policy Optimization via Diversity-regularized Collaborative Exploration0
Continuous Control for Searching and Planning with a Learned Model0
Decorrelated Double Q-learning0
From proprioception to long-horizon planning in novel environments: A hierarchical RL model0
Primal Wasserstein Imitation LearningCode0
Wasserstein Distance guided Adversarial Imitation Learning with Reward Shape ExplorationCode1
Cross-Domain Imitation Learning with a Dual Structure0
Gradient Monitored Reinforcement Learning0
Novel Policy Seeking with Constrained OptimizationCode0
Stealthy and Efficient Adversarial Attacks against Deep Reinforcement Learning0
Delay-Aware Model-Based Reinforcement Learning for Continuous ControlCode1
Toward Evaluating Robustness of Deep Reinforcement Learning with Continuous Control0
How to Learn a Useful Critic? Model-based Action-Gradient-Estimator Policy OptimizationCode1
Evolutionary Stochastic Policy DistillationCode0
Mean-Variance Policy Iteration for Risk-Averse Reinforcement Learning0
FACMAC: Factored Multi-Agent Centralised Policy GradientsCode1
Relevance-Guided Modeling of Object Dynamics for Reinforcement Learning0
Gaussian Process Policy Optimization0
State-only Imitation with Transition Dynamics MismatchCode1
Robust Reinforcement Learning via Adversarial training with Langevin DynamicsCode0
Generalized Hidden Parameter MDPs Transferable Model-based RL in a Handful of Trials0
Multi-task Reinforcement Learning with a Planning Quasi-Metric0
Temporal-adaptive Hierarchical Reinforcement Learning0
Periodic Intra-Ensemble Knowledge Distillation for Reinforcement LearningCode0
Lyceum: An efficient and scalable ecosystem for robot learning0
Effects of sparse rewards of different magnitudes in the speed of learning of model-based actor critic methods0
SEERL: Sample Efficient Ensemble Reinforcement Learning0
Show:102550
← PrevPage 11 of 14Next →

No leaderboard results yet.