SOTAVerified

MuJoCo

Papers

Showing 201-250 of 677 papers

Title | Status | Hype
Effects of sparse rewards of different magnitudes in the speed of learning of model-based actor critic methods | | 0
Efficient Diversity-based Experience Replay for Deep Reinforcement Learning | | 0
Efficiently Training On-Policy Actor-Critic Networks in Robotic Deep Reinforcement Learning with Demonstration-like Sampled Exploration | | 0
Efficient Model-Based Reinforcement Learning Through Optimistic Thompson Sampling | | 0
Boosting Exploration in Actor-Critic Algorithms by Incentivizing Plausible Novel States | | 0
ELSIM: End-to-end learning of reusable skills through intrinsic motivation | | 0
Disentangling Dynamics and Returns: Value Function Decomposition with Future Prediction | | 0
Benchmarking the Sim-to-Real Gap in Cloth Manipulation | | 0
ALOHA 2: An Enhanced Low-Cost Hardware for Bimanual Teleoperation | | 0
Adaptive Ensemble Q-learning: Minimizing Estimation Bias via Error Feedback | | 0
EnsembleDAgger: A Bayesian Approach to Safe Imitation Learning | | 0
Entropy Augmented Reinforcement Learning | | 0
Behavioral Entropy-Guided Dataset Generation for Offline Reinforcement Learning | | 0
DIDA: Denoised Imitation Learning based on Domain Adaptation | | 0
DexDLO: Learning Goal-Conditioned Dexterous Policy for Dynamic Manipulation of Deformable Linear Objects | | 0
Episodic Reinforcement Learning with Expanded State-reward Space | | 0
Estimating Disentangled Belief about Hidden State and Hidden Task for Meta-RL | | 0
Evaluating Robustness of Cooperative MARL | | 0
Bayesian Reparameterization of Reward-Conditioned Reinforcement Learning with Energy-based Models | | 0
Evolutionary Strategy Guided Reinforcement Learning via MultiBuffer Communication | | 0
A Logarithmic Barrier Method For Proximal Policy Optimization | | 0
Evolving Rewards to Automate Reinforcement Learning | | 0
Hierarchical Reinforcement Learning of Locomotion Policies in Response to Approaching Objects: A Preliminary Study | | 0
CAMEL: Continuous Action Masking Enabled by Large Language Models for Reinforcement Learning | | 0
Can Reinforcement Learning for Continuous Control Generalize Across Physics Engines? | | 0
Detecting and Mitigating Reward Hacking in Reinforcement Learning Systems: A Comprehensive Empirical Study | | 0
Aligning Humans and Robots via Reinforcement Learning from Implicit Human Feedback | | 0
Delay-Adapted Policy Optimization and Improved Regret for Adversarial MDP with Delayed Bandit Feedback | | 0
CasIL: Cognizing and Imitating Skills via a Dual Cognition-Action Architecture | | 0
Extrinsicaly Rewarded Soft Q Imitation Learning with Discriminator | | 0
DEFENDER: DTW-Based Episode Filtering Using Demonstrations for Enhancing RL Safety | | 0
Bayesian Distributional Policy Gradients | | 0
Fast Convergence of Softmax Policy Mirror Ascent | | 0
FastTD3: Simple, Fast, and Capable Reinforcement Learning for Humanoid Control | | 0
Modular Recurrence in Contextual MDPs for Universal Morphology Control | | 0
Feasible Adversarial Robust Reinforcement Learning for Underspecified Environments | | 0
C-GAIL: Stabilizing Generative Adversarial Imitation Learning with Control Theory | | 0
Fight fire with fire: countering bad shortcuts in imitation learning with good shortcuts | | 0
Heterogeneous-Agent Mirror Learning: A Continuum of Solutions to Cooperative MARL | | 0
Fine-Tuning Offline Reinforcement Learning with Model-Based Policy Optimization | | 0
First Go, then Post-Explore: the Benefits of Post-Exploration in Intrinsic Motivation | | 0
DEER: A Delay-Resilient Framework for Reinforcement Learning with Variable Delays | | 0
Follow the Object: Curriculum Learning for Manipulation Tasks with Imagined Goals | | 0
Adapting World Models with Latent-State Dynamics Residuals | | 0
Formal Language Constrained Markov Decision Processes | | 0
CLARE: Conservative Model-Based Reward Learning for Offline Inverse Reinforcement Learning | | 0
FP3O: Enabling Proximal Policy Optimization in Multi-Agent Cooperation with Parameter-Sharing Versatility | | 0
From proprioception to long-horizon planning in novel environments: A hierarchical RL model | | 0
Gaussian Process Policy Optimization | | 0
Accelerating Inverse Reinforcement Learning with Expert Bootstrapping | | 0
Page 5 of 14

No leaderboard results yet.