SOTAVerified

MuJoCo

Papers

Showing 201225 of 677 papers

TitleStatusHype
Effects of sparse rewards of different magnitudes in the speed of learning of model-based actor critic methods0
Efficient Diversity-based Experience Replay for Deep Reinforcement Learning0
Efficiently Training On-Policy Actor-Critic Networks in Robotic Deep Reinforcement Learning with Demonstration-like Sampled Exploration0
Efficient Model-Based Reinforcement Learning Through Optimistic Thompson Sampling0
Boosting Exploration in Actor-Critic Algorithms by Incentivizing Plausible Novel States0
ELSIM: End-to-end learning of reusable skills through intrinsic motivation0
Beyond Conservatism: Diffusion Policies in Offline Multi-agent Reinforcement Learning0
Disentangling Dynamics and Returns: Value Function Decomposition with Future Prediction0
Benchmarking the Sim-to-Real Gap in Cloth Manipulation0
ALOHA 2: An Enhanced Low-Cost Hardware for Bimanual Teleoperation0
EnsembleDAgger: A Bayesian Approach to Safe Imitation Learning0
Entropy Augmented Reinforcement Learning0
Adaptive Ensemble Q-learning: Minimizing Estimation Bias via Error Feedback0
Behavioral Entropy-Guided Dataset Generation for Offline Reinforcement Learning0
Bridging Physics-Informed Neural Networks with Reinforcement Learning: Hamilton-Jacobi-Bellman Proximal Policy Optimization (HJBPPO)0
Episodic Reinforcement Learning with Expanded State-reward Space0
Estimating Disentangled Belief about Hidden State and Hidden Task for Meta-RL0
Evaluating Robustness of Cooperative MARL0
DIDA: Denoised Imitation Learning based on Domain Adaptation0
Evolutionary Strategy Guided Reinforcement Learning via MultiBuffer Communication0
DexDLO: Learning Goal-Conditioned Dexterous Policy for Dynamic Manipulation of Deformable Linear Objects0
Evolving Rewards to Automate Reinforcement Learning0
Bayesian Reparameterization of Reward-Conditioned Reinforcement Learning with Energy-based Models0
CAMEL: Continuous Action Masking Enabled by Large Language Models for Reinforcement Learning0
A Logarithmic Barrier Method For Proximal Policy Optimization0
Show:102550
← PrevPage 9 of 28Next →

No leaderboard results yet.