SOTAVerified

MuJoCo

Papers

Showing 176–200 of 677 papers

| Title | Status | Hype |
| --- | --- | --- |
| Global Convergence of Natural Policy Gradient with Hessian-aided Momentum Variance Reduction | | 0 |
| Adaptive trajectory-constrained exploration strategy for deep reinforcement learning | Code | 0 |
| Efficient Reinforcement Learning via Decoupling Exploration and Utilization | Code | 1 |
| XuanCe: A Comprehensive and Unified Deep Reinforcement Learning Library | Code | 3 |
| DexDLO: Learning Goal-Conditioned Dexterous Policy for Dynamic Manipulation of Deformable Linear Objects | | 0 |
| OVD-Explorer: Optimism Should Not Be the Sole Pursuit of Exploration in Noisy Environments | | 0 |
| GO-DICE: Goal-Conditioned Option-Aware Offline Imitation Learning via Stationary Distribution Correction Estimation | Code | 0 |
| Small Dataset, Big Gains: Enhancing Reinforcement Learning by Offline Pre-Training with Model Based Augmentation | | 0 |
| World Models via Policy-Guided Trajectory Diffusion | Code | 1 |
| A Perspective of Q-value Estimation on Offline-to-Online Reinforcement Learning | | 0 |
| A dynamical clipping approach with task feedback for Proximal Policy Optimization | Code | 0 |
| Similarity-based Knowledge Transfer for Cross-Domain Reinforcement Learning | | 0 |
| Supported Trust Region Optimization for Offline Reinforcement Learning | | 0 |
| On-Policy Policy Gradient Reinforcement Learning Without On-Policy Sampling | | 0 |
| An Intelligent Social Learning-based Optimization Strategy for Black-box Robotic Control with Reinforcement Learning | | 0 |
| Optimistic Multi-Agent Policy Gradient | Code | 1 |
| Robust Adversarial Reinforcement Learning via Bounded Rationality Curricula | | 0 |
| A Tractable Inference Perspective of Offline RL | | 0 |
| Good Better Best: Self-Motivated Imitation Learning for noisy Demonstrations | | 0 |
| Mind the Model, Not the Agent: The Primacy Bias in Model-based RL | | 0 |
| Policy Gradient with Kernel Quadrature | | 0 |
| One is More: Diverse Perspectives within a Single Network for Efficient DRL | | 0 |
| Vision-Language Models are Zero-Shot Reward Models for Reinforcement Learning | Code | 1 |
| Benchmarking the Sim-to-Real Gap in Cloth Manipulation | | 0 |
| LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios | | 0 |

No leaderboard results yet.