SOTAVerified

MuJoCo

Papers

Showing 150 of 677 papers

TitleStatusHype
MuJoCo MPC for Humanoid Control: Evaluation on HumanoidBenchCode5
Enhancing Efficiency of Safe Reinforcement Learning via Sample ManipulationCode5
Balance Reward and Safety Optimization for Safe Reinforcement Learning: A Perspective of Gradient ManipulationCode5
Humanoid-Gym: Reinforcement Learning for Humanoid Robot with Zero-Shot Sim2Real TransferCode5
EnvPool: A Highly Parallel Reinforcement Learning Environment Execution EngineCode5
Streaming Deep Reinforcement Learning Finally WorksCode3
XuanCe: A Comprehensive and Unified Deep Reinforcement Learning LibraryCode3
Learning Bipedal Walking On Planned Footsteps For Humanoid RobotsCode3
Tianshou: a Highly Modularized Deep Reinforcement Learning LibraryCode3
Meta-DT: Offline Meta-RL as Conditional Sequence Modeling with World Model DisentanglementCode2
Any-step Dynamics Model Improves Future Predictions for Online and Offline Reinforcement LearningCode2
Diffusion-based Reinforcement Learning via Q-weighted Variational Policy OptimizationCode2
Diffusion Actor-Critic with Entropy RegulatorCode2
Simple Policy OptimizationCode2
Text2Reward: Reward Shaping with Language Models for Reinforcement LearningCode2
Maximum Entropy Heterogeneous-Agent Reinforcement LearningCode2
Multi-Agent Reinforcement Learning is a Sequence Modeling ProblemCode2
JORLDY: a fully customizable open source framework for reinforcement learningCode2
Brax -- A Differentiable Physics Engine for Large Scale Rigid Body SimulationCode2
robosuite: A Modular Simulation Framework and Benchmark for Robot LearningCode2
Deep Reinforcement Learning with Gradient Eligibility TracesCode1
Reinforcement Learning for Ballbot Navigation in Uneven TerrainCode1
Model Tensor PlanningCode1
An Real-Sim-Real (RSR) Loop Framework for Generalizable Robotic Policy Transfer with Differentiable SimulationCode1
Maximum Entropy Reinforcement Learning with Diffusion PolicyCode1
Doubly Mild Generalization for Offline Reinforcement LearningCode1
FM-TS: Flow Matching for Time Series GenerationCode1
Zonal RL-RRT: Integrated RL-RRT Path Planning with Collision Probability and Zone ConnectivityCode1
Learning Successor Features the Simple WayCode1
Balanced Neural ODEs: nonlinear model order reduction and Koopman operator approximationsCode1
Kaleidoscope: Learnable Masks for Heterogeneous Multi-agent Reinforcement LearningCode1
LLM-Empowered State Representation for Reinforcement LearningCode1
Energy-Guided Diffusion Sampling for Offline-to-Online Reinforcement LearningCode1
RRLS : Robust Reinforcement Learning SuiteCode1
Skill-aware Mutual Information Optimisation for Generalisation in Reinforcement LearningCode1
Maximum Entropy Reinforcement Learning via Energy-Based Normalizing FlowCode1
S^2AC: Energy-Based Reinforcement Learning with Stein Soft Actor CriticCode1
No Representation, No Trust: Connecting Representation, Collapse, and Trust Issues in PPOCode1
UCB-driven Utility Function Search for Multi-objective Reinforcement LearningCode1
Latent Plan Transformer for Trajectory Abstraction: Planning as Latent Space InferenceCode1
Efficient Reinforcement Learning via Decoupling Exploration and UtilizationCode1
World Models via Policy-Guided Trajectory DiffusionCode1
Optimistic Multi-Agent Policy GradientCode1
Vision-Language Models are Zero-Shot Reward Models for Reinforcement LearningCode1
Practical Probabilistic Model-based Deep Reinforcement Learning by Integrating Dropout Uncertainty and Trajectory SamplingCode1
A Bayesian Approach to Robust Inverse Reinforcement LearningCode1
Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value RegularizationCode1
Natural Actor-Critic for Robust Reinforcement Learning with Function ApproximationCode1
Maximize to Explore: One Objective Function Fusing Estimation, Planning, and ExplorationCode1
Policy Representation via Diffusion Probability Model for Reinforcement LearningCode1
Show:102550
← PrevPage 1 of 14Next →

No leaderboard results yet.