SOTAVerified

MuJoCo

Papers

Showing 51-100 of 677 papers

Title | Status | Hype
An Equivalence between Loss Functions and Non-Uniform Sampling in Experience Replay | Code | 1
Order Matters: Agent-by-agent Policy Optimization | Code | 1
Partial advantage estimator for proximal policy optimization | Code | 1
An Open-Source Multi-Goal Reinforcement Learning Environment for Robotic Manipulation with Pybullet | Code | 1
FORK: A Forward-Looking Actor For Model-Free Reinforcement Learning | Code | 1
An Real-Sim-Real (RSR) Loop Framework for Generalizable Robotic Policy Transfer with Differentiable Simulation | Code | 1
Converting Biomechanical Models from OpenSim to MuJoCo | Code | 1
RealAnt: An Open-Source Low-Cost Quadruped for Education and Research in Real-World Reinforcement Learning | Code | 1
FM-TS: Flow Matching for Time Series Generation | Code | 1
Reset-Free Lifelong Learning with Skill-Space Planning | Code | 1
Revisiting Design Choices in Proximal Policy Optimization | Code | 1
AdaptDiffuser: Diffusion Models as Adaptive Self-evolving Planners | Code | 1
Cross-Modal Domain Adaptation for Reinforcement Learning | Code | 1
Multi-Agent Trust Region Learning | Code | 1
MetaCURE: Meta Reinforcement Learning with Empowerment-Driven Exploration | Code | 1
Learnings Options End-to-End for Continuous Action Tasks | Code | 1
Lipschitz-constrained Unsupervised Skill Discovery | Code | 1
Latent Plan Transformer for Trajectory Abstraction: Planning as Latent Space Inference | Code | 1
Attacking Cooperative Multi-Agent Reinforcement Learning by Adversarial Minority Influence | Code | 1
A Bayesian Approach to Robust Inverse Reinforcement Learning | Code | 1
Conditioning Sparse Variational Gaussian Processes for Online Decision-making | Code | 1
LLM-Empowered State Representation for Reinforcement Learning | Code | 1
Contrastive Variational Reinforcement Learning for Complex Observations | Code | 1
Kaleidoscope: Learnable Masks for Heterogeneous Multi-agent Reinforcement Learning | Code | 1
Learning Invariant Representations for Reinforcement Learning without Reconstruction | Code | 1
Conservative Offline Distributional Reinforcement Learning | Code | 1
A Game-Theoretic Approach to Multi-Agent Trust Region Optimization | Code | 1
Learning Successor Features the Simple Way | Code | 1
Improving Generalization in Meta-RL with Imaginary Tasks from Latent Dynamics Mixture | Code | 1
Imitation Learning with Sinkhorn Distances | Code | 1
Cross-modal Domain Adaptation for Cost-Efficient Visual Reinforcement Learning | Code | 1
DeepMind Control Suite | Code | 1
DART: Noise Injection for Robust Imitation Learning | Code | 1
Balanced Neural ODEs: nonlinear model order reduction and Koopman operator approximations | Code | 1
Improving Sample Efficiency in Model-Free Reinforcement Learning from Images | Code | 1
Maximum Entropy Reinforcement Learning via Energy-Based Normalizing Flow | Code | 1
How to Learn a Useful Critic? Model-based Action-Gradient-Estimator Policy Optimization | Code | 1
Delay-Aware Model-Based Reinforcement Learning for Continuous Control | Code | 1
Deep Reinforcement Learning with Gradient Eligibility Traces | Code | 1
Mitigating Covariate Shift in Imitation Learning via Offline Data Without Great Coverage | Code | 1
Joint action loss for proximal policy optimization | Code | 1
ARLO: A Framework for Automated Reinforcement Learning | Code | 1
Doubly Mild Generalization for Offline Reinforcement Learning | Code | 1
Model Tensor Planning | Code | 1
A Deep Reinforcement Learning Approach to Marginalized Importance Sampling with the Successor Representation | Code | 1
Fast Adaptation via Policy-Dynamics Value Functions | Code | 1
Boosting Soft Actor-Critic: Emphasizing Recent Experience without Forgetting the Past | Code | 1
Natural Actor-Critic for Robust Reinforcement Learning with Function Approximation | Code | 1
Evolution Strategies as a Scalable Alternative to Reinforcement Learning | Code | 1
Distributional Soft Actor-Critic: Off-Policy Reinforcement Learning for Addressing Value Estimation Errors | Code | 1
Page 2 of 14

No leaderboard results yet.