SOTAVerified

MuJoCo

Papers

Showing 551600 of 677 papers

TitleStatusHype
Merging Decision Transformers: Weight Averaging for Forming Multi-Task PoliciesCode0
Bayes Adaptive Monte Carlo Tree Search for Offline Model-based Reinforcement LearningCode0
Variance Penalized On-Policy and Off-Policy Actor-CriticCode0
Learning Goal Embeddings via Self-Play for Hierarchical Reinforcement LearningCode0
RIZE: Regularized Imitation Learning via Distributional Reinforcement LearningCode0
Structured Control Nets for Deep Reinforcement LearningCode0
Cyclic Policy Distillation: Sample-Efficient Sim-to-Real Reinforcement Learning with Domain RandomizationCode0
Learning Generalizable Skills from Offline Multi-Task Data for Multi-Agent CooperationCode0
Mildly Constrained Evaluation Policy for Offline Reinforcement LearningCode0
SUPERVISED POLICY UPDATECode0
Learning Calibratable Policies using Programmatic Style-ConsistencyCode0
Language as an Abstraction for Hierarchical Deep Reinforcement LearningCode0
Supervised Policy Update for Deep Reinforcement LearningCode0
Controlled Diversity with Preference : Towards Learning a Diverse Set of Desired SkillsCode0
Is Mamba Compatible with Trajectory Optimization in Offline Reinforcement Learning?Code0
Continuous Transition: Improving Sample Efficiency for Continuous Control Problems via MixUpCode0
Imitation Learning from Purified DemonstrationsCode0
Imitation Learning from Observations under Transition Model DisparityCode0
Robust Deep Reinforcement Learning with Adaptive Adversarial Perturbations in Action SpaceCode0
MuJoCo: A physics engine for model-based controlCode0
Weak Human Preference Supervision For Deep Reinforcement LearningCode0
Human-guided Robot Behavior Learning: A GAN-assisted Preference-based Reinforcement Learning ApproachCode0
Hierarchical Reinforcement Learning with Advantage-Based Auxiliary RewardsCode0
Heterogeneous Multi-Agent Reinforcement Learning via Mirror Descent Policy OptimizationCode0
Adaptive Exploration for Data-Efficient General Value Function EvaluationsCode0
A Quadratic Actor Network for Model-Free Reinforcement LearningCode0
Trust the Model Where It Trusts Itself -- Model-Based Actor-Critic with Uncertainty-Aware Rollout AdaptionCode0
Robust Model-Based Reinforcement Learning with an Adversarial Auxiliary ModelCode0
Hard-Thresholding Meets Evolution Strategies in Reinforcement LearningCode0
Symmetric Reinforcement Learning Loss for Robust Learning on Diverse Tasks and Model ScalesCode0
Robust Policy Gradient against Strong Data CorruptionCode0
TaSIL: Taylor Series Imitation LearningCode0
Handling Delay in Real-Time Reinforcement LearningCode0
GO-DICE: Goal-Conditioned Option-Aware Offline Imitation Learning via Stationary Distribution Correction EstimationCode0
NerveNet: Learning Structured Policy with Graph Neural NetworksCode0
Task-Aware Virtual Training: Enhancing Generalization in Meta-Reinforcement Learning for Out-of-Distribution TasksCode0
Neural Network Dynamics for Model-Based Deep Reinforcement Learning with Model-Free Fine-TuningCode0
Robust Reinforcement Learning via Adversarial training with Langevin DynamicsCode0
ROER: Regularized Optimal Experience ReplayCode0
No Need for Interactions: Robust Model-Based Imitation Learning using Neural ODECode0
TD3 with Reverse KL Regularizer for Offline Reinforcement Learning from Mixed DatasetsCode0
Generalized Adaptive Transfer Network: Enhancing Transfer Learning in Reinforcement Learning Across DomainsCode0
Novel Policy Seeking with Constrained OptimizationCode0
Continuous Control With Ensemble Deep Deterministic Policy GradientsCode0
Context-Based Soft Actor Critic for Environments with Non-stationary DynamicsCode0
Formal Language Constraints for Markov Decision ProcessesCode0
Feudal Graph Reinforcement LearningCode0
TEAC: Intergrating Trust Region and Max Entropy Actor Critic for Continuous ControlCode0
Offline Reinforcement Learning via Inverse OptimizationCode0
Variational Delayed Policy OptimizationCode0
Show:102550
← PrevPage 12 of 14Next →

No leaderboard results yet.