SOTAVerified

MuJoCo

Papers

Showing 451–500 of 677 papers

| Title | Status | Hype |
| --- | --- | --- |
| CKNet: A Convolutional Neural Network Based on Koopman Operator for Modeling Latent Dynamics from Pixels | | 0 |
| Q-Value Weighted Regression: Reinforcement Learning with Limited Data | Code | 0 |
| Robust Policy Gradient against Strong Data Corruption | Code | 0 |
| Variance Penalized On-Policy and Off-Policy Actor-Critic | Code | 0 |
| GST: Group-Sparse Training for Accelerating Deep Reinforcement Learning | | 0 |
| Rank the Episodes: A Simple Approach for Exploration in Procedurally-Generated Environments | Code | 1 |
| Randomized Ensembled Double Q-Learning: Learning Fast Without a Model | Code | 1 |
| Cross-Modal Domain Adaptation for Reinforcement Learning | Code | 1 |
| Multi-Agent Trust Region Learning | Code | 1 |
| CAT-SAC: Soft Actor-Critic with Curiosity-Aware Entropy Temperature | | 0 |
| Adaptive N-step Bootstrapping with Off-policy Data | | 0 |
| TEAC: Intergrating Trust Region and Max Entropy Actor Critic for Continuous Control | Code | 0 |
| Invariant Representations for Reinforcement Learning without Reconstruction | | 0 |
| Intrinsically Guided Exploration in Meta Reinforcement Learning | | 0 |
| PGPS : Coupling Policy Gradient with Population-based Search | | 0 |
| Self-Supervised Continuous Control without Policy Gradient | | 0 |
| Practical Marginalized Importance Sampling with the Successor Representation | | 0 |
| Fine-Tuning Offline Reinforcement Learning with Model-Based Policy Optimization | | 0 |
| Addressing Distribution Shift in Online Reinforcement Learning with Offline Datasets | | 0 |
| Formal Language Constrained Markov Decision Processes | | 0 |
| MQES: Max-Q Entropy Search for Efficient Exploration in Continuous Reinforcement Learning | | 0 |
| Hellinger Distance Constrained Regression | | 0 |
| Locally Persistent Exploration in Continuous Control Tasks with Sparse Rewards | Code | 0 |
| OPAC: Opportunistic Actor-Critic | | 0 |
| Reset-Free Lifelong Learning with Skill-Space Planning | Code | 1 |
| Offline Imitation Learning with a Misspecified Simulator | | 0 |
| Continuous Transition: Improving Sample Efficiency for Continuous Control Problems via MixUp | Code | 0 |
| Weighted Entropy Modification for Soft Actor-Critic | | 0 |
| Proximal Policy Optimization via Enhanced Exploration Efficiency | | 0 |
| Sim2Sim Evaluation of a Novel Data-Efficient Differentiable Physics Engine for Tensegrity Robots | | 0 |
| RealAnt: An Open-Source Low-Cost Quadruped for Education and Research in Real-World Reinforcement Learning | Code | 1 |
| Learning to Utilize Shaping Rewards: A New Approach of Reward Shaping | | 0 |
| Cooperative Heterogeneous Deep Reinforcement Learning | | 0 |
| Can Reinforcement Learning for Continuous Control Generalize Across Physics Engines? | | 0 |
| Robust Constrained Reinforcement Learning for Continuous Control with Model Misspecification | | 0 |
| Knowledge Transfer in Multi-Task Deep Reinforcement Learning for Continuous Control | Code | 1 |
| Human-guided Robot Behavior Learning: A GAN-assisted Preference-based Reinforcement Learning Approach | Code | 0 |
| Self-Imitation Learning for Robot Tasks with Sparse and Delayed Rewards | Code | 0 |
| Balancing Constraints and Rewards with Meta-Gradient D4PG | | 0 |
| Hindsight Experience Replay with Kronecker Product Approximate Curvature | | 0 |
| Learning Intrinsic Symbolic Rewards in Reinforcement Learning | | 0 |
| Reinforcement Learning with Random Delays | Code | 1 |
| FORK: A Forward-Looking Actor For Model-Free Reinforcement Learning | Code | 1 |
| What About Taking Policy as Input of Value Function: Policy-extended Value Function Approximator | | 0 |
| Population-Guided Imitation Learning | | 0 |
| robosuite: A Modular Simulation Framework and Benchmark for Robot Learning | Code | 2 |
| Revisiting Design Choices in Proximal Policy Optimization | Code | 1 |
| Soft policy optimization using dual-track advantage estimator | | 0 |
| Sample-Efficient Automated Deep Reinforcement Learning | Code | 1 |
| Constrained Markov Decision Processes via Backward Value Functions | | 0 |
Page 10 of 14

No leaderboard results yet.