SOTAVerified

MuJoCo

Papers

Showing 201250 of 677 papers

TitleStatusHype
Imitation Learning from Purified DemonstrationsCode0
Improved Communication Efficiency in Federated Natural Policy Gradient via ADMM-based Gradient Updates0
FP3O: Enabling Proximal Policy Optimization in Multi-Agent Cooperation with Parameter-Sharing Versatility0
On Representation Complexity of Model-based and Model-free Reinforcement Learning0
CasIL: Cognizing and Imitating Skills via a Dual Cognition-Action Architecture0
Adapting Double Q-Learning for Continuous Reinforcement Learning0
Iterative Reachability Estimation for Safe Reinforcement Learning0
Practical Probabilistic Model-based Deep Reinforcement Learning by Integrating Dropout Uncertainty and Trajectory SamplingCode1
Text2Reward: Reward Shaping with Language Models for Reinforcement LearningCode2
A Bayesian Approach to Robust Inverse Reinforcement LearningCode1
Distributionally Robust Statistical Verification with Imprecise Neural Networks0
Careful at Estimation and Bold at Exploration0
Heterogeneous Multi-Agent Reinforcement Learning via Mirror Descent Policy OptimizationCode0
BiERL: A Meta Evolutionary Reinforcement Learning Framework via Bilevel OptimizationCode0
DMFC-GraspNet: Differentiable Multi-Fingered Robotic Grasp Generation in Cluttered Scenes0
Variance Control for Distributional Reinforcement LearningCode0
Theoretically Guaranteed Policy Improvement Distilled from Model-Based Planning0
Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value RegularizationCode1
Scalable Multi-agent Covering Option Discovery based on Kronecker Graphs0
Exploring reinforcement learning techniques for discrete and continuous control tasks in the MuJoCo environmentCode0
Natural Actor-Critic for Robust Reinforcement Learning with Function ApproximationCode1
Beyond Conservatism: Diffusion Policies in Offline Multi-agent Reinforcement Learning0
Learning non-Markovian Decision-Making from State-only SequencesCode0
CEIL: Generalized Contextual Imitation Learning0
Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot Policy ImitationCode0
Adaptive Ensemble Q-learning: Minimizing Estimation Bias via Error Feedback0
Evolutionary Strategy Guided Reinforcement Learning via MultiBuffer Communication0
Surfer: Progressive Reasoning with World Models for Robotic Manipulation0
Maximum Entropy Heterogeneous-Agent Reinforcement LearningCode2
AdaStop: adaptive statistical testing for sound comparisons of Deep RL agentsCode0
Mimicking Better by Matching the Approximate Action DistributionCode0
Recurrent Action Transformer with MemoryCode0
Language to Rewards for Robotic Skill Synthesis0
Robust Reinforcement Learning through Efficient Adversarial Herding0
Mildly Constrained Evaluation Policy for Offline Reinforcement LearningCode0
ReLU to the Rescue: Improve Your On-Policy Actor-Critic with Positive AdvantagesCode0
MetaDiffuser: Diffusion Model as Conditional Planner for Offline Meta-RL0
Maximize to Explore: One Objective Function Fusing Estimation, Planning, and ExplorationCode1
A Model-Based Solution to the Offline Multi-Agent Reinforcement Learning Coordination Problem0
Inverse Reinforcement Learning with the Average Reward Criterion0
OER: Offline Experience Replay for Continual Offline Reinforcement Learning0
Policy Representation via Diffusion Probability Model for Reinforcement LearningCode1
TOM: Learning Policy-Aware Models for Model-Based Reinforcement Learning via Transition Occupancy Matching0
Unsupervised Discovery of Continuous Skills on a Sphere0
Off-Policy Average Reward Actor-Critic with Deterministic Policy SearchCode0
Bayesian Reparameterization of Reward-Conditioned Reinforcement Learning with Energy-based Models0
Client Selection for Federated Policy Optimization with Environment HeterogeneityCode0
Coagent Networks: Generalized and Scaled0
Delay-Adapted Policy Optimization and Improved Regret for Adversarial MDP with Delayed Bandit Feedback0
DEFENDER: DTW-Based Episode Filtering Using Demonstrations for Enhancing RL Safety0
Show:102550
← PrevPage 5 of 14Next →

No leaderboard results yet.