SOTAVerified

MuJoCo

Papers

Showing 301350 of 677 papers

TitleStatusHype
CasIL: Cognizing and Imitating Skills via a Dual Cognition-Action Architecture0
Adapting Double Q-Learning for Continuous Reinforcement Learning0
Iterative Reachability Estimation for Safe Reinforcement Learning0
Distributionally Robust Statistical Verification with Imprecise Neural Networks0
Careful at Estimation and Bold at Exploration0
Heterogeneous Multi-Agent Reinforcement Learning via Mirror Descent Policy OptimizationCode0
DMFC-GraspNet: Differentiable Multi-Fingered Robotic Grasp Generation in Cluttered Scenes0
BiERL: A Meta Evolutionary Reinforcement Learning Framework via Bilevel OptimizationCode0
Variance Control for Distributional Reinforcement LearningCode0
Theoretically Guaranteed Policy Improvement Distilled from Model-Based Planning0
Scalable Multi-agent Covering Option Discovery based on Kronecker Graphs0
Exploring reinforcement learning techniques for discrete and continuous control tasks in the MuJoCo environmentCode0
Beyond Conservatism: Diffusion Policies in Offline Multi-agent Reinforcement Learning0
Learning non-Markovian Decision-Making from State-only SequencesCode0
CEIL: Generalized Contextual Imitation Learning0
Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot Policy ImitationCode0
Evolutionary Strategy Guided Reinforcement Learning via MultiBuffer Communication0
Surfer: Progressive Reasoning with World Models for Robotic Manipulation0
Adaptive Ensemble Q-learning: Minimizing Estimation Bias via Error Feedback0
AdaStop: adaptive statistical testing for sound comparisons of Deep RL agentsCode0
Mimicking Better by Matching the Approximate Action DistributionCode0
Recurrent Action Transformer with MemoryCode0
Language to Rewards for Robotic Skill Synthesis0
Robust Reinforcement Learning through Efficient Adversarial Herding0
Mildly Constrained Evaluation Policy for Offline Reinforcement LearningCode0
ReLU to the Rescue: Improve Your On-Policy Actor-Critic with Positive AdvantagesCode0
MetaDiffuser: Diffusion Model as Conditional Planner for Offline Meta-RL0
A Model-Based Solution to the Offline Multi-Agent Reinforcement Learning Coordination Problem0
Inverse Reinforcement Learning with the Average Reward Criterion0
OER: Offline Experience Replay for Continual Offline Reinforcement Learning0
TOM: Learning Policy-Aware Models for Model-Based Reinforcement Learning via Transition Occupancy Matching0
Unsupervised Discovery of Continuous Skills on a Sphere0
Off-Policy Average Reward Actor-Critic with Deterministic Policy SearchCode0
Client Selection for Federated Policy Optimization with Environment HeterogeneityCode0
Bayesian Reparameterization of Reward-Conditioned Reinforcement Learning with Energy-based Models0
Coagent Networks: Generalized and Scaled0
Delay-Adapted Policy Optimization and Improved Regret for Adversarial MDP with Delayed Bandit Feedback0
DEFENDER: DTW-Based Episode Filtering Using Demonstrations for Enhancing RL Safety0
Explaining RL Decisions with TrajectoriesCode0
Simple Noisy Environment Augmentation for Reinforcement LearningCode0
Meta-Reinforcement Learning Based on Self-Supervised Task Representation Learning0
Feudal Graph Reinforcement LearningCode0
Learning Complicated Manipulation Skills via Deterministic Policy with Limited Demonstrations0
Merging Decision Transformers: Weight Averaging for Forming Multi-Task PoliciesCode0
Sample-efficient Adversarial Imitation Learning0
Soft Actor-Critic Algorithm with Truly-satisfied Inequality Constraint0
A Strategy-Oriented Bayesian Soft Actor-Critic Model0
Controlled Diversity with Preference : Towards Learning a Diverse Set of Desired SkillsCode0
Wasserstein Actor-Critic: Directed Exploration via Optimism for Continuous-Actions Control0
Decision Transformer under Random Frame DroppingCode0
Show:102550
← PrevPage 7 of 14Next →

No leaderboard results yet.