SOTAVerified

MuJoCo

Papers

Showing 201225 of 677 papers

TitleStatusHype
Imitation Learning from Purified DemonstrationsCode0
Improved Communication Efficiency in Federated Natural Policy Gradient via ADMM-based Gradient Updates0
FP3O: Enabling Proximal Policy Optimization in Multi-Agent Cooperation with Parameter-Sharing Versatility0
On Representation Complexity of Model-based and Model-free Reinforcement Learning0
CasIL: Cognizing and Imitating Skills via a Dual Cognition-Action Architecture0
Adapting Double Q-Learning for Continuous Reinforcement Learning0
Iterative Reachability Estimation for Safe Reinforcement Learning0
Practical Probabilistic Model-based Deep Reinforcement Learning by Integrating Dropout Uncertainty and Trajectory SamplingCode1
Text2Reward: Reward Shaping with Language Models for Reinforcement LearningCode2
A Bayesian Approach to Robust Inverse Reinforcement LearningCode1
Distributionally Robust Statistical Verification with Imprecise Neural Networks0
Careful at Estimation and Bold at Exploration0
Heterogeneous Multi-Agent Reinforcement Learning via Mirror Descent Policy OptimizationCode0
BiERL: A Meta Evolutionary Reinforcement Learning Framework via Bilevel OptimizationCode0
DMFC-GraspNet: Differentiable Multi-Fingered Robotic Grasp Generation in Cluttered Scenes0
Variance Control for Distributional Reinforcement LearningCode0
Theoretically Guaranteed Policy Improvement Distilled from Model-Based Planning0
Offline Multi-Agent Reinforcement Learning with Implicit Global-to-Local Value RegularizationCode1
Scalable Multi-agent Covering Option Discovery based on Kronecker Graphs0
Exploring reinforcement learning techniques for discrete and continuous control tasks in the MuJoCo environmentCode0
Natural Actor-Critic for Robust Reinforcement Learning with Function ApproximationCode1
Beyond Conservatism: Diffusion Policies in Offline Multi-agent Reinforcement Learning0
Learning non-Markovian Decision-Making from State-only SequencesCode0
CEIL: Generalized Contextual Imitation Learning0
Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot Policy ImitationCode0
Show:102550
← PrevPage 9 of 28Next →

No leaderboard results yet.