SOTAVerified

MuJoCo Games

Papers

Showing 18 of 8 papers

TitleStatusHype
Surrogate-Assisted Evolutionary Reinforcement Learning Based on Autoencoder and Hyperbolic Neural Network0
LS-IQ: Implicit Reward Regularization for Inverse Reinforcement LearningCode1
A Unified Approach to Reinforcement Learning, Quantal Response Equilibria, and Two-Player Zero-Sum GamesCode1
EDGE: Explaining Deep Reinforcement Learning PoliciesCode1
Particle Based Stochastic Policy Optimization0
IQ-Learn: Inverse soft-Q Learning for ImitationCode1
Weak Human Preference Supervision For Deep Reinforcement LearningCode0
RL Unplugged: A Suite of Benchmarks for Offline Reinforcement LearningCode0
Show:102550

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1IQ-LearnAverage Return4,362.9Unverified
2PEMIRLAverage Return846.18Unverified
3AIRL Fu et al. (2017)Average Return127.61Unverified
#ModelMetricClaimedVerifiedStatus
1IQ-LearnMean5,134Unverified
2POP3DMean3,966.01Unverified
#ModelMetricClaimedVerifiedStatus
1ParPIAverage Reward5,142Unverified
#ModelMetricClaimedVerifiedStatus
1POP3DMean3,184.54Unverified
#ModelMetricClaimedVerifiedStatus
1ParPIAverage Reward11,738Unverified
#ModelMetricClaimedVerifiedStatus
1POP3DMean1,452.09Unverified
#ModelMetricClaimedVerifiedStatus
1ParPIAverage Reward3,042Unverified
#ModelMetricClaimedVerifiedStatus
1IQ-LearnReturn5,227.1Unverified
#ModelMetricClaimedVerifiedStatus
1ParPIAverage Reward4,912Unverified
#ModelMetricClaimedVerifiedStatus
1POP3DMean4,907.64Unverified
#ModelMetricClaimedVerifiedStatus
1POP3DMean741.94Unverified
#ModelMetricClaimedVerifiedStatus
1PEMIRLAverage Return-7.37Unverified
#ModelMetricClaimedVerifiedStatus
1POP3DMean-4.29Unverified
#ModelMetricClaimedVerifiedStatus
1PEMIRLAverage Return-27.16Unverified
#ModelMetricClaimedVerifiedStatus
1PEMIRLAverage Return-74.17Unverified
#ModelMetricClaimedVerifiedStatus
1POP3DMean111.08Unverified
#ModelMetricClaimedVerifiedStatus
1ParPIAverage Reward5,201Unverified