SOTAVerified

Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor

2018-01-04ICML 2018Code Available1· sign in to hype

Tuomas Haarnoja, Aurick Zhou, Pieter Abbeel, Sergey Levine

Code Available — Be the first to reproduce this paper.

Reproduce

Code

Abstract

A platform for Applied Reinforcement Learning (Applied RL)

Tasks

Benchmark Results

DatasetModelMetricClaimedVerifiedStatus
Ant-v4SACAverage Return5,208.09Unverified
HalfCheetah-v4SACAverage Return15,836.04Unverified
Hopper-v4SACAverage Return2,882.56Unverified
Humanoid-v4SACAverage Return6,211.5Unverified
Walker2d-v4SACAverage Return5,745.27Unverified

Reproductions