SOTAVerified

OpenAI Gym

An open-source toolkit from OpenAI that implements several Reinforcement Learning benchmarks including: classic control, Atari, Robotics and MuJoCo tasks.

(Description by Evolutionary learning of interpretable decision trees)

(Image Credit: OpenAI Gym)

Papers

Showing 151175 of 382 papers

TitleStatusHype
A Deep Reinforcement Learning Strategy for UAV Autonomous Landing on a Platform0
An Oracle and Observations for the OpenAI Gym / ALE Freeway Environment0
Implicit Sensing in Traffic Optimization: Advanced Deep Reinforcement Learning Techniques0
Extended Radial Basis Function Controller for Reinforcement Learning0
Exploring Deep Recurrent Models with Reinforcement Learning for Molecule Design0
Exploration and preference satisfaction trade-off in reward-free learning0
Contextual Policy Transfer in Reinforcement Learning Domains via Deep Mixtures-of-Experts0
Concept and the implementation of a tool to convert industry 4.0 environments modeled as FSM to an OpenAI Gym wrapper0
FuzzerGym: A Competitive Framework for Fuzzing and Learning0
A Dual Memory Structure for Efficient Use of Replay Memory in Deep Reinforcement Learning0
Improving Reinforcement Learning with Human Assistance: An Argument for Human Subject Studies with HIPPO Gym0
Experience Replay More When It's a Key Transition in Deep Reinforcement Learning0
AppBuddy: Learning to Accomplish Tasks in Mobile Apps via Reinforcement Learning0
Graph Decision Transformer0
Continuous-time Value Function Approximation in Reproducing Kernel Hilbert Spaces0
An Independent Study of Reinforcement Learning and Autonomous Driving0
Exact and Cost-Effective Automated Transformation of Neural Network Controllers to Decision Tree Controllers0
Combining Model-based and Model-free RL via Multi-step Control Variates0
Gym-saturation: an OpenAI Gym environment for saturation provers0
gym-saturation: Gymnasium environments for saturation provers (System description)0
Accelerating Reinforcement Learning with Suboptimal Guidance0
EVO-RL: Evolutionary-Driven Reinforcement Learning0
Evolving Neural Networks in Reinforcement Learning by means of UMDAc0
HoME: a Household Multimodal Environment0
Cluster-based Sampling in Hindsight Experience Replay for Robotic Tasks (Student Abstract)0
Show:102550
← PrevPage 7 of 16Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1MEowAverage Return6,586.33Unverified
2TD3Average Return5,942.55Unverified
3SACAverage Return5,208.09Unverified
4DDPGAverage Return1,712.12Unverified
5PPOAverage Return608.97Unverified
#ModelMetricClaimedVerifiedStatus
1SACAverage Return15,836.04Unverified
2DDPGAverage Return14,934.86Unverified
3TD3Average Return12,026.73Unverified
4MEowAverage Return10,981.47Unverified
5PPOAverage Return6,006.11Unverified
#ModelMetricClaimedVerifiedStatus
1MEowAverage Return3,332.99Unverified
2TD3Average Return3,319.98Unverified
3SACAverage Return2,882.56Unverified
4DDPGAverage Return1,290.24Unverified
5PPOAverage Return790.77Unverified
#ModelMetricClaimedVerifiedStatus
1MEowAverage Return6,923.22Unverified
2SACAverage Return6,211.5Unverified
3PPOAverage Return925.89Unverified
4TD3Average Return198.44Unverified
5DDPGAverage Return139.14Unverified
#ModelMetricClaimedVerifiedStatus
1SACAverage Return5,745.27Unverified
2MEowAverage Return5,526.66Unverified
3DDPGAverage Return2,994.54Unverified
4PPOAverage Return2,739.81Unverified
5TD3Average Return2,612.74Unverified
#ModelMetricClaimedVerifiedStatus
1TLAMean Reward5,163.54Unverified
2AWRMean Reward5,067Unverified
#ModelMetricClaimedVerifiedStatus
1Orthogonal decision treeAverage Return500Unverified
2Oblique decision treeAverage Return500Unverified
#ModelMetricClaimedVerifiedStatus
1TLAMean Reward9,571.99Unverified
2AWRMean Reward9,136Unverified
#ModelMetricClaimedVerifiedStatus
1TLAMean Reward3,458.22Unverified
2AWRMean Reward3,405Unverified
#ModelMetricClaimedVerifiedStatus
1Oblique decision treeAverage Return272.14Unverified
2AWRAverage Return229Unverified
#ModelMetricClaimedVerifiedStatus
1Orthogonal decision treeAverage Return-101.72Unverified
2Oblique decision treeAverage Return-106.02Unverified
#ModelMetricClaimedVerifiedStatus
1TLA with Hierarchical Reward FunctionsMean Reward-125.02Unverified
2TLAMean Reward-154.92Unverified
#ModelMetricClaimedVerifiedStatus
1AWRMean Reward5,813Unverified
2TLAMean Reward3,878.41Unverified
#ModelMetricClaimedVerifiedStatus
1AWRAverage Return4,996Unverified
#ModelMetricClaimedVerifiedStatus
1TLAMean Reward9,356.67Unverified
#ModelMetricClaimedVerifiedStatus
1TLAMean Reward1,000Unverified
#ModelMetricClaimedVerifiedStatus
1TLAMean Reward93.88Unverified