SOTAVerified

OpenAI Gym

An open-source toolkit from OpenAI that implements several Reinforcement Learning benchmarks including: classic control, Atari, Robotics and MuJoCo tasks.

(Description by Evolutionary learning of interpretable decision trees)

(Image Credit: OpenAI Gym)

Papers

Showing 201250 of 382 papers

TitleStatusHype
Membership Inference Attacks Against Temporally Correlated Data in Deep Reinforcement Learning0
Decision-Making in Reinforcement Learning0
Zap Q-Learning With Nonlinear Function Approximation0
myGym: Modular Toolkit for Visuomotor Robotic Tasks0
Adaptive Planning with Generative Models under Uncertainty0
ReaCritic: Large Reasoning Transformer-based DRL Critic-model Scaling For Heterogeneous Networks0
Accelerating Reinforcement Learning with Suboptimal Guidance0
A Closed-Loop Multi-perspective Visual Servoing Approach with Reinforcement Learning0
A Comprehensive Guide to Combining R and Python code for Data Science, Machine Learning and Reinforcement Learning0
Active Inference in Hebbian Learning Networks0
Adaptive Droplet Routing in Digital Microfluidic Biochips Using Deep Reinforcement Learning0
Adaptive Experience Selection for Policy Gradient0
Adaptive Temporal Difference Learning with Linear Function Approximation0
A Deep Reinforcement Learning Strategy for UAV Autonomous Landing on a Platform0
A Dual Memory Structure for Efficient Use of Replay Memory in Deep Reinforcement Learning0
Advantage Weighted Regression: Simple and Scalable Off-Policy Reinforcement Learning0
Adversarial Body Shape Search for Legged Robots0
Adversarial Exploration Strategy for Self-Supervised Imitation Learning0
Adversarial joint attacks on legged robots0
Affine Transport for Sim-to-Real Domain Adaptation0
A Generalised Inverse Reinforcement Learning Framework0
Airlift Challenge: A Competition for Optimizing Cargo Delivery0
A Multi-Agent Reinforcement Learning Testbed for Cognitive Radio Applications0
An Analysis of Reinforcement Learning for Malaria Control0
An FPGA-Based On-Device Reinforcement Learning Approach using Online Sequential Learning0
An Independent Study of Reinforcement Learning and Autonomous Driving0
An Oracle and Observations for the OpenAI Gym / ALE Freeway Environment0
AppBuddy: Learning to Accomplish Tasks in Mobile Apps via Reinforcement Learning0
A Strategy-Oriented Bayesian Soft Actor-Critic Model0
A Surrogate-Assisted Controller for Expensive Evolutionary Reinforcement Learning0
Towards Understanding Asynchronous Advantage Actor-critic: Convergence and Linear Speedup0
Asynchronous Deep Double Duelling Q-Learning for Trading-Signal Execution in Limit Order Book Markets0
Attention Loss Adjusted Prioritized Experience Replay0
ACPO: A Policy Optimization Algorithm for Average MDPs with Constraints0
AWD3: Dynamic Reduction of the Estimation Bias0
Balancing a CartPole System with Reinforcement Learning -- A Tutorial0
Behavior Cloning in OpenAI using Case Based Reasoning0
Benchmarking Algorithms from Machine Learning for Low-Budget Black-Box Optimization0
BlockPuzzle - A Challenge in Physical Reasoning and Generalization for Robot Learning0
Causal Repair of Learning-enabled Cyber-physical Systems0
Challenging On Car Racing Problem from OpenAI gym0
Closing the loop between neural network simulators and the OpenAI Gym0
Cluster-based Sampling in Hindsight Experience Replay for Robotic Tasks (Student Abstract)0
Combining Model-based and Model-free RL via Multi-step Control Variates0
Concept and the implementation of a tool to convert industry 4.0 environments modeled as FSM to an OpenAI Gym wrapper0
Contextual Policy Transfer in Reinforcement Learning Domains via Deep Mixtures-of-Experts0
Continuous-time Value Function Approximation in Reproducing Kernel Hilbert Spaces0
Controlling an Inverted Pendulum with Policy Gradient Methods-A Tutorial0
Control-Tutored Reinforcement Learning: Towards the Integration of Data-Driven and Model-Based Control0
Correcting discount-factor mismatch in on-policy policy gradient methods0
Show:102550
← PrevPage 5 of 8Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1MEowAverage Return6,586.33Unverified
2TD3Average Return5,942.55Unverified
3SACAverage Return5,208.09Unverified
4DDPGAverage Return1,712.12Unverified
5PPOAverage Return608.97Unverified
#ModelMetricClaimedVerifiedStatus
1SACAverage Return15,836.04Unverified
2DDPGAverage Return14,934.86Unverified
3TD3Average Return12,026.73Unverified
4MEowAverage Return10,981.47Unverified
5PPOAverage Return6,006.11Unverified
#ModelMetricClaimedVerifiedStatus
1MEowAverage Return3,332.99Unverified
2TD3Average Return3,319.98Unverified
3SACAverage Return2,882.56Unverified
4DDPGAverage Return1,290.24Unverified
5PPOAverage Return790.77Unverified
#ModelMetricClaimedVerifiedStatus
1MEowAverage Return6,923.22Unverified
2SACAverage Return6,211.5Unverified
3PPOAverage Return925.89Unverified
4TD3Average Return198.44Unverified
5DDPGAverage Return139.14Unverified
#ModelMetricClaimedVerifiedStatus
1SACAverage Return5,745.27Unverified
2MEowAverage Return5,526.66Unverified
3DDPGAverage Return2,994.54Unverified
4PPOAverage Return2,739.81Unverified
5TD3Average Return2,612.74Unverified
#ModelMetricClaimedVerifiedStatus
1TLAMean Reward5,163.54Unverified
2AWRMean Reward5,067Unverified
#ModelMetricClaimedVerifiedStatus
1Orthogonal decision treeAverage Return500Unverified
2Oblique decision treeAverage Return500Unverified
#ModelMetricClaimedVerifiedStatus
1TLAMean Reward9,571.99Unverified
2AWRMean Reward9,136Unverified
#ModelMetricClaimedVerifiedStatus
1TLAMean Reward3,458.22Unverified
2AWRMean Reward3,405Unverified
#ModelMetricClaimedVerifiedStatus
1Oblique decision treeAverage Return272.14Unverified
2AWRAverage Return229Unverified
#ModelMetricClaimedVerifiedStatus
1Orthogonal decision treeAverage Return-101.72Unverified
2Oblique decision treeAverage Return-106.02Unverified
#ModelMetricClaimedVerifiedStatus
1TLA with Hierarchical Reward FunctionsMean Reward-125.02Unverified
2TLAMean Reward-154.92Unverified
#ModelMetricClaimedVerifiedStatus
1AWRMean Reward5,813Unverified
2TLAMean Reward3,878.41Unverified
#ModelMetricClaimedVerifiedStatus
1AWRAverage Return4,996Unverified
#ModelMetricClaimedVerifiedStatus
1TLAMean Reward9,356.67Unverified
#ModelMetricClaimedVerifiedStatus
1TLAMean Reward1,000Unverified
#ModelMetricClaimedVerifiedStatus
1TLAMean Reward93.88Unverified