SOTAVerified

OpenAI Gym

An open-source toolkit from OpenAI that implements several Reinforcement Learning benchmarks including: classic control, Atari, Robotics and MuJoCo tasks.

(Description by Evolutionary learning of interpretable decision trees)

(Image Credit: OpenAI Gym)

Papers

Showing 301350 of 382 papers

TitleStatusHype
Proximal Distilled Evolutionary Reinforcement LearningCode0
Self-Supervised State-Control through Intrinsic Mutual Information RewardsCode0
IN-RIL: Interleaved Reinforcement and Imitation Learning for Policy Fine-TuningCode0
Comparing the Efficacy of Fine-Tuning and Meta-Learning for Few-Shot Policy ImitationCode0
Generative Adversarial Neuroevolution for Control Behaviour ImitationCode0
Intelligent Trainer for Model-Based Reinforcement LearningCode0
Visual Transfer between Atari Games using Competitive Reinforcement LearningCode0
Invariant Transform Experience Replay: Data Augmentation for Deep Reinforcement LearningCode0
Collaborative Deep Reinforcement LearningCode0
Investigating the Performance and Reliability, of the Q-Learning Algorithm in Various Unknown EnvironmentsCode0
Iroko: A Framework to Prototype Reinforcement Learning for Data Center Traffic ControlCode0
GAN Q-learningCode0
Catastrophic Interference in Reinforcement Learning: A Solution Based on Context Division and Knowledge DistillationCode0
VacSIM: Learning Effective Strategies for COVID-19 Vaccine Distribution using Reinforcement LearningCode0
Bridging Dimensions: Confident Reachability for High-Dimensional ControllersCode0
BindsNET: A machine learning-oriented spiking neural networks library in PythonCode0
Optimizing Attention and Cognitive Control Costs Using Temporally-Layered ArchitecturesCode0
Signal Novelty Detection as an Intrinsic Reward for RoboticsCode0
Foresee then Evaluate: Decomposing Value Estimation with Latent Future PredictionCode0
Flappy Hummingbird: An Open Source Dynamic Simulation of Flapping Wing Robots and AnimalsCode0
Faults in Deep Reinforcement Learning Programs: A Taxonomy and A Detection ApproachCode0
Quantum Deep Reinforcement Learning for Robot Navigation TasksCode0
Evolutionary learning of interpretable decision treesCode0
Sim-Env: Decoupling OpenAI Gym Environments from Simulation ModelsCode0
Advances in Experience ReplayCode0
skrl: Modular and Flexible Library for Reinforcement LearningCode0
SLM Lab: A Comprehensive Benchmark and Modular Software Framework for Reproducible Deep Reinforcement LearningCode0
A novel DDPG method with prioritized experience replayCode0
The PlayStation Reinforcement Learning Environment (PSXLE)Code0
Estimation Error Correction in Deep Reinforcement Learning for Deterministic Actor-Critic MethodsCode0
Dynamic Observation Policies in Observation Cost-Sensitive Reinforcement LearningCode0
Recurrent Predictive State Policy NetworksCode0
Double Successive Over-Relaxation Q-Learning with an Extension to Deep Reinforcement LearningCode0
Discrete Action On-Policy Learning with Action-Value CriticCode0
MDP environments for the OpenAI GymCode0
Soft-Bellman Equilibrium in Affine Markov Games: Forward Solutions and Inverse LearningCode0
MDP Playground: An Analysis and Debug Testbed for Reinforcement LearningCode0
Mining-Gym: A Configurable RL Benchmarking Environment for Truck Dispatch SchedulingCode0
Andes_gym: A Versatile Environment for Deep Reinforcement Learning in Power SystemsCode0
Analyzing Reinforcement Learning Benchmarks with Random Weight GuessingCode0
Amortized Variational Deep Q NetworkCode0
BF++: a language for general-purpose program synthesisCode0
Modular Deep Reinforcement Learning for Continuous Motion Planning with Temporal LogicCode0
Active inference: demystified and comparedCode0
Reinforcement Learning for Improving Agent DesignCode0
Deep Reinforcement Learning with Feedback-based ExplorationCode0
Deep Reinforcement Learning for Playing 2.5D Fighting GamesCode0
MORE-3S:Multimodal-based Offline Reinforcement Learning with Shared Semantic SpacesCode0
Adaptively Calibrated Critic Estimates for Deep Reinforcement LearningCode0
AIXIjs: A Software Demo for General Reinforcement LearningCode0
Show:102550
← PrevPage 7 of 8Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1MEowAverage Return6,586.33Unverified
2TD3Average Return5,942.55Unverified
3SACAverage Return5,208.09Unverified
4DDPGAverage Return1,712.12Unverified
5PPOAverage Return608.97Unverified
#ModelMetricClaimedVerifiedStatus
1SACAverage Return15,836.04Unverified
2DDPGAverage Return14,934.86Unverified
3TD3Average Return12,026.73Unverified
4MEowAverage Return10,981.47Unverified
5PPOAverage Return6,006.11Unverified
#ModelMetricClaimedVerifiedStatus
1MEowAverage Return3,332.99Unverified
2TD3Average Return3,319.98Unverified
3SACAverage Return2,882.56Unverified
4DDPGAverage Return1,290.24Unverified
5PPOAverage Return790.77Unverified
#ModelMetricClaimedVerifiedStatus
1MEowAverage Return6,923.22Unverified
2SACAverage Return6,211.5Unverified
3PPOAverage Return925.89Unverified
4TD3Average Return198.44Unverified
5DDPGAverage Return139.14Unverified
#ModelMetricClaimedVerifiedStatus
1SACAverage Return5,745.27Unverified
2MEowAverage Return5,526.66Unverified
3DDPGAverage Return2,994.54Unverified
4PPOAverage Return2,739.81Unverified
5TD3Average Return2,612.74Unverified
#ModelMetricClaimedVerifiedStatus
1TLAMean Reward5,163.54Unverified
2AWRMean Reward5,067Unverified
#ModelMetricClaimedVerifiedStatus
1Orthogonal decision treeAverage Return500Unverified
2Oblique decision treeAverage Return500Unverified
#ModelMetricClaimedVerifiedStatus
1TLAMean Reward9,571.99Unverified
2AWRMean Reward9,136Unverified
#ModelMetricClaimedVerifiedStatus
1TLAMean Reward3,458.22Unverified
2AWRMean Reward3,405Unverified
#ModelMetricClaimedVerifiedStatus
1Oblique decision treeAverage Return272.14Unverified
2AWRAverage Return229Unverified
#ModelMetricClaimedVerifiedStatus
1Orthogonal decision treeAverage Return-101.72Unverified
2Oblique decision treeAverage Return-106.02Unverified
#ModelMetricClaimedVerifiedStatus
1TLA with Hierarchical Reward FunctionsMean Reward-125.02Unverified
2TLAMean Reward-154.92Unverified
#ModelMetricClaimedVerifiedStatus
1AWRMean Reward5,813Unverified
2TLAMean Reward3,878.41Unverified
#ModelMetricClaimedVerifiedStatus
1AWRAverage Return4,996Unverified
#ModelMetricClaimedVerifiedStatus
1TLAMean Reward9,356.67Unverified
#ModelMetricClaimedVerifiedStatus
1TLAMean Reward1,000Unverified
#ModelMetricClaimedVerifiedStatus
1TLAMean Reward93.88Unverified