SOTAVerified

OpenAI Gym

An open-source toolkit from OpenAI that implements several reinforcement learning benchmarks, including classic control, Atari, robotics, and MuJoCo tasks.

(Description by Evolutionary learning of interpretable decision trees)

Papers

Showing 251-300 of 382 papers

| Title | Status | Hype |
| CrowdPlay: Crowdsourcing human demonstration data for offline learning in Atari games | | 0 |
| CT-DQN: Control-Tutored Deep Reinforcement Learning | | 0 |
| Curiosity-Driven Experience Prioritization via Density Estimation | | 0 |
| Data Driven Control with Learned Dynamics: Model-Based versus Model-Free Approach | | 0 |
| Dealing with Sparse Rewards in Continuous Control Robotics via Heavy-Tailed Policies | | 0 |
| Deep Learning of Koopman Representation for Control | | 0 |
| Deep Q Learning from Dynamic Demonstration with Behavioral Cloning | | 0 |
| Deep Q-Learning with Q-Matrix Transfer Learning for Novel Fire Evacuation Environment | | 0 |
| Deep Q-Network Based Multi-agent Reinforcement Learning with Binary Action Agents | | 0 |
| Deep Reinforcement Learning for ESG financial portfolio management | | 0 |
| Deep Reinforcement Learning with Mixed Convolutional Network | | 0 |
| Design of Artificial Intelligence Agents for Games using Deep Reinforcement Learning | | 0 |
| DESTA: A Framework for Safe Reinforcement Learning with Markov Games of Intervention | | 0 |
| Differentially Private Temporal Difference Learning with Stochastic Nonconvex-Strongly-Concave Optimization | | 0 |
| Direct Mutation and Crossover in Genetic Algorithms Applied to Reinforcement Learning Tasks | | 0 |
| Discovering Individual Rewards in Collective Behavior through Inverse Multi-Agent Reinforcement Learning | | 0 |
| Distilling Deep RL Models Into Interpretable Neuro-Fuzzy Systems | | 0 |
| Distributionally Robust Statistical Verification with Imprecise Neural Networks | | 0 |
| Double A3C: Deep Reinforcement Learning on OpenAI Gym Games | | 0 |
| DQN with model-based exploration: efficient learning on environments with sparse rewards | | 0 |
| DriverGym: Democratising Reinforcement Learning for Autonomous Driving | | 0 |
| Easy as ABCs: Unifying Boltzmann Q-Learning and Counterfactual Regret Minimization | | 0 |
| EasyRL: A Simple and Extensible Reinforcement Learning Framework | | 0 |
| Elastic Step DQN: A novel multi-step algorithm to alleviate overestimation in Deep QNetworks | | 0 |
| Enhancing Cyber Resilience of Networked Microgrids using Vertical Federated Reinforcement Learning | | 0 |
| Enhancing Hardware Fault Tolerance in Machines with Reinforcement Learning Policy Gradient Algorithms | | 0 |
| Enhancing Privacy and Security of Autonomous UAV Navigation | | 0 |
| Error Controlled Actor-Critic Method to Reinforcement Learning | | 0 |
| Evading Web Application Firewalls with Reinforcement Learning | | 0 |
| Evolutionary Selective Imitation: Interpretable Agents by Imitation Learning Without a Demonstrator | | 0 |
| Evolving Neural Networks in Reinforcement Learning by means of UMDAc | | 0 |
| EVO-RL: Evolutionary-Driven Reinforcement Learning | | 0 |
| Exact and Cost-Effective Automated Transformation of Neural Network Controllers to Decision Tree Controllers | | 0 |
| Experience Replay More When It's a Key Transition in Deep Reinforcement Learning | | 0 |
| Exploration and preference satisfaction trade-off in reward-free learning | | 0 |
| Exploring Deep Recurrent Models with Reinforcement Learning for Molecule Design | | 0 |
| Extended Radial Basis Function Controller for Reinforcement Learning | | 0 |
| FuzzerGym: A Competitive Framework for Fuzzing and Learning | | 0 |
| GeneSys: Enabling Continuous Learning through Neural Network Evolution in Hardware | | 0 |
| Graph Decision Transformer | | 0 |
| Gym-preCICE: Reinforcement Learning Environments for Active Flow Control | | 0 |
| Gym-saturation: an OpenAI Gym environment for saturation provers | | 0 |
| gym-saturation: Gymnasium environments for saturation provers (System description) | | 0 |
| HoME: a Household Multimodal Environment | | 0 |
| HomeLabGym: A real-world testbed for home energy management systems | | 0 |
| Human AI interaction loop training: New approach for interactive reinforcement learning | | 0 |
| Hybrid Policies Using Inverse Rewards for Reinforcement Learning | | 0 |
| Hypothesis Driven Coordinate Ascent for Reinforcement Learning | | 0 |
| Illuminating Spaces: Deep Reinforcement Learning and Laser-Wall Partitioning for Architectural Layout Generation | | 0 |
| Imaginary Hindsight Experience Replay: Curious Model-based Learning for Sparse Reward Tasks | | 0 |

Benchmark Results

| # | Model | Metric | Claimed | Verified | Status |
| 1 | MEow | Average Return | 6,586.33 | | Unverified |
| 2 | TD3 | Average Return | 5,942.55 | | Unverified |
| 3 | SAC | Average Return | 5,208.09 | | Unverified |
| 4 | DDPG | Average Return | 1,712.12 | | Unverified |
| 5 | PPO | Average Return | 608.97 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| 1 | SAC | Average Return | 15,836.04 | | Unverified |
| 2 | DDPG | Average Return | 14,934.86 | | Unverified |
| 3 | TD3 | Average Return | 12,026.73 | | Unverified |
| 4 | MEow | Average Return | 10,981.47 | | Unverified |
| 5 | PPO | Average Return | 6,006.11 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| 1 | MEow | Average Return | 3,332.99 | | Unverified |
| 2 | TD3 | Average Return | 3,319.98 | | Unverified |
| 3 | SAC | Average Return | 2,882.56 | | Unverified |
| 4 | DDPG | Average Return | 1,290.24 | | Unverified |
| 5 | PPO | Average Return | 790.77 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| 1 | MEow | Average Return | 6,923.22 | | Unverified |
| 2 | SAC | Average Return | 6,211.5 | | Unverified |
| 3 | PPO | Average Return | 925.89 | | Unverified |
| 4 | TD3 | Average Return | 198.44 | | Unverified |
| 5 | DDPG | Average Return | 139.14 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| 1 | SAC | Average Return | 5,745.27 | | Unverified |
| 2 | MEow | Average Return | 5,526.66 | | Unverified |
| 3 | DDPG | Average Return | 2,994.54 | | Unverified |
| 4 | PPO | Average Return | 2,739.81 | | Unverified |
| 5 | TD3 | Average Return | 2,612.74 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| 1 | TLA | Mean Reward | 5,163.54 | | Unverified |
| 2 | AWR | Mean Reward | 5,067 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| 1 | Orthogonal decision tree | Average Return | 500 | | Unverified |
| 2 | Oblique decision tree | Average Return | 500 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| 1 | TLA | Mean Reward | 9,571.99 | | Unverified |
| 2 | AWR | Mean Reward | 9,136 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| 1 | TLA | Mean Reward | 3,458.22 | | Unverified |
| 2 | AWR | Mean Reward | 3,405 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| 1 | Oblique decision tree | Average Return | 272.14 | | Unverified |
| 2 | AWR | Average Return | 229 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| 1 | Orthogonal decision tree | Average Return | -101.72 | | Unverified |
| 2 | Oblique decision tree | Average Return | -106.02 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| 1 | TLA with Hierarchical Reward Functions | Mean Reward | -125.02 | | Unverified |
| 2 | TLA | Mean Reward | -154.92 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| 1 | AWR | Mean Reward | 5,813 | | Unverified |
| 2 | TLA | Mean Reward | 3,878.41 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| 1 | AWR | Average Return | 4,996 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| 1 | TLA | Mean Reward | 9,356.67 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| 1 | TLA | Mean Reward | 1,000 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
| 1 | TLA | Mean Reward | 93.88 | | Unverified |
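
This page does not define the "Average Return" metric reported in the benchmark results. A common convention, assumed here, is the mean of the undiscounted per-episode reward sums over a number of evaluation episodes. The sketch below illustrates that calculation on a hypothetical toy environment with a Gym-style reset/step interface; `ToyCorridorEnv`, `episode_return`, `average_return`, and the two policies are illustrative names, not part of OpenAI Gym, and the listed papers may use different evaluation protocols.

```python
import random
from statistics import mean


class ToyCorridorEnv:
    """Hypothetical toy environment (not a Gym class).

    Moving right earns +1, moving left earns -1; an episode lasts
    `horizon` steps.
    """

    def __init__(self, horizon=10):
        self.horizon = horizon
        self.position = 0
        self.steps = 0

    def reset(self):
        self.position = 0
        self.steps = 0
        return self.position  # observation

    def step(self, action):
        # action 1 moves right, any other action moves left
        move = 1 if action == 1 else -1
        self.position += move
        self.steps += 1
        done = self.steps >= self.horizon
        return self.position, float(move), done


def episode_return(env, policy):
    """Sum the undiscounted rewards collected in one episode."""
    obs = env.reset()
    total, done = 0.0, False
    while not done:
        obs, reward, done = env.step(policy(obs))
        total += reward
    return total


def average_return(env, policy, episodes=100):
    """Mean episodic return over `episodes` evaluation episodes (assumed definition)."""
    return mean(episode_return(env, policy) for _ in range(episodes))


def always_right(obs):
    return 1


def make_random_policy(seed=0):
    rng = random.Random(seed)
    return lambda obs: rng.choice([0, 1])


if __name__ == "__main__":
    env = ToyCorridorEnv(horizon=10)
    print(average_return(env, always_right, episodes=5))  # 10.0
    print(average_return(env, make_random_policy(), episodes=1000))
```

The always-right policy collects +1 on each of the 10 steps, so every episode returns 10.0 and so does the average. The random policy's average depends on the seed and should sit near zero.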