OpenAI Gym
An open-source toolkit from OpenAI that implements several Reinforcement Learning benchmarks including: classic control, Atari, Robotics and MuJoCo tasks.
(Description by Evolutionary learning of interpretable decision trees)
(Image Credit: OpenAI Gym)
Papers
Showing 1–10 of 382 papers
All datasetsAnt-v4HalfCheetah-v4Hopper-v4Humanoid-v4Walker2d-v4Ant-v2CartPole-v1HalfCheetah-v2Hopper-v2LunarLander-v2Mountain CarPendulum-v1
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Orthogonal decision tree | Average Return | 500 | — | Unverified |
| 2 | Oblique decision tree | Average Return | 500 | — | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Oblique decision tree | Average Return | 272.14 | — | Unverified |
| 2 | AWR | Average Return | 229 | — | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | Orthogonal decision tree | Average Return | -101.72 | — | Unverified |
| 2 | Oblique decision tree | Average Return | -106.02 | — | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | TLA with Hierarchical Reward Functions | Mean Reward | -125.02 | — | Unverified |
| 2 | TLA | Mean Reward | -154.92 | — | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | AWR | Average Return | 4,996 | — | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | TLA | Mean Reward | 9,356.67 | — | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | TLA | Mean Reward | 1,000 | — | Unverified |
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | TLA | Mean Reward | 93.88 | — | Unverified |