Train a Real-world Local Path Planner in One Hour via Partially Decoupled Reinforcement Learning and Vectorized Diversity
Jinghao Xin, Jinwoo Kim, Zhi Li, Ning li
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/xinjinghao/colorOfficialIn papernone★ 22
Abstract
Deep Reinforcement Learning (DRL) has exhibited efficacy in resolving the Local Path Planning (LPP) problem. However, such application in the real world is immensely limited due to the deficient training efficiency and generalization capability of DRL. To alleviate these two issues, a solution named Color is proposed, which consists of an Actor-Sharer-Learner (ASL) training framework and a mobile robot-oriented simulator Sparrow. Specifically, the ASL intends to improve the training efficiency of DRL algorithms. It employs a Vectorized Data Collection (VDC) mode to expedite data acquisition, decouples the data collection from model optimization by multithreading, and partially connects the two procedures by harnessing a Time Feedback Mechanism (TFM) to evade data underuse or overuse. Meanwhile, the Sparrow simulator utilizes a 2D grid-based world, simplified kinematics, and conversion-free data flow to achieve a lightweight design. The lightness facilitates vectorized diversity, allowing diversified simulation setups across extensive copies of the vectorized environments, resulting in a notable enhancement in the generalization capability of the DRL algorithm being trained. Comprehensive experiments, comprising 57 DRL benchmark environments, 32 simulated and 36 real-world LPP scenarios, have been conducted to corroborate the superiority of our method in terms of efficiency and generalization. The code and the video of this paper are accessible at https://github.com/XinJingHao/Color.
Tasks
Benchmark Results
| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| Atari 2600 Alien | ASL DDQN | Score | 6,955.2 | — | Unverified |
| Atari 2600 Amidar | ASL DDQN | Score | 2,232.3 | — | Unverified |
| Atari 2600 Assault | ASL DDQN | Score | 14,372.8 | — | Unverified |
| Atari 2600 Asterix | ASL DDQN | Score | 567,640 | — | Unverified |
| Atari 2600 Asteroids | ASL DDQN | Score | 1,984.5 | — | Unverified |
| Atari 2600 Atlantis | ASL DDQN | Score | 947,275 | — | Unverified |
| Atari 2600 Bank Heist | ASL DDQN | Score | 1,340.9 | — | Unverified |
| Atari 2600 Battle Zone | ASL DDQN | Score | 38,986 | — | Unverified |
| Atari 2600 Beam Rider | ASL DDQN | Score | 26,841.6 | — | Unverified |
| Atari 2600 Berzerk | ASL DDQN | Score | 2,597.2 | — | Unverified |
| Atari 2600 Bowling | ASL DDQN | Score | 62.4 | — | Unverified |
| Atari 2600 Boxing | ASL DDQN | Score | 99.6 | — | Unverified |
| Atari 2600 Breakout | ASL DDQN | Score | 621.7 | — | Unverified |
| Atari 2600 Centipede | ASL DDQN | Score | 3,899.8 | — | Unverified |
| Atari 2600 Chopper Command | ASL DDQN | Score | 15,071 | — | Unverified |
| Atari 2600 Crazy Climber | ASL DDQN | Score | 166,019 | — | Unverified |
| Atari 2600 Defender | ASL DDQN | Score | 37,026.5 | — | Unverified |
| Atari 2600 Demon Attack | ASL DDQN | Score | 119,773.9 | — | Unverified |
| Atari 2600 Double Dunk | ASL DDQN | Score | 0.1 | — | Unverified |
| Atari 2600 Enduro | ASL DDQN | Score | 2,103.1 | — | Unverified |
| Atari 2600 Fishing Derby | ASL DDQN | Score | 35.1 | — | Unverified |
| Atari 2600 Freeway | ASL DDQN | Score | 33.9 | — | Unverified |
| Atari 2600 Frostbite | ASL DDQN | Score | 8,616.4 | — | Unverified |
| Atari 2600 Gopher | ASL DDQN | Score | 103,514.4 | — | Unverified |
| Atari 2600 Gravitar | ASL DDQN | Score | 760 | — | Unverified |
| Atari 2600 HERO | ASL DDQN | Score | 26,578.5 | — | Unverified |
| Atari 2600 Ice Hockey | ASL DDQN | Score | -3.6 | — | Unverified |
| Atari 2600 James Bond | ASL DDQN | Score | 2,237 | — | Unverified |
| Atari 2600 Kangaroo | ASL DDQN | Score | 13,027 | — | Unverified |
| Atari 2600 Krull | ASL DDQN | Score | 10,422.5 | — | Unverified |
| Atari 2600 Kung-Fu Master | ASL DDQN | Score | 85,182 | — | Unverified |
| Atari 2600 Montezuma's Revenge | ASL DDQN | Score | 0 | — | Unverified |
| Atari 2600 Ms. Pacman | ASL DDQN | Score | 4,416 | — | Unverified |
| Atari 2600 Name This Game | ASL DDQN | Score | 16,535.4 | — | Unverified |
| Atari 2600 Phoenix | ASL DDQN | Score | 71,752.6 | — | Unverified |
| Atari 2600 Pitfall! | ASL DDQN | Score | 0 | — | Unverified |
| Atari 2600 Pong | ASL DDQN | Score | 21 | — | Unverified |
| Atari 2600 Private Eye | ASL DDQN | Score | 349.7 | — | Unverified |
| Atari 2600 Q*Bert | ASL DDQN | Score | 24,548.8 | — | Unverified |
| Atari 2600 River Raid | ASL DDQN | Score | 24,445 | — | Unverified |
| Atari 2600 Road Runner | ASL DDQN | Score | 56,520 | — | Unverified |
| Atari 2600 Robotank | ASL DDQN | Score | 65.8 | — | Unverified |
| Atari 2600 Seaquest | ASL DDQN | Score | 29,278.6 | — | Unverified |
| Atari 2600 Skiing | ASL DDQN | Score | -8,295.4 | — | Unverified |
| Atari 2600 Solaris | ASL DDQN | Score | 3,506.8 | — | Unverified |
| Atari 2600 Space Invaders | ASL DDQN | Score | 21,602 | — | Unverified |
| Atari 2600 Star Gunner | ASL DDQN | Score | 129,140 | — | Unverified |
| Atari 2600 Surround | ASL DDQN | Score | 2.5 | — | Unverified |
| Atari 2600 Tennis | ASL DDQN | Score | 22.3 | — | Unverified |
| Atari 2600 Time Pilot | ASL DDQN | Score | 12,071 | — | Unverified |