C-3PO: Cyclic-Three-Phase Optimization for Human-Robot Motion Retargeting based on Reinforcement Learning

2019-09-25Code Available0· sign in to hype

Taewoo Kim, Joo-Haeng Lee

Code Available — Be the first to reproduce this paper.

Code

github.com/gd-goblin/NTU_DB_Data_Loader
OfficialIn papernone★ 0
github.com/gd-goblin/C-3PO_RobotControlModule
none★ 0
github.com/gd-goblin/C-3PO_Motion_Retargeting_Module
pytorch★ 0

Abstract

Motion retargeting between heterogeneous polymorphs with different sizes and kinematic configurations requires a comprehensive knowledge of (inverse) kinematics. Moreover, it is non-trivial to provide a kinematic independent general solution. In this study, we developed a cyclic three-phase optimization method based on deep reinforcement learning for human-robot motion retargeting. The motion retargeting learning is performed using refined data in a latent space by the cyclic and filtering paths of our method. In addition, the human-in-the-loop based three-phase approach provides a framework for the improvement of the motion retargeting policy by both quantitative and qualitative manners. Using the proposed C-3PO method, we were successfully able to learn the motion retargeting skill between the human skeleton and motion of the multiple robots such as NAO, Pepper, Baxter and C-3PO.

Tasks

Deep Reinforcement Learning motion retargeting reinforcement-learning Reinforcement Learning Reinforcement Learning (RL)

C-3PO: Cyclic-Three-Phase Optimization for Human-Robot Motion Retargeting based on Reinforcement Learning

Code

Abstract

Tasks

Reproductions