Rethinking on Multi-Stage Networks for Human Pose Estimation
Wenbo Li, Zhicheng Wang, Binyi Yin, Qixiang Peng, Yuming Du, Tianzi Xiao, Gang Yu, Hongtao Lu, Yichen Wei, Jian Sun
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/megvii-detection/MSPNOfficialIn paperpytorch★ 0
- github.com/open-mmlab/mmposepytorch★ 7,439
- github.com/chenyilun95/tf-cpntf★ 795
- github.com/hyperionfalling/lightposepytorch★ 0
- github.com/yangyucheng000/MSPNmindspore★ 0
- github.com/xiuyu0000/papers_with_examples/tree/main/mspnmindspore★ 0
- github.com/fenglinglwb/MSPNpytorch★ 0
Abstract
Existing pose estimation approaches fall into two categories: single-stage and multi-stage methods. While multi-stage methods are seemingly more suited for the task, their performance in current practice is not as good as single-stage methods. This work studies this issue. We argue that the current multi-stage methods' unsatisfactory performance comes from the insufficiency in various design choices. We propose several improvements, including the single-stage module design, cross stage feature aggregation, and coarse-to-fine supervision. The resulting method establishes the new state-of-the-art on both MS COCO and MPII Human Pose dataset, justifying the effectiveness of a multi-stage architecture. The source code is publicly available for further research.
Tasks
Benchmark Results
| Dataset | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| COCO minival | MSPN | AP | 75.9 | — | Unverified |
| COCO test-dev | MSPN | AP | 76.1 | — | Unverified |
| MPII Human Pose | MSPN | PCKh-0.5 | 92.6 | — | Unverified |