Pose Estimation
Pose Estimation is a computer vision task where the goal is to detect the position and orientation of a person or an object. Usually, this is done by predicting the location of specific keypoints like hands, head, elbows, etc. in case of Human Pose Estimation.
A common benchmark for this task is MPII Human Pose
( Image credit: Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose )
Papers
Showing 1–10 of 4228 papers
All datasetsCOCO test-devMPII Human PoseOCHumanLeeds Sports PosesCrowdPoseCOCO val2017AICCOCO (Common Objects in Context)InLocITOP front-viewJ-HMDBMPII Single Person
Benchmark Results
| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | CCNet (ViTPose-B_GT-bbox_256x192) | AP | 78.1 | — | Unverified |
| 2 | MogaNet-B (384x288) | AP | 77.3 | — | Unverified |
| 3 | ViTPose-B (Single-task_GT-bbox_256x192) | AP | 77.3 | — | Unverified |
| 4 | MogaNet-S (384x288) | AP | 76.4 | — | Unverified |
| 5 | Bias (HRNet_256x192) | AP | 75.8 | — | Unverified |
| 6 | ViTPose-B (Single-task_Det-bbox_256x192) | AP | 75.8 | — | Unverified |
| 7 | HRNet (256x192) | AP | 75.3 | — | Unverified |
| 8 | MogaNet-S (256x192) | AP | 74.9 | — | Unverified |
| 9 | MogaNet-T (256x192) | AP | 73.2 | — | Unverified |
| 10 | RLE (256x192) | AP | 71.3 | — | Unverified |