Pose Estimation

Pose Estimation is a computer vision task where the goal is to detect the position and orientation of a person or an object. Usually, this is done by predicting the location of specific keypoints like hands, head, elbows, etc. in case of Human Pose Estimation.

A common benchmark for this task is MPII Human Pose

( Image credit: Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 501–550 of 4228 papers

Title	Date	Tasks	Status	Hype
DiffPose: Toward More Reliable 3D Pose Estimation	Nov 30, 2022	3D Human Pose Estimation3D Pose Estimation	CodeCode Available	1
Kinematic-aware Hierarchical Attention Network for Human Pose Estimation in Videos	Nov 29, 2022	2D Pose Estimation3D Human Pose Estimation	CodeCode Available	1
SliceMatch: Geometry-guided Aggregation for Cross-View Pose Estimation	Nov 26, 2022	Camera Pose EstimationContrastive Learning	CodeCode Available	1
PoET: Pose Estimation Transformer for Single-View, Multi-Object 6D Pose Estimation	Nov 25, 2022	6D Pose Estimation6D Pose Estimation using RGB	CodeCode Available	1
CAD2Render: A Modular Toolkit for GPU-accelerated Photorealistic Synthetic Data Generation for the Manufacturing Industry	Nov 25, 2022	GPUobject-detection	CodeCode Available	1
Pose-disentangled Contrastive Learning for Self-supervised Facial Representation	Nov 24, 2022	Contrastive LearningData Augmentation	CodeCode Available	1
CPPF++: Uncertainty-Aware Sim2Real Object Pose Estimation by Vote Aggregation	Nov 24, 2022	Pose Estimation	CodeCode Available	1
Level-S^2fM: Structure from Motion on Neural Level Set of Implicit Surfaces	Nov 22, 2022	3D ReconstructionCamera Pose Estimation	CodeCode Available	1
Anatomy-guided domain adaptation for 3D in-bed human pose estimation	Nov 22, 2022	3D Human Pose EstimationAnatomy	CodeCode Available	1
Simultaneous Multiple Object Detection and Pose Estimation using 3D Model Infusion with Monocular Vision	Nov 21, 2022	Autonomous DrivingObject	CodeCode Available	1
Normalizing Flows for Human Pose Anomaly Detection	Nov 20, 2022	Abnormal Event Detection In VideoAnomaly Detection	CodeCode Available	1
TAX-Pose: Task-Specific Cross-Pose Estimation for Robot Manipulation	Nov 17, 2022	Pose EstimationRobot Manipulation	CodeCode Available	1
Interacting Hand-Object Pose Estimation via Dense Mutual Attention	Nov 16, 2022	3D Hand Pose Estimationhand-object pose	CodeCode Available	1
Robust Collaborative 3D Object Detection in Presence of Pose Errors	Nov 14, 2022	3D Object DetectionObject	CodeCode Available	1
GAPartNet: Cross-Category Domain-Generalizable Object Perception and Manipulation via Generalizable and Actionable Parts	Nov 10, 2022	3D Instance SegmentationDomain Generalization	CodeCode Available	1
MEVID: Multi-view Extended Videos with Identities for Video Person Re-Identification	Nov 9, 2022	Multi-Object Trackingobject-detection	CodeCode Available	1
Bootstrapping Human Optical Flow and Pose	Oct 27, 2022	Optical Flow EstimationPose Estimation	CodeCode Available	1
THOR-Net: End-to-end Graformer-based Realistic Two Hands and Object Reconstruction with Self-supervision	Oct 25, 2022	Hand Pose EstimationObject Reconstruction	CodeCode Available	1
Video based Object 6D Pose Estimation using Transformers	Oct 24, 2022	6D Pose Estimation6D Pose Estimation using RGB	CodeCode Available	1
HuPR: A Benchmark for Human Pose Estimation Using Millimeter Wave Radar	Oct 22, 2022	2D Pose EstimationPose Estimation	CodeCode Available	1
CRT-6D: Fast 6D Object Pose Estimation with Cascaded Refinement Transformers	Oct 21, 2022	6D Pose Estimation using RGBObject	CodeCode Available	1
MEEV: Body Mesh Estimation On Egocentric Video	Oct 21, 2022	3D human pose and shape estimation3D Human Pose Estimation	CodeCode Available	1
Parallel Inversion of Neural Radiance Fields for Robust Pose Estimation	Oct 18, 2022	NeRFPose Estimation	CodeCode Available	1
Semi-supervised Body Parsing and Pose Estimation for Enhancing Infant General Movement Assessment	Oct 14, 2022	Data AugmentationGenerative Adversarial Network	CodeCode Available	1
Keypoint Cascade Voting for Point Cloud Based 6DoF Pose Estimation	Oct 14, 2022	Keypoint EstimationPose Estimation	CodeCode Available	1
DART: Articulated Hand Model with Diverse Accessories and Rich Textures	Oct 14, 2022	DiversityHand Pose Estimation	CodeCode Available	1
Self-Supervised Geometric Correspondence for Category-Level 6D Object Pose Estimation in the Wild	Oct 13, 2022	6D Pose Estimation6D Pose Estimation using RGB	CodeCode Available	1
VL4Pose: Active Learning Through Out-Of-Distribution Detection For Pose Estimation	Oct 12, 2022	Active LearningHand Pose Estimation	CodeCode Available	1
Uplift and Upsample: Efficient 3D Human Pose Estimation with Uplifting Transformers	Oct 12, 2022	2D Pose Estimation3D Human Pose Estimation	CodeCode Available	1
CASAPose: Class-Adaptive and Semantic-Aware Multi-Object Pose Estimation	Oct 11, 2022	6D Pose Estimation6D Pose Estimation using RGB	CodeCode Available	1
DCL-Net: Deep Correspondence Learning Network for 6D Pose Estimation	Oct 11, 2022	6D Pose Estimation6D Pose Estimation using RGB	CodeCode Available	1
SiNeRF: Sinusoidal Neural Radiance Fields for Joint Pose Estimation and Scene Reconstruction	Oct 10, 2022	Image GenerationNeRF	CodeCode Available	1
Spectral Geometric Verification: Re-Ranking Point Cloud Retrieval for Metric Localization	Oct 10, 2022	Point Cloud RegistrationPoint Cloud Retrieval	CodeCode Available	1
AdaptivePose++: A Powerful Single-Stage Network for Multi-Person Pose Regression	Oct 8, 2022	3D Multi-Person Pose EstimationHuman Detection	CodeCode Available	1
PCKRF: Point Cloud Completion and Keypoint Refinement With Fusion Data for 6D Pose Estimation	Oct 7, 2022	6D Pose EstimationPoint Cloud Completion	CodeCode Available	1
MBW: Multi-view Bootstrapping in the Wild	Oct 4, 2022	3D ReconstructionPose Estimation	CodeCode Available	1
Generative Category-Level Shape and Pose Estimation with Semantic Primitives	Oct 3, 2022	6D Pose Estimation using RGBDDiversity	CodeCode Available	1
Obj2Seq: Formatting Objects as Sequences with Class Prompt for Visual Tasks	Sep 28, 2022	Multi-Label ClassificationMUlTI-LABEL-ClASSIFICATION	CodeCode Available	1
Hierarchical Temporal Transformer for 3D Hand Pose Estimation and Action Recognition from Egocentric RGB Videos	Sep 20, 2022	3D Hand Pose EstimationAction Recognition	CodeCode Available	1
D&D: Learning Human Dynamics from Dynamic Camera	Sep 19, 2022	3D Human Pose EstimationHuman Dynamics	CodeCode Available	1
PPT: token-Pruned Pose Transformer for monocular and multi-view human pose estimation	Sep 16, 2022	2D Human Pose Estimation3D Human Pose Estimation	CodeCode Available	1
TempCLR: Reconstructing Hands via Time-Coherent Contrastive Learning	Sep 1, 2022	Contrastive LearningHand Pose Estimation	CodeCode Available	1
Light curve completion and forecasting using fast and scalable Gaussian processes (MuyGPs)	Aug 31, 2022	Gaussian ProcessesPose Estimation	CodeCode Available	1
6IMPOSE: Bridging the Reality Gap in 6D Pose Estimation for Robotic Grasping	Aug 30, 2022	6D Pose EstimationPose Estimation	CodeCode Available	1
PoseBERT: A Generic Transformer Module for Temporal 3D Human Modeling	Aug 22, 2022	Pose EstimationPose Prediction	CodeCode Available	1
Unifying Visual Perception by Dispersible Points Learning	Aug 18, 2022	Instance SegmentationObject	CodeCode Available	1
MoCapDeform: Monocular 3D Human Motion Capture in Deformable Scenes	Aug 17, 2022	3D Human Pose EstimationPose Estimation	CodeCode Available	1
SMPL-IK: Learned Morphology-Aware Inverse Kinematics for AI Driven Artistic Workflows	Aug 16, 2022	Pose Estimation	CodeCode Available	1
PoseTrans: A Simple Yet Effective Pose Transformation Augmentation for Human Pose Estimation	Aug 16, 2022	Data AugmentationDiversity	CodeCode Available	1
Jointformer: Single-Frame Lifting Transformer with Error Prediction and Refinement for 3D Human Pose Estimation	Aug 7, 2022	3D Human Pose EstimationMonocular 3D Human Pose Estimation	CodeCode Available	1

Show:10 25 50

← PrevPage 11 of 85Next →

All datasets COCO test-dev MPII Human Pose OCHuman Leeds Sports Poses CrowdPose COCO val2017 AIC COCO (Common Objects in Context)InLoc ITOP front-view J-HMDB MPII Single Person

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	yolopose	AP50	90.3	—	Unverified
2	ViTPose (ViTAE-G, ensemble)	AP	81.1	—	Unverified
3	ViTPose (ViTAE-G)	AP	80.9	—	Unverified
4	PoseBH-H	AP	79.5	—	Unverified
5	UDP-Pose-PSA(384x288)	AP	79.5	—	Unverified
6	4xRSN-50 (ensemble)	AP	79.2	—	Unverified
7	UDP-Pose-PSA(256x192)	AP	78.9	—	Unverified
8	CCM+	AP	78.9	—	Unverified
9	4xRSN-50	AP	78.6	—	Unverified
10	PCT (256x256)	AP	78.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PCT (swin-l, test set)	PCKh-0.5	94.3	—	Unverified
2	Soft-gated Skip Connections	PCKh-0.5	94.1	—	Unverified
3	Cascade Feature Aggregation	PCKh-0.5	93.9	—	Unverified
4	PCT (swin-b, test set)	PCKh-0.5	93.8	—	Unverified
5	TransPose	PCKh-0.5	93.5	—	Unverified
6	UniHCP (FT)	PCKh-0.5	93.2	—	Unverified
7	4xRSN-50	PCKh-0.5	93	—	Unverified
8	UniPose	PCKh-0.5	92.7	—	Unverified
9	MSPN	PCKh-0.5	92.6	—	Unverified
10	Spatial Context	PCKh-0.5	92.5	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ViTPose (ViTAE-G, GT bounding boxes)	Test AP	93.3	—	Unverified
2	UniHCP (direct eval)	Test AP	87.4	—	Unverified
3	PoseBH-H	Test AP	87	—	Unverified
4	RTMPose(RTMPose-l, GT bounding boxes)	Test AP	80.3	—	Unverified
5	TransPose-H	Validation AP	62.3	—	Unverified
6	BBox-Mask-Pose 2x	Test AP	48.3	—	Unverified
7	BUCTD (CID-W32)	Test AP	47.2	—	Unverified
8	HQNet (ViT-L)	Test AP	45.6	—	Unverified
9	MaskPose-b	Test AP	45	—	Unverified
10	CID (HRNet-W48)	Test AP	45	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	OmniPose	PCK	99.5	—	Unverified
2	Soft-gated Skip Connections	PCK	94.8	—	Unverified
3	Residual Hourglass + ASR + AHO	PCK	94.5	—	Unverified
4	UniPose	PCK	94.5	—	Unverified
5	Chou et al. arXiv'17	PCK	94	—	Unverified
6	Pyramid Residual Modules (PRMs)	PCK	93.9	—	Unverified
7	Stacked hourglass + Inception-resnet	PCK	93.9	—	Unverified
8	Multi-Context Attention	PCK	92.6	—	Unverified
9	FPD	PCK	90.8	—	Unverified
10	Part heatmap regression (ResNet-152)	PCK	90.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BUCTD-W48 (w/cond. input from PETR, and generative sampling)	AP	78.5	—	Unverified
2	ViTPose-G	AP	78.3	—	Unverified
3	BUCTD-W48 (w/cond. input from PETR)	AP	76.7	—	Unverified
4	SwinV2-L 1K-MIM	AP	75.5	—	Unverified
5	SwinV2-B 1K-MIM	AP	74.9	—	Unverified
6	BUCTD-W48	AP	72.9	—	Unverified
7	OpenPifPaf	AP	70.5	—	Unverified
8	MIPNet (HRNet-W48)	AP	70	—	Unverified
9	KAPAO-L	AP	68.9	—	Unverified
10	KAPAO-M	AP	67.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CCNet (ViTPose-B_GT-bbox_256x192)	AP	78.1	—	Unverified
2	MogaNet-B (384x288)	AP	77.3	—	Unverified
3	ViTPose-B (Single-task_GT-bbox_256x192)	AP	77.3	—	Unverified
4	MogaNet-S (384x288)	AP	76.4	—	Unverified
5	Bias (HRNet_256x192)	AP	75.8	—	Unverified
6	ViTPose-B (Single-task_Det-bbox_256x192)	AP	75.8	—	Unverified
7	HRNet (256x192)	AP	75.3	—	Unverified
8	MogaNet-S (256x192)	AP	74.9	—	Unverified
9	MogaNet-T (256x192)	AP	73.2	—	Unverified
10	RLE (256x192)	AP	71.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Hulk(Finetune, ViT-L)	AP	37.1	—	Unverified
2	Hulk(Finetune, ViT-B)	AP	35.6	—	Unverified
3	HRFormer (HRFomer-B)	AP	34.4	—	Unverified
4	UniHCP (finetune)	AP	33.6	—	Unverified
5	HRNet (HRNet-w48 )	AP	33.5	—	Unverified
6	HRNet (HRNet-w32)	AP	32.3	—	Unverified
7	HRFormer (HRFomer-S)	AP	31.6	—	Unverified
8	SimpleBaseline (ResNet-152)	AP	29.9	—	Unverified
9	SimpleBaseline (ResNet-101)	AP	29.4	—	Unverified
10	SimpleBaseline (ResNet-50)	AP	28	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BUCTD (PETR, with generative sampling)	APL	83.7	—	Unverified
2	OmniPose (WASPv2)	AP	79.5	—	Unverified
3	MetaPrompt-SD	AP	79	—	Unverified
4	Hulk(Finetune, ViT-L)	AP	78.7	—	Unverified
5	BUCTD (PETR, with generative sampling)	AP	77.8	—	Unverified
6	Hulk(Finetune, ViT-B)	AP	77.5	—	Unverified
7	I²R-Net (1st stage:HRFormer-B)	AP	77.3	—	Unverified
8	PATH (Partial FT)	AP	77.1	—	Unverified
9	SOLIDER (swin-B)	AP	76.6	—	Unverified
10	PEFORMER-Xcit-dino-p8	AP	72.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GIM-DKM	DUC1-Acc@0.25m,10°	57.1	—	Unverified
2	GIM-LoFTR	DUC1-Acc@0.25m,10°	54.5	—	Unverified
3	GIM-SuperGlue	DUC1-Acc@0.25m,10°	53.5	—	Unverified
4	DKM	DUC1-Acc@0.25m,10°	51.5	—	Unverified
5	SuperGlue	DUC1-Acc@0.25m,10°	49	—	Unverified
6	LoFTR	DUC1-Acc@0.25m,10°	47.5	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	AdaPose	Mean mAP	93.38	—	Unverified
2	DECA-D3	Mean mAP	88.75	—	Unverified
3	V2V-PoseNet	Mean mAP	88.74	—	Unverified
4	A2J	Mean mAP	88	—	Unverified
5	REN	Mean mAP	84.9	—	Unverified
6	Multi-task learning + viewpoint invariance	Mean mAP	77.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SimpleBaseline + HANet	Mean PCK@0.2	99.6	—	Unverified
2	DeciWatch	Mean PCK@0.2	99	—	Unverified
3	LSTM PM	Mean PCK@0.2	93.6	—	Unverified
4	CPM	Mean PCK@0.2	91.9	—	Unverified
5	UniTrack_i18	Mean PCK@0.2	80.5	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	4xRSN-50	PCKh@0.5	93	—	Unverified
2	Refine	PCKh@0.5	92.1	—	Unverified
3	EfficientPose IV	PCKh@0.5	91.2	—	Unverified
4	OpenPose	PCKh@0.5	88.8	—	Unverified
5	Adversarial Learning	PCKh@0.5	88.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	OmniPose	Mean PCK@0.2	99.4	—	Unverified
2	UniPose-LSTM	Mean PCK@0.2	99.3	—	Unverified
3	LSTM PM	Mean PCK@0.2	97.7	—	Unverified
4	Thin-Slicing	Mean PCK@0.2	96.5	—	Unverified
5	Iqbal et al.	Mean PCK@0.2	81.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DP-RCNN-DeepLab (ResNet-101)	AP	68	—	Unverified