Pose Estimation

Pose Estimation is a computer vision task where the goal is to detect the position and orientation of a person or an object. Usually, this is done by predicting the location of specific keypoints like hands, head, elbows, etc. in case of Human Pose Estimation.

A common benchmark for this task is MPII Human Pose

( Image credit: Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose )

Papers

Recently Added Most Hyped Most Active Needs Verification Most Verified

Showing 2101–2150 of 4228 papers

Title	Date	Tasks	Status
Category-level Object Detection, Pose Estimation and Reconstruction from Stereo Images	Jul 9, 2024	DecoderObject	—Unverified
Category Level Object Pose Estimation via Neural Analysis-by-Synthesis	Aug 18, 2020	Image GenerationObject	—Unverified
Causal-Inspired Multitask Learning for Video-Based Human Pose Estimation	Jan 24, 2025	Pose Estimation	—Unverified
C-BEV: Contrastive Bird's Eye View Training for Cross-View Image Retrieval and 3-DoF Pose Estimation	Dec 13, 2023	Image RetrievalPose Estimation	—Unverified
CCS: Continuous Learning for Customized Incremental Wireless Sensing Services	Dec 6, 2024	Action RecognitionKnowledge Distillation	—Unverified
CDPN: Coordinates-Based Disentangled Pose Network for Real-Time RGB-Based 6-DoF Object Pose Estimation	Oct 1, 2019	6D Pose Estimation using RGBPose Estimation	—Unverified
CenDerNet: Center and Curvature Representations for Render-and-Compare 6D Pose Estimation	Aug 21, 2022	6D Pose EstimationObject	—Unverified
Center-Based Decoupled Point-cloud Registration for 6D Object Pose Estimation	Jan 1, 2023	6D Pose Estimation using RGBObject	—Unverified
CenterGrasp: Object-Aware Implicit Representation Learning for Simultaneous Shape Reconstruction and 6-DoF Grasp Estimation	Dec 13, 2023	DecoderObject	—Unverified
Certifiable Relative Pose Estimation	Mar 30, 2020	Pose Estimation	—Unverified
Chained Predictions Using Convolutional Neural Networks	May 8, 2016	Pose Estimation	—Unverified
ChaLearn Looking at People: Inpainting and Denoising challenges	Jun 24, 2021	DenoisingPose Estimation	—Unverified
Challenges for Monocular 6D Object Pose Estimation in Robotics	Jul 22, 2023	6D Pose Estimation using RGBObject	—Unverified
ChiNet: Deep Recurrent Convolutional Learning for Multimodal Spacecraft Pose Estimation	Aug 23, 2021	Pose EstimationSpacecraft Pose Estimation	—Unverified
CHIP: A multi-sensor dataset for 6D pose estimation of chairs in industrial settings	Jun 11, 2025	6D Pose EstimationPose Estimation	—Unverified
Cinematic Behavior Transfer via NeRF-based Differentiable Filming	Nov 29, 2023	NeRFPose Estimation	—Unverified
CLA-NeRF: Category-Level Articulated Neural Radiance Field	Feb 1, 2022	Inverse RenderingNeRF	—Unverified
Class Generative Models Based on Feature Regression for Pose Estimation of Object Categories	Jun 1, 2013	Pose Estimationregression	—Unverified
Classification of Phonological Parameters in Sign Languages	May 24, 2022	ClassificationPose Estimation	—Unverified
Classroom-Inspired Multi-Mentor Distillation with Adaptive Learning Strategies	Sep 30, 2024	2D Human Pose Estimationimage-classification	—Unverified
CLERF: Contrastive LEaRning for Full Range Head Pose Estimation	Dec 3, 2024	Contrastive LearningHead Pose Estimation	—Unverified
CLIP-Clique: Graph-based Correspondence Matching Augmented by Vision Language Models for Object-based Global Localization	Oct 4, 2024	Graph MatchingPose Estimation	—Unverified
CLIP-Hand3D: Exploiting 3D Hand Pose Estimation via Context-Aware Prompting	Sep 28, 2023	3D Hand Pose EstimationContrastive Learning	—Unverified
CLIPose: Category-Level Object Pose Estimation with Pre-trained Vision-Language Knowledge	Feb 24, 2024	Contrastive LearningLanguage Modelling	—Unverified
Cloth2Body: Generating 3D Human Body Mesh from 2D Clothing	Sep 28, 2023	Human Mesh RecoveryPose Estimation	—Unverified
Clothes-Changing Person Re-identification Based On Skeleton Dynamics	Mar 13, 2025	Clothes Changing Person Re-IdentificationPerson Re-Identification	—Unverified
ClothPose: A Real-world Benchmark for Visual Analysis of Garment Pose via An Indirect Recording Solution	Jan 1, 2023	2kPose Estimation	—Unverified
CloTH-VTON+: Clothing Three-dimensional reconstruction for Hybrid image-based Virtual Try-ON	Feb 16, 2021	Pose EstimationVirtual Try-on	—Unverified
Clustering-based Learning for UAV Tracking and Pose Estimation	May 27, 2024	ClusteringPose Estimation	—Unverified
CMRNext: Camera to LiDAR Matching in the Wild for Localization and Extrinsic Calibration	Jan 31, 2024	Optical Flow EstimationPose Estimation	—Unverified
CMS-RCNN: Contextual Multi-Scale Region-based CNN for Unconstrained Face Detection	Jun 17, 2016	Face DetectionFace Recognition	—Unverified
CNN-Based Action Recognition and Pose Estimation for Classifying Animal Behavior from Videos: A Survey	Jan 15, 2023	Action RecognitionPose Estimation	—Unverified
CNN Based Flank Predictor for Quadruped Animal Species	Jun 19, 2024	Animal Pose Estimationimage-classification	—Unverified
CNN-based real-time 2D-3D deformable registration from a single X-ray projection	Dec 15, 2022	AnatomyMixed Reality	—Unverified
Coarse-to-Fine for Sim-to-Real: Sub-Millimetre Precision Across Wide Task Spaces	May 24, 2021	Motion PlanningPose Estimation	—Unverified
Coarse-to-fine Semantic Localization with HD Map for Autonomous Driving in Structural Scenes	Jul 6, 2021	Autonomous DrivingPose Estimation	—Unverified
COBRA -- COnfidence score Based on shape Regression Analysis for method-independent quality assessment of object pose estimation from single images	Apr 25, 2024	Gaussian ProcessesObject	—Unverified
CodeVIO: Visual-Inertial Odometry with Learned Optimizable Dense Depth	Dec 18, 2020	Depth EstimationDepth Prediction	—Unverified
Coinbot: Intelligent Robotic Coin Bag Manipulation Using Deep Reinforcement Learning And Machine Teaching	Dec 2, 2020	Deep Reinforcement LearningMotion Planning	—Unverified
Collaborative Learning for 3D Hand-Object Reconstruction and Compositional Action Recognition from Egocentric RGB Videos Using Superquadrics	Jan 13, 2025	Action Recognitionhand-object pose	—Unverified
Collaborative Learning for Hand and Object Reconstruction with Attention-guided Graph Convolution	Apr 27, 2022	3D Hand Pose Estimation3D Pose Estimation	—Unverified
Collaborative Learning of Gesture Recognition and 3D Hand Pose Estimation with Multi-Order Feature Analysis	Aug 1, 2020	3D Hand Pose EstimationGesture Recognition	—Unverified
Collaboratively Self-supervised Video Representation Learning for Action Recognition	Jan 15, 2024	Action RecognitionPose Estimation	—Unverified
Combining 3D Model Contour Energy and Keypoints for Object Tracking	Feb 4, 2020	Object TrackingPose Estimation	—Unverified
Combining Absolute and Semi-Generalized Relative Poses for Visual Localization	Sep 21, 2024	Pose EstimationVisual Localization	—Unverified
Combining Deep and Depth: Deep Learning and Face Depth Maps for Driver Attention Monitoring	Dec 14, 2018	Deep LearningDriver Attention Monitoring	—Unverified
Combining detection and tracking for human pose estimation in videos	Mar 30, 2020	Pose EstimationPose Tracking	—Unverified
Combining Efficient and Precise Sign Language Recognition: Good pose estimation library is all you need	Sep 30, 2022	AllGPU	—Unverified
Combining Local and Global Pose Estimation for Precise Tracking of Similar Objects	Jan 31, 2022	GPUObject	—Unverified
Combining RGB and Points to Predict Grasping Region for Robotic Bin-Picking	Apr 16, 2019	DiversityPose Estimation	—Unverified

Show:10 25 50

← PrevPage 43 of 85Next →

All datasets COCO test-dev MPII Human Pose OCHuman Leeds Sports Poses CrowdPose COCO val2017 AIC COCO (Common Objects in Context)InLoc ITOP front-view J-HMDB MPII Single Person

Benchmark Results

#	Model	Metric	Claimed	Verified	Status
1	yolopose	AP50	90.3	—	Unverified
2	ViTPose (ViTAE-G, ensemble)	AP	81.1	—	Unverified
3	ViTPose (ViTAE-G)	AP	80.9	—	Unverified
4	PoseBH-H	AP	79.5	—	Unverified
5	UDP-Pose-PSA(384x288)	AP	79.5	—	Unverified
6	4xRSN-50 (ensemble)	AP	79.2	—	Unverified
7	UDP-Pose-PSA(256x192)	AP	78.9	—	Unverified
8	CCM+	AP	78.9	—	Unverified
9	4xRSN-50	AP	78.6	—	Unverified
10	PCT (256x256)	AP	78.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	PCT (swin-l, test set)	PCKh-0.5	94.3	—	Unverified
2	Soft-gated Skip Connections	PCKh-0.5	94.1	—	Unverified
3	Cascade Feature Aggregation	PCKh-0.5	93.9	—	Unverified
4	PCT (swin-b, test set)	PCKh-0.5	93.8	—	Unverified
5	TransPose	PCKh-0.5	93.5	—	Unverified
6	UniHCP (FT)	PCKh-0.5	93.2	—	Unverified
7	4xRSN-50	PCKh-0.5	93	—	Unverified
8	UniPose	PCKh-0.5	92.7	—	Unverified
9	MSPN	PCKh-0.5	92.6	—	Unverified
10	Spatial Context	PCKh-0.5	92.5	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	ViTPose (ViTAE-G, GT bounding boxes)	Test AP	93.3	—	Unverified
2	UniHCP (direct eval)	Test AP	87.4	—	Unverified
3	PoseBH-H	Test AP	87	—	Unverified
4	RTMPose(RTMPose-l, GT bounding boxes)	Test AP	80.3	—	Unverified
5	TransPose-H	Validation AP	62.3	—	Unverified
6	BBox-Mask-Pose 2x	Test AP	48.3	—	Unverified
7	BUCTD (CID-W32)	Test AP	47.2	—	Unverified
8	HQNet (ViT-L)	Test AP	45.6	—	Unverified
9	MaskPose-b	Test AP	45	—	Unverified
10	CID (HRNet-W48)	Test AP	45	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	OmniPose	PCK	99.5	—	Unverified
2	Soft-gated Skip Connections	PCK	94.8	—	Unverified
3	Residual Hourglass + ASR + AHO	PCK	94.5	—	Unverified
4	UniPose	PCK	94.5	—	Unverified
5	Chou et al. arXiv'17	PCK	94	—	Unverified
6	Pyramid Residual Modules (PRMs)	PCK	93.9	—	Unverified
7	Stacked hourglass + Inception-resnet	PCK	93.9	—	Unverified
8	Multi-Context Attention	PCK	92.6	—	Unverified
9	FPD	PCK	90.8	—	Unverified
10	Part heatmap regression (ResNet-152)	PCK	90.7	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BUCTD-W48 (w/cond. input from PETR, and generative sampling)	AP	78.5	—	Unverified
2	ViTPose-G	AP	78.3	—	Unverified
3	BUCTD-W48 (w/cond. input from PETR)	AP	76.7	—	Unverified
4	SwinV2-L 1K-MIM	AP	75.5	—	Unverified
5	SwinV2-B 1K-MIM	AP	74.9	—	Unverified
6	BUCTD-W48	AP	72.9	—	Unverified
7	OpenPifPaf	AP	70.5	—	Unverified
8	MIPNet (HRNet-W48)	AP	70	—	Unverified
9	KAPAO-L	AP	68.9	—	Unverified
10	KAPAO-M	AP	67.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	CCNet (ViTPose-B_GT-bbox_256x192)	AP	78.1	—	Unverified
2	MogaNet-B (384x288)	AP	77.3	—	Unverified
3	ViTPose-B (Single-task_GT-bbox_256x192)	AP	77.3	—	Unverified
4	MogaNet-S (384x288)	AP	76.4	—	Unverified
5	Bias (HRNet_256x192)	AP	75.8	—	Unverified
6	ViTPose-B (Single-task_Det-bbox_256x192)	AP	75.8	—	Unverified
7	HRNet (256x192)	AP	75.3	—	Unverified
8	MogaNet-S (256x192)	AP	74.9	—	Unverified
9	MogaNet-T (256x192)	AP	73.2	—	Unverified
10	RLE (256x192)	AP	71.3	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	Hulk(Finetune, ViT-L)	AP	37.1	—	Unverified
2	Hulk(Finetune, ViT-B)	AP	35.6	—	Unverified
3	HRFormer (HRFomer-B)	AP	34.4	—	Unverified
4	UniHCP (finetune)	AP	33.6	—	Unverified
5	HRNet (HRNet-w48 )	AP	33.5	—	Unverified
6	HRNet (HRNet-w32)	AP	32.3	—	Unverified
7	HRFormer (HRFomer-S)	AP	31.6	—	Unverified
8	SimpleBaseline (ResNet-152)	AP	29.9	—	Unverified
9	SimpleBaseline (ResNet-101)	AP	29.4	—	Unverified
10	SimpleBaseline (ResNet-50)	AP	28	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	BUCTD (PETR, with generative sampling)	APL	83.7	—	Unverified
2	OmniPose (WASPv2)	AP	79.5	—	Unverified
3	MetaPrompt-SD	AP	79	—	Unverified
4	Hulk(Finetune, ViT-L)	AP	78.7	—	Unverified
5	BUCTD (PETR, with generative sampling)	AP	77.8	—	Unverified
6	Hulk(Finetune, ViT-B)	AP	77.5	—	Unverified
7	I²R-Net (1st stage:HRFormer-B)	AP	77.3	—	Unverified
8	PATH (Partial FT)	AP	77.1	—	Unverified
9	SOLIDER (swin-B)	AP	76.6	—	Unverified
10	PEFORMER-Xcit-dino-p8	AP	72.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	GIM-DKM	DUC1-Acc@0.25m,10°	57.1	—	Unverified
2	GIM-LoFTR	DUC1-Acc@0.25m,10°	54.5	—	Unverified
3	GIM-SuperGlue	DUC1-Acc@0.25m,10°	53.5	—	Unverified
4	DKM	DUC1-Acc@0.25m,10°	51.5	—	Unverified
5	SuperGlue	DUC1-Acc@0.25m,10°	49	—	Unverified
6	LoFTR	DUC1-Acc@0.25m,10°	47.5	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	AdaPose	Mean mAP	93.38	—	Unverified
2	DECA-D3	Mean mAP	88.75	—	Unverified
3	V2V-PoseNet	Mean mAP	88.74	—	Unverified
4	A2J	Mean mAP	88	—	Unverified
5	REN	Mean mAP	84.9	—	Unverified
6	Multi-task learning + viewpoint invariance	Mean mAP	77.4	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	SimpleBaseline + HANet	Mean PCK@0.2	99.6	—	Unverified
2	DeciWatch	Mean PCK@0.2	99	—	Unverified
3	LSTM PM	Mean PCK@0.2	93.6	—	Unverified
4	CPM	Mean PCK@0.2	91.9	—	Unverified
5	UniTrack_i18	Mean PCK@0.2	80.5	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	4xRSN-50	PCKh@0.5	93	—	Unverified
2	Refine	PCKh@0.5	92.1	—	Unverified
3	EfficientPose IV	PCKh@0.5	91.2	—	Unverified
4	OpenPose	PCKh@0.5	88.8	—	Unverified
5	Adversarial Learning	PCKh@0.5	88.6	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	OmniPose	Mean PCK@0.2	99.4	—	Unverified
2	UniPose-LSTM	Mean PCK@0.2	99.3	—	Unverified
3	LSTM PM	Mean PCK@0.2	97.7	—	Unverified
4	Thin-Slicing	Mean PCK@0.2	96.5	—	Unverified
5	Iqbal et al.	Mean PCK@0.2	81.1	—	Unverified

#	Model	Metric	Claimed	Verified	Status
1	DP-RCNN-DeepLab (ResNet-101)	AP	68	—	Unverified