SOTAVerified

Pose Estimation

Pose Estimation is a computer vision task where the goal is to detect the position and orientation of a person or an object. Usually, this is done by predicting the location of specific keypoints like hands, head, elbows, etc. in case of Human Pose Estimation.

A common benchmark for this task is MPII Human Pose

( Image credit: Real-time 2D Multi-Person Pose Estimation on CPU: Lightweight OpenPose )

Papers

Showing 201250 of 4228 papers

TitleStatusHype
MonoDiff9D: Monocular Category-Level 9D Object Pose Estimation via Diffusion ModelCode1
GraspClutter6D: A Large-scale Real-world Dataset for Robust Perception and Grasping in Cluttered ScenesCode1
Learning Affine Correspondences by Integrating Geometric ConstraintsCode1
PicoPose: Progressive Pixel-to-Pixel Correspondence Learning for Novel Object Pose EstimationCode1
Recurrent Feature Mining and Keypoint Mixup Padding for Category-Agnostic Pose EstimationCode1
Analyzing the Synthetic-to-Real Domain Gap in 3D Hand Pose EstimationCode1
DynOPETs: A Versatile Benchmark for Dynamic Object Pose Estimation and Tracking in Moving Camera ScenariosCode1
TrackID3x3: A Dataset and Algorithm for Multi-Player Tracking with Identification and Pose Estimation in 3x3 Basketball Full-court VideosCode1
Probabilistic Prompt Distribution Learning for Animal Pose EstimationCode1
EdgeRegNet: Edge Feature-based Multimodal Registration Network between Images and LiDAR Point CloudsCode1
GIVEPose: Gradual Intra-class Variation Elimination for RGB-based Category-Level Object Pose EstimationCode1
Novel Object 6D Pose Estimation with a Single Reference ViewCode1
DQO-MAP: Dual Quadrics Multi-Object mapping with Gaussian SplattingCode1
Convex Hull-based Algebraic Constraint for Visual Quadric SLAMCode1
EVLoc: Event-based Visual Localization in LiDAR Maps via Event-Depth RegistrationCode1
BST: Badminton Stroke-type Transformer for Skeleton-based Action Recognition in Racket SportsCode1
SegLocNet: Multimodal Localization Network for Autonomous Driving via Bird's-Eye-View SegmentationCode1
Learning Structure-Supporting Dependencies via Keypoint Interactive Transformer for General Mammal Pose EstimationCode1
SiMHand: Mining Similar Hands for Large-Scale 3D Hand Pose Pre-trainingCode1
CoDiff: Conditional Diffusion Model for Collaborative 3D Object DetectionCode1
Manual2Skill: Learning to Read Manuals and Acquire Robotic Skills for Furniture Assembly Using Vision-Language ModelsCode1
XRF V2: A Dataset for Action Summarization with Wi-Fi Signals, and IMUs in Phones, Watches, Earbuds, and GlassesCode1
3D/2D Registration of Angiograms using Silhouette-based Differentiable RenderingCode1
EgoHand: Ego-centric Hand Pose Estimation and Gesture Recognition with Head-mounted Millimeter-wave Radar and IMUsCode1
landmarker: a Toolkit for Anatomical Landmark Localization in 2D/3D ImagesCode1
Towards Robust and Realistic Human Pose Estimation via WiFi SignalsCode1
Poseidon: A ViT-based Architecture for Multi-Frame Pose Estimation with Adaptive Frame Weighting and Multi-Scale Feature FusionCode1
RePoseD: Efficient Relative Pose Estimation With Known Depth InformationCode1
Learning to Filter Outlier Edges in Global SfMCode1
PIDLoc: Cross-View Pose Optimization Network Inspired by PID ControllersCode1
Pre-training a Density-Aware Pose Transformer for Robust LiDAR-based 3D Human Pose EstimationCode1
Enhancing Scene Coordinate Regression with Efficient Keypoint Detection and Sequential InformationCode1
Particle-based 6D Object Pose Estimation from Point Clouds using Diffusion ModelsCode1
Multiview Equivariance Improves 3D Correspondence Understanding with Minimal Feature FinetuningCode1
RoboPEPP: Vision-Based Robot Pose and Joint Angle Estimation through Embedding Predictive Pre-TrainingCode1
Edge Weight Prediction For Category-Agnostic Pose EstimationCode1
Generalizable Single-view Object Pose Estimation by Two-side Generating and MatchingCode1
X as Supervision: Contending with Depth Ambiguity in Unsupervised Monocular 3D Pose EstimationCode1
Activating Self-Attention for Multi-Scene Absolute Pose RegressionCode1
SCRREAM : SCan, Register, REnder And Map:A Framework for Annotating Accurate and Dense 3D Indoor Scenes with a BenchmarkCode1
EI-Nexus: Towards Unmediated and Flexible Inter-Modality Local Feature Extraction and Matching for Event-Image DataCode1
BLAPose: Enhancing 3D Human Pose Estimation with Bone Length AdjustmentCode1
ARTS: Semi-Analytical Regressor using Disentangled Skeletal Representations for Human Mesh Recovery from VideosCode1
Towards Multi-Modal Animal Pose Estimation: A Survey and In-Depth AnalysisCode1
Optimal-state Dynamics Estimation for Physics-based Human Motion Capture from VideosCode1
Why Sample Space Matters: Keyframe Sampling Optimization for LiDAR-based Place RecognitionCode1
RAD: A Dataset and Benchmark for Real-Life Anomaly Detection with Robotic ObservationsCode1
PuzzleBoard: A New Camera Calibration Pattern with Position EncodingCode1
Omni6D: Large-Vocabulary 3D Object Dataset for Category-Level 6D Object Pose EstimationCode1
Leveraging Anthropometric Measurements to Improve Human Mesh Estimation and Ensure Consistent Body ShapesCode1
Show:102550
← PrevPage 5 of 85Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1yoloposeAP5090.3Unverified
2ViTPose (ViTAE-G, ensemble)AP81.1Unverified
3ViTPose (ViTAE-G)AP80.9Unverified
4UDP-Pose-PSA(384x288)AP79.5Unverified
5PoseBH-HAP79.5Unverified
64xRSN-50 (ensemble)AP79.2Unverified
7UDP-Pose-PSA(256x192)AP78.9Unverified
8CCM+AP78.9Unverified
94xRSN-50AP78.6Unverified
10PCT (256x256)AP78.3Unverified
#ModelMetricClaimedVerifiedStatus
1PCT (swin-l, test set)PCKh-0.594.3Unverified
2Soft-gated Skip ConnectionsPCKh-0.594.1Unverified
3Cascade Feature AggregationPCKh-0.593.9Unverified
4PCT (swin-b, test set)PCKh-0.593.8Unverified
5TransPosePCKh-0.593.5Unverified
6UniHCP (FT)PCKh-0.593.2Unverified
74xRSN-50PCKh-0.593Unverified
8UniPosePCKh-0.592.7Unverified
9MSPNPCKh-0.592.6Unverified
10Spatial ContextPCKh-0.592.5Unverified
#ModelMetricClaimedVerifiedStatus
1ViTPose (ViTAE-G, GT bounding boxes)Test AP93.3Unverified
2UniHCP (direct eval)Test AP87.4Unverified
3PoseBH-HTest AP87Unverified
4RTMPose(RTMPose-l, GT bounding boxes)Test AP80.3Unverified
5TransPose-HValidation AP62.3Unverified
6BBox-Mask-Pose 2xTest AP48.3Unverified
7BUCTD (CID-W32)Test AP47.2Unverified
8HQNet (ViT-L)Test AP45.6Unverified
9CID (HRNet-W48)Test AP45Unverified
10MaskPose-bTest AP45Unverified
#ModelMetricClaimedVerifiedStatus
1OmniPosePCK99.5Unverified
2Soft-gated Skip ConnectionsPCK94.8Unverified
3UniPosePCK94.5Unverified
4Residual Hourglass + ASR + AHOPCK94.5Unverified
5Chou et al. arXiv'17PCK94Unverified
6Pyramid Residual Modules (PRMs)PCK93.9Unverified
7Stacked hourglass + Inception-resnetPCK93.9Unverified
8Multi-Context AttentionPCK92.6Unverified
9FPDPCK90.8Unverified
10Part heatmap regression (ResNet-152)PCK90.7Unverified
#ModelMetricClaimedVerifiedStatus
1BUCTD-W48 (w/cond. input from PETR, and generative sampling)AP78.5Unverified
2ViTPose-GAP78.3Unverified
3BUCTD-W48 (w/cond. input from PETR)AP76.7Unverified
4SwinV2-L 1K-MIMAP75.5Unverified
5SwinV2-B 1K-MIMAP74.9Unverified
6BUCTD-W48AP72.9Unverified
7OpenPifPafAP70.5Unverified
8MIPNet (HRNet-W48)AP70Unverified
9KAPAO-LAP68.9Unverified
10KAPAO-MAP67.1Unverified
#ModelMetricClaimedVerifiedStatus
1CCNet (ViTPose-B_GT-bbox_256x192)AP78.1Unverified
2MogaNet-B (384x288)AP77.3Unverified
3ViTPose-B (Single-task_GT-bbox_256x192)AP77.3Unverified
4MogaNet-S (384x288)AP76.4Unverified
5Bias (HRNet_256x192)AP75.8Unverified
6ViTPose-B (Single-task_Det-bbox_256x192)AP75.8Unverified
7HRNet (256x192)AP75.3Unverified
8MogaNet-S (256x192)AP74.9Unverified
9MogaNet-T (256x192)AP73.2Unverified
10RLE (256x192)AP71.3Unverified
#ModelMetricClaimedVerifiedStatus
1Hulk(Finetune, ViT-L)AP37.1Unverified
2Hulk(Finetune, ViT-B)AP35.6Unverified
3HRFormer (HRFomer-B)AP34.4Unverified
4UniHCP (finetune)AP33.6Unverified
5HRNet (HRNet-w48 )AP33.5Unverified
6HRNet (HRNet-w32)AP32.3Unverified
7HRFormer (HRFomer-S)AP31.6Unverified
8SimpleBaseline (ResNet-152)AP29.9Unverified
9SimpleBaseline (ResNet-101)AP29.4Unverified
10SimpleBaseline (ResNet-50)AP28Unverified
#ModelMetricClaimedVerifiedStatus
1BUCTD (PETR, with generative sampling)APL83.7Unverified
2OmniPose (WASPv2)AP79.5Unverified
3MetaPrompt-SDAP79Unverified
4Hulk(Finetune, ViT-L)AP78.7Unverified
5BUCTD (PETR, with generative sampling)AP77.8Unverified
6Hulk(Finetune, ViT-B)AP77.5Unverified
7I²R-Net (1st stage:HRFormer-B)AP77.3Unverified
8PATH (Partial FT)AP77.1Unverified
9SOLIDER (swin-B)AP76.6Unverified
10PEFORMER-Xcit-dino-p8AP72.6Unverified
#ModelMetricClaimedVerifiedStatus
1GIM-DKMDUC1-Acc@0.25m,10°57.1Unverified
2GIM-LoFTRDUC1-Acc@0.25m,10°54.5Unverified
3GIM-SuperGlueDUC1-Acc@0.25m,10°53.5Unverified
4DKMDUC1-Acc@0.25m,10°51.5Unverified
5SuperGlueDUC1-Acc@0.25m,10°49Unverified
6LoFTRDUC1-Acc@0.25m,10°47.5Unverified
#ModelMetricClaimedVerifiedStatus
1AdaPoseMean mAP93.38Unverified
2DECA-D3Mean mAP88.75Unverified
3V2V-PoseNetMean mAP88.74Unverified
4A2JMean mAP88Unverified
5RENMean mAP84.9Unverified
6Multi-task learning + viewpoint invarianceMean mAP77.4Unverified
#ModelMetricClaimedVerifiedStatus
1SimpleBaseline + HANetMean PCK@0.299.6Unverified
2DeciWatchMean PCK@0.299Unverified
3LSTM PMMean PCK@0.293.6Unverified
4CPMMean PCK@0.291.9Unverified
5UniTrack_i18Mean PCK@0.280.5Unverified
#ModelMetricClaimedVerifiedStatus
14xRSN-50PCKh@0.593Unverified
2RefinePCKh@0.592.1Unverified
3EfficientPose IVPCKh@0.591.2Unverified
4OpenPosePCKh@0.588.8Unverified
5Adversarial LearningPCKh@0.588.6Unverified
#ModelMetricClaimedVerifiedStatus
1OmniPoseMean PCK@0.299.4Unverified
2UniPose-LSTMMean PCK@0.299.3Unverified
3LSTM PMMean PCK@0.297.7Unverified
4Thin-SlicingMean PCK@0.296.5Unverified
5Iqbal et al.Mean PCK@0.281.1Unverified
#ModelMetricClaimedVerifiedStatus
1DP-RCNN-DeepLab (ResNet-101)AP68Unverified