SOTAVerified

3D Human Pose Estimation

3D Human Pose Estimation is a computer vision task that estimates the 3D positions and orientations of body joints and bones from 2D images or videos. The goal is to reconstruct a person's 3D pose, often in real time, for applications such as virtual reality, human-computer interaction, and motion analysis.

Papers

Showing 1–50 of 665 papers

Title | Status | Hype
BlazePose: On-device Real-time Body Pose tracking | Code | 4
TRAM: Global Trajectory and Motion of 3D Humans from in-the-wild Videos | Code | 4
Humans in 4D: Reconstructing and Tracking Humans with Transformers | Code | 3
WHAM: Reconstructing World-grounded Humans with Accurate 3D Motion | Code | 3
MotionBERT: A Unified Perspective on Learning Human Motion Representations | Code | 3
HybrIK-X: Hybrid Analytical-Neural Inverse Kinematics for Whole-body Mesh Recovery | Code | 3
Multi-HMR: Multi-Person Whole-Body Human Mesh Recovery in a Single Shot | Code | 3
PromptHMR: Promptable Human Mesh Recovery | Code | 3
TRACE: 5D Temporal Regression of Avatars with Dynamic Cameras in 3D Environments | Code | 3
PIFuHD: Multi-Level Pixel-Aligned Implicit Function for High-Resolution 3D Human Digitization | Code | 3
SMPLer-X: Scaling Up Expressive Human Pose and Shape Estimation | Code | 3
TokenHMR: Advancing Human Mesh Recovery with a Tokenized Pose Representation | Code | 3
PoseFormerV2: Exploring Frequency Domain for Efficient and Robust 3D Human Pose Estimation | Code | 2
A Survey on 3D Egocentric Human Pose Estimation | Code | 2
Not All Tokens Are Equal: Human-centric Visual Analysis via Token Clustering Transformer | Code | 2
3D Human Mesh Estimation from Virtual Markers | Code | 2
TCPFormer: Learning Temporal Correlation with Implicit Pose Proxy for 3D Human Pose Estimation | Code | 2
NIKI: Neural Inverse Kinematics with Invertible Neural Networks for 3D Human Pose and Shape Estimation | Code | 2
One-Stage 3D Whole-Body Mesh Recovery with Component Aware Transformer | Code | 2
PoseTriplet: Co-evolving 3D Human Pose Estimation, Imitation, and Hallucination under Self-supervision | Code | 2
MixSTE: Seq2seq Mixed Spatio-Temporal Encoder for 3D Human Pose Estimation in Video | Code | 2
PoseMamba: Monocular 3D Human Pose Estimation with Bidirectional Global-Local Spatio-Temporal State Space Model | Code | 2
SMPLer: Taming Transformers for Monocular 3D Human Shape and Pose Estimation | Code | 2
STAF: 3D Human Mesh Recovery from Video with Spatio-Temporal Alignment Fusion | Code | 2
KTPFormer: Kinematics and Trajectory Prior Knowledge-Enhanced Transformer for 3D Human Pose Estimation | Code | 2
ICON: Implicit Clothed humans Obtained from Normals | Code | 2
Learning 3D Human Pose Estimation from Dozens of Datasets using a Geometry-Aware Autoencoder to Bridge Between Skeleton Formats | Code | 2
AthletePose3D: A Benchmark Dataset for 3D Human Pose Estimation and Kinematic Validation in Athletic Movements | Code | 2
Hulk: A Universal Knowledge Translator for Human-Centric Tasks | Code | 2
Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation | Code | 2
HybrIK: A Hybrid Analytical-Neural Inverse Kinematics Solution for 3D Human Pose and Shape Estimation | Code | 2
ETCH: Generalizing Body Fitting to Clothed Humans via Equivariant Tightness | Code | 2
FinePOSE: Fine-Grained Prompt-Driven 3D Human Pose Estimation via Diffusion Models | Code | 2
DensePose From WiFi | Code | 2
Monocular, One-stage, Regression of Multiple 3D People | Code | 2
H3WB: Human3.6M 3D WholeBody Dataset and Benchmark | Code | 2
Deep learning for 3D human pose estimation and mesh recovery: A survey | Code | 2
Human-Art: A Versatile Human-Centric Dataset Bridging Natural and Artificial Scenes | Code | 2
MogaNet: Multi-order Gated Aggregation Network | Code | 2
FrankMocap: A Monocular 3D Whole-Body Pose Estimation System via Regression and Integration | Code | 2
PyMAF-X: Towards Well-aligned Full-body Model Regression from Monocular Images | Code | 2
MotionAGFormer: Enhancing 3D Human Pose Estimation with a Transformer-GCNFormer Network | Code | 2
Attention Mechanism Exploits Temporal Contexts: Real-Time 3D Human Pose Reconstruction | Code | 1
PyMAF: 3D Human Pose and Shape Regression with Pyramidal Mesh Alignment Feedback Loop | Code | 1
AutoGen: Enabling Next-Gen LLM Applications via Multi-Agent Conversation | Code | 1
3D Human Motion Estimation via Motion Compression and Refinement | Code | 1
Cyclic Test-Time Adaptation on Monocular Video for 3D Human Mesh Reconstruction | Code | 1
D&D: Learning Human Dynamics from Dynamic Camera | Code | 1
ARTS: Semi-Analytical Regressor using Disentangled Skeletal Representations for Human Mesh Recovery from Videos | Code | 1
3D Human Mesh Regression with Dense Correspondence | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | Simple-baseline | PA-MPJPE | 157 | | Unverified
2 | HMR | MPJPE | 130 | | Unverified
3 | BMP | MPVPE | 119.3 | | Unverified
4 | SPIN | MPVPE | 116.4 | | Unverified
5 | Wenshuo et al. | MPVPE | 112.6 | | Unverified
6 | TCMR (T=16 w/o 3DPW) | MPVPE | 111.5 | | Unverified
7 | CHOMP | MPVPE | 110.1 | | Unverified
8 | PC-HMR | MPVPE | 108.6 | | Unverified
9 | 3DCrowdNet | MPVPE | 108.5 | | Unverified
10 | SMPLify | PA-MPJPE | 106.8 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | VNect (Augm.) | MPJPE | 124.7 | | Unverified
2 | HMR | MPJPE | 124.2 | | Unverified
3 | Single-Shot Multi-Person | MPJPE | 122.2 | | Unverified
4 | Mehta | MPJPE | 117.6 | | Unverified
5 | PONet | MPJPE | 115 | | Unverified
6 | Pose Consensus (monocular) | MPJPE | 112.1 | | Unverified
7 | GeoRep (fully-supervised) | MPJPE | 110.8 | | Unverified
8 | XFormer (HRNet) | MPJPE | 109.8 | | Unverified
9 | EpipolarPose (fully-supervised) | MPJPE | 108.99 | | Unverified
10 | SPIN | MPJPE | 105.2 | | Unverified
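
For readers unfamiliar with the metrics in these tables: MPJPE (mean per-joint position error) is the mean Euclidean distance between predicted and ground-truth 3D joints, and MPVPE applies the same formula to mesh vertices. PA-MPJPE first aligns the prediction to the ground truth with a Procrustes (similarity) transform, which is not shown here. The following is a minimal sketch, not any listed method's evaluation code. The function name `mean_position_error` and the three-joint skeleton are hypothetical, and the millimetre unit is an assumption, since the tables do not state one.

```python
import math
from typing import Sequence

Point3D = Sequence[float]


def mean_position_error(pred: Sequence[Point3D], gt: Sequence[Point3D]) -> float:
    """Mean Euclidean distance between corresponding 3D points.

    With joints as input this is MPJPE; with mesh vertices it is MPVPE.
    No alignment is applied, so this is not PA-MPJPE.
    """
    if len(pred) != len(gt) or not pred:
        raise ValueError("pred and gt must be non-empty and the same length")
    return sum(math.dist(p, g) for p, g in zip(pred, gt)) / len(pred)


# Hypothetical 3-joint skeleton; coordinates assumed to be in millimetres.
gt_joints = [(0.0, 0.0, 0.0), (0.0, 500.0, 0.0), (0.0, 1000.0, 0.0)]
pred_joints = [(0.0, 0.0, 0.0), (30.0, 500.0, 40.0), (0.0, 1000.0, 100.0)]

mpjpe = mean_position_error(pred_joints, gt_joints)
print(f"MPJPE: {mpjpe:.1f} mm")  # per-joint errors are 0, 50 and 100 -> 50.0
```

Lower is better for all three metrics. Values computed under different metrics, such as the PA-MPJPE and MPVPE rows in the first table, are not directly comparable.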