SOTAVerified

Pose Tracking

Pose Tracking is the task of estimating multi-person human poses in videos and assigning a unique instance ID to each keypoint across frames. Accurate estimation of human keypoint trajectories is useful for human action recognition, human interaction understanding, motion capture, and animation.

Source: LightTrack: A Generic Framework for Online Top-Down Human Pose Tracking
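
To make the "assigning unique instance IDs across frames" part of the definition concrete, here is a minimal sketch of greedy frame-to-frame ID assignment. It is not the method of LightTrack or of any paper listed on this page; the function names (`mean_keypoint_distance`, `assign_ids`), the pose representation (a list of `(x, y)` keypoints), and the `max_dist` threshold are all assumptions made for illustration.

```python
import math


def mean_keypoint_distance(pose_a, pose_b):
    """Average Euclidean distance between corresponding keypoints."""
    return sum(math.dist(p, q) for p, q in zip(pose_a, pose_b)) / len(pose_a)


def assign_ids(prev_tracks, poses, next_id, max_dist=50.0):
    """Greedily match current-frame poses to tracks from the previous frame.

    prev_tracks: dict mapping track ID -> pose (list of (x, y) keypoints)
    poses:       list of poses detected in the current frame
    next_id:     first unused track ID
    max_dist:    hypothetical threshold (pixels) above which no match is made
    Returns (tracks, next_id), where tracks maps track ID -> current pose.
    """
    # Every (distance, track ID, pose index) pair, closest pairs first.
    candidates = sorted(
        (mean_keypoint_distance(prev, pose), tid, idx)
        for tid, prev in prev_tracks.items()
        for idx, pose in enumerate(poses)
    )
    tracks, used_tracks, used_poses = {}, set(), set()
    for dist, tid, idx in candidates:
        if dist > max_dist:
            break  # all remaining pairs are even farther apart
        if tid in used_tracks or idx in used_poses:
            continue  # each track and each pose is matched at most once
        tracks[tid] = poses[idx]
        used_tracks.add(tid)
        used_poses.add(idx)
    # Poses that matched no existing track start new tracks.
    for idx, pose in enumerate(poses):
        if idx not in used_poses:
            tracks[next_id] = pose
            next_id += 1
    return tracks, next_id


# Two people with two keypoints each; detection order is swapped in frame 2.
frame1 = [[(10, 10), (10, 30)], [(200, 10), (200, 30)]]
frame2 = [[(205, 12), (204, 31)], [(12, 11), (11, 29)]]

tracks, next_id = assign_ids({}, frame1, next_id=0)
tracks, next_id = assign_ids(tracks, frame2, next_id)
```

In this toy example each person keeps the ID assigned in the first frame even though the detection order changes. Real systems usually replace the distance with a learned or keypoint-similarity score and handle occlusion and re-entry, which this sketch does not attempt.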

Papers

Showing 1-50 of 191 papers

| Title | Status | Hype |
|---|---|---|
| FoundationPose: Unified 6D Pose Estimation and Tracking of Novel Objects | Code | 4 |
| BlazePose: On-device Real-time Body Pose tracking | Code | 4 |
| Keypoint Promptable Re-Identification | Code | 3 |
| Humans in 4D: Reconstructing and Tracking Humans with Transformers | Code | 3 |
| BundleSDF: Neural 6-DoF Tracking and 3D Reconstruction of Unknown Objects | Code | 3 |
| RGBTrack: Fast, Robust Depth-Free 6D Pose Estimation and Tracking | Code | 2 |
| ESVO2: Direct Visual-Inertial Odometry with Stereo Event Cameras | Code | 2 |
| IMU-Aided Event-based Stereo Visual Odometry | Code | 2 |
| AvatarPoser: Articulated Full-Body Pose Tracking from Sparse Motion Sensing | Code | 2 |
| Structure PLP-SLAM: Efficient Sparse Mapping and Localization using Point, Line and Plane for Monocular, RGB-D and Stereo Cameras | Code | 2 |
| Keypoint-Based Category-Level Object Pose Tracking from an RGB Sequence with Uncertainty Estimation | Code | 2 |
| You Only Demonstrate Once: Category-Level Manipulation from Single Visual Demonstration | Code | 2 |
| SpikeVideoFormer: An Efficient Spike-Driven Video Transformer with Hamming Attention and O(T) Complexity | Code | 1 |
| Embracing Dynamics: Dynamics-aware 4D Gaussian Splatting SLAM | Code | 1 |
| DynOPETs: A Versatile Benchmark for Dynamic Object Pose Estimation and Tracking in Moving Camera Scenarios | Code | 1 |
| GS-EVT: Cross-Modal Event Camera Tracking based on Gaussian Splatting | Code | 1 |
| SRPose: Two-view Relative Pose Estimation with Sparse Keypoints | Code | 1 |
| HO-Cap: A Capture System and Dataset for 3D Reconstruction and Pose Tracking of Hand-Object Interaction | Code | 1 |
| High-Fidelity SLAM Using Gaussian Splatting with Rendering-Guided Densification and Regularized Optimization | Code | 1 |
| VideoMAC: Video Masked Autoencoders Meet ConvNets | Code | 1 |
| Towards Real-World Aerial Vision Guidance with Categorical 6D Pose Tracker | Code | 1 |
| APTv2: Benchmarking Animal Pose Estimation and Tracking with a Large-scale Dataset and Beyond | Code | 1 |
| PACE: A Large-Scale Dataset with Pose Annotations in Cluttered Environments | Code | 1 |
| Deep Event Visual Odometry | Code | 1 |
| Multimodal video and IMU kinematic dataset on daily life activities using affordable devices (VIDIMU) | Code | 1 |
| GarmentTracking: Category-Level Garment Pose Tracking | Code | 1 |
| 3D-POP - An automated annotation approach to facilitate markerless 2D-3D tracking of freely moving birds with marker-based motion capture | Code | 1 |
| Highly Efficient 3D Human Pose Tracking from Events with Spiking Spatiotemporal Transformer | Code | 1 |
| 3D Neural Embedding Likelihood: Probabilistic Inverse Graphics for Robust 6D Pose Estimation | Code | 1 |
| OpenApePose: a database of annotated ape photographs for pose estimation | Code | 1 |
| Enhancing Generalizable 6D Pose Tracking of an In-Hand Object with Tactile Sensing | Code | 1 |
| PixTrack: Precise 6DoF Object Pose Tracking using NeRF Templates and Feature-metric Alignment | Code | 1 |
| Semantic-Aware Fine-Grained Correspondence | Code | 1 |
| HOI4D: A 4D Egocentric Dataset for Category-Level Human-Object Interaction | Code | 1 |
| HMD-EgoPose: Head-Mounted Display-Based Egocentric Marker-Less Tool and Hand Pose Estimation for Augmented Surgical Guidance | Code | 1 |
| PoseTrack21: A Dataset for Person Search, Multi-Object Tracking and Multi-Person Pose Tracking | Code | 1 |
| Sparse Steerable Convolutions: An Efficient Learning of SE(3)-Equivariant Features for Estimation and Tracking of Object Poses in 3D Space | Code | 1 |
| ROFT: Real-Time Optical Flow-Aided 6D Object Pose and Velocity Tracking | Code | 1 |
| SRT3D: A Sparse Region-Based 3D Object Tracking Approach for the Real World | Code | 1 |
| BundleTrack: 6D Pose Tracking for Novel Objects without Instance or Category-Level 3D Models | Code | 1 |
| Do Different Tracking Tasks Require Different Appearance Models? | Code | 1 |
| Data-driven 6D Pose Tracking by Calibrating Image Residuals in Synthetic Domains | Code | 1 |
| Breaking Shortcut: Exploring Fully Convolutional Cycle-Consistency for Video Correspondence Learning | Code | 1 |
| CAPTRA: CAtegory-level Pose Tracking for Rigid and Articulated Objects from Point Clouds | Code | 1 |
| Iterative Greedy Matching for 3D Human Pose Tracking from Multiple Views | Code | 1 |
| Temporally Guided Articulated Hand Pose Tracking in Surgical Videos | Code | 1 |
| Deep Graph Pose: a semi-supervised deep graphical model for improved animal pose tracking | Code | 1 |
| BiHand: Recovering Hand Mesh with Multi-stage Bisected Hourglass Networks | Code | 1 |
| se(3)-TrackNet: Data-driven 6D Pose Tracking by Calibrating Image Residuals in Synthetic Domains | Code | 1 |
| Monocular Camera Localization in Prior LiDAR Maps with 2D-3D Line Correspondences | Code | 1 |

Benchmark Results

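| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | DetTrack | MOTA | 64.09 | | Unverified |
| 2 | KeyTrack | MOTA | 61.15 | | Unverified |
| 3 | LightTrack | MOTA | 58.01 | | Unverified |
| 4 | HRNet-W48 COCO | MOTA | 57.93 | | Unverified |
| 5 | MSRA (FlowTrack) | MOTA | 57.81 | | Unverified |
| 6 | TML++ (MIPAL) | MOTA | 54.46 | | Unverified |
| 7 | STAF | MOTA | 53.81 | | Unverified |
| 8 | ProTracker | MOTA | 51.82 | | Unverified |
| 9 | PoseFlow | MOTA | 50.98 | | Unverified |
| 10 | PoseTrack | MOTA | 48.37 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | DetTrack | MOTA | 64.3 | | Unverified |
| 2 | UniTrack | MOTA | 63.5 | | Unverified |
| 3 | 4DHumans + ViTDet | MOTA | 61.9 | | Unverified |
| 4 | MSRA | MOTA | 61.37 | | Unverified |
| 5 | TML++ (MIPAL) | MOTA | 54.86 | | Unverified |

| # | Model | Metric | Claimed | Verified | Status |
|---|---|---|---|---|---|
| 1 | PoseTrack | MOTA | 28.2 | | Unverified |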
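
All results above are reported as MOTA (Multiple Object Tracking Accuracy). As a reading aid, the sketch below computes MOTA from error counts using the standard CLEAR MOT definition, MOTA = 1 - (FN + FP + IDSW) / GT, expressed as a percentage. The function name `mota` and the example counts are made up for illustration. Pose tracking benchmarks typically accumulate these counts over keypoints and not over bounding boxes, and the exact evaluation protocol behind the numbers on this page is not stated here.

```python
def mota(false_negatives, false_positives, id_switches, num_ground_truth):
    """Return MOTA as a percentage: 100 * (1 - (FN + FP + IDSW) / GT).

    All counts are totals accumulated over every frame of the evaluation set.
    The score can be negative when the errors outnumber the ground-truth items.
    """
    if num_ground_truth <= 0:
        raise ValueError("num_ground_truth must be positive")
    errors = false_negatives + false_positives + id_switches
    return 100.0 * (1.0 - errors / num_ground_truth)


# Hypothetical counts: 500 errors against 1000 ground-truth keypoints.
score = mota(false_negatives=250, false_positives=200,
             id_switches=50, num_ground_truth=1000)
```

With these made-up counts the score is 50, meaning the combined misses, false detections, and identity switches amount to half of the ground-truth annotations.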