SOTAVerified

Keypoint Detection

Keypoint detection is a core task in computer vision for analyzing and interpreting images. It involves simultaneously detecting and localizing points of interest in an image. Keypoints, also known as interest points, are spatial locations in the image that are distinctive or salient. Ideally they are invariant to transformations such as rotation, scaling, translation, and distortion, so that the same points can be found again when the image changes. Examples of keypoints include body joints, facial landmarks, and other salient points on objects. Keypoints are used in problems such as pose estimation, object detection and tracking, facial analysis, and augmented reality.

(Image credit: PifPaf: Composite Fields for Human Pose Estimation; "Learning to surf" by fotologic, license: CC-BY-2.0)
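
As a rough illustration of what "detecting and localizing" can mean in practice, the sketch below assumes a detector has already produced a 2D score map (heatmap) and reads keypoints off it as local maxima above a threshold. The function name `heatmap_peaks`, the threshold value, and the synthetic heatmap are hypothetical choices made for this example; they are not taken from any of the papers listed on this page.

```python
import numpy as np


def heatmap_peaks(heatmap, threshold=0.5):
    """Return (row, col, score) for each strict local maximum >= threshold.

    Illustrative helper only: `heatmap` is assumed to be a 2D array of
    per-pixel keypoint scores, as a heatmap-based detector might output.
    """
    heatmap = np.asarray(heatmap, dtype=float)
    h, w = heatmap.shape
    # Pad with -inf so border pixels can still be compared to 8 neighbours.
    padded = np.pad(heatmap, 1, mode="constant", constant_values=-np.inf)
    # Build the 8 shifted views of the map (one per neighbour direction).
    neighbours = np.stack([
        padded[1 + dr:1 + dr + h, 1 + dc:1 + dc + w]
        for dr in (-1, 0, 1)
        for dc in (-1, 0, 1)
        if (dr, dc) != (0, 0)
    ])
    # A keypoint must beat all of its neighbours and clear the threshold.
    is_peak = (heatmap > neighbours.max(axis=0)) & (heatmap >= threshold)
    rows, cols = np.nonzero(is_peak)
    return [(int(r), int(c), float(heatmap[r, c])) for r, c in zip(rows, cols)]


# Synthetic 8x8 score map with two strong responses and one weak one.
demo = np.zeros((8, 8))
demo[2, 3] = 0.9
demo[5, 6] = 0.7
demo[1, 1] = 0.3  # below the threshold, so it is not reported
peaks = heatmap_peaks(demo)
print(peaks)
```

Detection here is the decision that a location is a keypoint (the threshold and local-maximum test), and localization is the coordinate that comes back. Real systems typically add sub-pixel refinement and, for multi-person pose estimation, a grouping step that this sketch leaves out.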

Papers

Showing 1–50 of 339 papers

Title | Status | Hype
Sapiens: Foundation for Human Vision Models | Code | 9
Images Speak in Images: A Generalist Painter for In-Context Visual Learning | Code | 4
ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation | Code | 3
ViTPose++: Vision Transformer for Generic Body Pose Estimation | Code | 3
RMPE: Regional Multi-person Pose Estimation | Code | 3
Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields | Code | 3
DaD: Distilled Reinforcement Learning for Diverse Keypoint Detection | Code | 2
DeDoDe: Detect, Don't Describe -- Describe, Don't Detect for Local Feature Matching | Code | 2
Detector-Free Structure from Motion | Code | 2
A Graph-Based Approach for Category-Agnostic Pose Estimation | Code | 2
SNAKE: Shape-aware Neural 3D Keypoint Field | Code | 2
GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization | Code | 2
Instance-Adaptive and Geometric-Aware Keypoint Learning for Category-Level 6D Object Pose Estimation | Code | 2
OpenPifPaf: Composite Fields for Semantic Keypoint Detection and Spatio-Temporal Association | Code | 2
X-Pose: Detecting Any Keypoints | Code | 2
Objects as Points | Code | 2
OpenPose: Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields | Code | 2
SiLK -- Simple Learned Keypoints | Code | 2
InstructDiffusion: A Generalist Modeling Interface for Vision Tasks | Code | 2
HoughNet: Integrating near and long-range evidence for visual detection | Code | 1
GSNet: Joint Vehicle Pose and Shape Reconstruction with Geometrical and Scene-aware Supervision | Code | 1
HPRNet: Hierarchical Point Regression for Whole-Body Human Pose Estimation | Code | 1
GoodPoint: unsupervised learning of keypoint detection and description | Code | 1
Greedy Offset-Guided Keypoint Grouping for Human Pose Estimation | Code | 1
3D3L: Deep Learned 3D Keypoint Detection and Description for LiDARs | Code | 1
GPU optimization of the 3D Scale-invariant Feature Transform Algorithm and a Novel BRIEF-inspired 3D Fast Descriptor | Code | 1
2.5D U-Net with Depth Reduction for 3D CryoET Object Identification | Code | 1
CoFiNet: Reliable Coarse-to-fine Correspondences for Robust PointCloud Registration | Code | 1
CoFiNet: Reliable Coarse-to-fine Correspondences for Robust Point Cloud Registration | Code | 1
Center Direction Network for Grasping Point Localization on Cloths | Code | 1
Enhancing Scene Coordinate Regression with Efficient Keypoint Detection and Sequential Information | Code | 1
Generative Partition Networks for Multi-Person Pose Estimation | Code | 1
GRIT: General Robust Image Task Benchmark | Code | 1
Explicit Box Detection Unifies End-to-End Multi-Person Pose Estimation | Code | 1
EvoPose2D: Pushing the Boundaries of 2D Human Pose Estimation using Accelerated Neuroevolution with Weight Transfer | Code | 1
Fast Fourier Convolution | Code | 1
EEEA-Net: An Early Exit Evolutionary Neural Architecture Search | Code | 1
DexYCB: A Benchmark for Capturing Hand Grasping of Objects | Code | 1
ALIKE: Accurate and Lightweight Keypoint Detection and Descriptor Extraction | Code | 1
Edge Weight Prediction For Category-Agnostic Pose Estimation | Code | 1
Bottom-Up Human Pose Estimation by Ranking Heatmap-Guided Adaptive Keypoint Estimates | Code | 1
Bottom-Up Human Pose Estimation Via Disentangled Keypoint Regression | Code | 1
BPFNet: A Unified Framework for Bimodal Palmprint Alignment and Fusion | Code | 1
EP2P-Loc: End-to-End 3D Point to 2D Pixel Localization for Large-Scale Visual Localization | Code | 1
CapeX: Category-Agnostic Pose Estimation from Textual Point Explanation | Code | 1
Cascaded Pyramid Network for Multi-Person Pose Estimation | Code | 1
End-to-End Trainable Multi-Instance Pose Estimation with Transformers | Code | 1
Centroid Distance Keypoint Detector for Colored Point Clouds | Code | 1
A Novel Dataset for Keypoint Detection of quadruped Animals from Images | Code | 1
Few-shot Keypoint Detection with Uncertainty Learning for Unseen Species | Code | 1

No leaderboard results yet.