SOTAVerified

Keypoint Detection

Keypoint detection is a core computer vision task for analyzing and interpreting images. It involves simultaneously detecting and localizing points of interest in an image. Keypoints, also known as interest points, are spatial locations in the image that are distinctive or stand out from their surroundings. Ideally, they remain stable under image transformations such as rotation, scaling, translation, and distortion. Examples of keypoints include body joints, facial landmarks, and other salient points on objects. Keypoints are used in problems such as pose estimation, object detection and tracking, facial analysis, and augmented reality.

(Image credit: PifPaf: Composite Fields for Human Pose Estimation; "Learning to surf" by fotologic, license: CC-BY-2.0)
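
To make "detecting and localizing" concrete, here is a minimal sketch of one common final step: picking peaks from a per-pixel score map. It assumes some upstream model or filter has already produced the scores. The function name `detect_keypoints`, the `threshold` parameter, and the toy `score_map` are all hypothetical and are not taken from any of the papers listed below.

```python
from typing import List, Tuple


def detect_keypoints(
    score_map: List[List[float]], threshold: float = 0.5
) -> List[Tuple[int, int, float]]:
    """Return (row, col, score) for each strict local maximum at or above threshold."""
    height = len(score_map)
    width = len(score_map[0]) if height else 0
    keypoints = []
    for r in range(height):
        for c in range(width):
            score = score_map[r][c]
            if score < threshold:
                continue
            # Collect the 8-connected neighbours that lie inside the map.
            neighbours = [
                score_map[rr][cc]
                for rr in range(max(0, r - 1), min(height, r + 2))
                for cc in range(max(0, c - 1), min(width, c + 2))
                if (rr, cc) != (r, c)
            ]
            # Keep the pixel only if it strictly beats every neighbour.
            if all(score > n for n in neighbours):
                keypoints.append((r, c, score))
    # Strongest responses first.
    keypoints.sort(key=lambda kp: kp[2], reverse=True)
    return keypoints


# Toy score map (an assumption for illustration, not real model output).
score_map = [
    [0.1, 0.2, 0.1, 0.0],
    [0.2, 0.9, 0.3, 0.1],
    [0.1, 0.3, 0.2, 0.7],
    [0.0, 0.1, 0.4, 0.3],
]
print(detect_keypoints(score_map))  # [(1, 1, 0.9), (2, 3, 0.7)]
```

Each returned tuple gives a location and a confidence, so detection (is there a keypoint?) and localization (where is it?) come out of the same pass. Real systems typically add sub-pixel refinement and a wider suppression window, both omitted here.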

Papers

Showing 1-25 of 339 papers

Title | Status | Hype
Sapiens: Foundation for Human Vision Models | Code | 9
Images Speak in Images: A Generalist Painter for In-Context Visual Learning | Code | 4
ViTPose++: Vision Transformer for Generic Body Pose Estimation | Code | 3
ViTPose: Simple Vision Transformer Baselines for Human Pose Estimation | Code | 3
RMPE: Regional Multi-person Pose Estimation | Code | 3
Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields | Code | 3
DaD: Distilled Reinforcement Learning for Diverse Keypoint Detection | Code | 2
GSplatLoc: Grounding Keypoint Descriptors into 3D Gaussian Splatting for Improved Visual Localization | Code | 2
Instance-Adaptive and Geometric-Aware Keypoint Learning for Category-Level 6D Object Pose Estimation | Code | 2
A Graph-Based Approach for Category-Agnostic Pose Estimation | Code | 2
X-Pose: Detecting Any Keypoints | Code | 2
InstructDiffusion: A Generalist Modeling Interface for Vision Tasks | Code | 2
DeDoDe: Detect, Don't Describe -- Describe, Don't Detect for Local Feature Matching | Code | 2
Detector-Free Structure from Motion | Code | 2
SiLK -- Simple Learned Keypoints | Code | 2
SNAKE: Shape-aware Neural 3D Keypoint Field | Code | 2
OpenPifPaf: Composite Fields for Semantic Keypoint Detection and Spatio-Temporal Association | Code | 2
Objects as Points | Code | 2
OpenPose: Realtime Multi-Person 2D Pose Estimation using Part Affinity Fields | Code | 2
TimePoint: Accelerated Time Series Alignment via Self-Supervised Keypoint and Descriptor Learning | Code | 1
Why Not Replace? Sustaining Long-Term Visual Localization via Handcrafted-Learned Feature Collaboration on CPU | Code | 1
REF-VLM: Triplet-Based Referring Paradigm for Unified Visual Decoding | Code | 1
2.5D U-Net with Depth Reduction for 3D CryoET Object Identification | Code | 1
Enhancing Scene Coordinate Regression with Efficient Keypoint Detection and Sequential Information | Code | 1
Measure Anything: Real-time, Multi-stage Vision-based Dimensional Measurement using Segment Anything | Code | 1

No leaderboard results yet.