SOTAVerified

Point Tracking

Point Tracking, often referred to as Tracking any Point (TAP) involves acquiring, focusing on, and continuously tracking specific target point/points across video frames. The system identifies the target point, maintains focus, and predicts its movement, enabling smooth tracking even if the target moves unpredictably, or through occlusions. TAP has wide applications like object tracking, surveillance, and autonomous navigation.

Papers

Showing 150 of 151 papers

TitleStatusHype
VGGT: Visual Geometry Grounded TransformerCode11
CoTracker3: Simpler and Better Point Tracking by Pseudo-Labelling Real VideosCode7
Drag Your GAN: Interactive Point-based Manipulation on the Generative Image ManifoldCode7
TAPVid-3D: A Benchmark for Tracking Any Point in 3DCode5
BootsTAP: Bootstrapped Training for Tracking-Any-PointCode5
VGGSfM: Visual Geometry Grounded Deep Structure From MotionCode5
SpatialTrackerV2: 3D Point Tracking Made EasyCode4
FlowMap: High-Quality Camera Poses, Intrinsics, and Depth via Gradient DescentCode4
CoTracker: It is Better to Track TogetherCode4
TAPIP3D: Tracking Any Point in Persistent 3D GeometryCode3
Local All-Pair Correspondence for Point TrackingCode3
LEAP-VO: Long-term Effective Any Point Tracking for Visual OdometryCode3
Visual Geometry Grounded Deep Structure From MotionCode3
Segment Anything Meets Point TrackingCode3
TAPIR: Tracking Any Point with per-frame Initialization and temporal RefinementCode3
TAP-Vid: A Benchmark for Tracking Any Point in a VideoCode3
CharaConsist: Fine-Grained Consistent Character GenerationCode2
Seurat: From Moving Points to DepthCode2
POMATO: Marrying Pointmap Matching with Temporal Motion for Dynamic 3D ReconstructionCode2
Track-On: Transformer-based Online Point Tracking with MemoryCode2
Exploring Temporally-Aware Features for Point TrackingCode2
MATCHA: Towards Matching AnythingCode2
Self-Supervised Any-Point Tracking by Contrastive Random WalksCode2
Fréchet Video Motion Distance: A Metric for Evaluating Motion Consistency in VideosCode2
Decomposition Betters Tracking Everything EverywhereCode2
Dynamic Gaussian Marbles for Novel View Synthesis of Casual Monocular VideosCode2
EchoTracker: Advancing Myocardial Point Tracking in EchocardiographyCode2
Dense Optical Tracking: Connecting the DotsCode2
PointOdyssey: A Large-Scale Synthetic Dataset for Long-Term Point TrackingCode2
FreeDrag: Feature Dragging for Reliable Point-based Image EditingCode2
Perception Test: A Diagnostic Benchmark for Multimodal Video ModelsCode2
DragLoRA: Online Optimization of LoRA Adapters for Drag-based Image Editing in Diffusion ModelCode1
Low Complexity Point Tracking of the Myocardium in 2D EchocardiographyCode1
Online Dense Point Tracking with Streaming MemoryCode1
Motion-prior Contrast Maximization for Dense Continuous-Time Motion EstimationCode1
Augmenting Efficient Real-time Surgical Instrument Segmentation in Video with Point Tracking and Segment AnythingCode1
MFT: Long-Term Tracking of Every PixelCode1
Exploring Point-BEV Fusion for 3D Point Cloud Object Tracking with TransformerCode1
PTTR: Relational 3D Point Cloud Object Tracking with TransformerCode1
Point 4D Transformer Networks for Spatio-Temporal Modeling in Point Cloud VideosCode1
Deep Learning based Virtual Point Tracking for Real-Time Target-less Dynamic Displacement Measurement in Railway ApplicationsCode1
SD-DefSLAM: Semi-Direct Monocular SLAM for Deformable and Intracorporeal ScenesCode1
Integrated Switched Capacitor Array and Synchronous Charge Extraction with Adaptive Hybrid MPPT for Piezoelectric Harvesters0
MoVieS: Motion-Aware 4D Dynamic View Synthesis in One Second0
Learning to Track Any Points from Human Motion0
Robotic Manipulation by Imitating Generated Videos Without Physical Demonstrations0
You Are Your Best Teacher: Semi-Supervised Surgical Point Tracking with Cycle-Consistent Self-Distillation0
TimeTracker: Event-based Continuous Point Tracking for Video Frame Interpolation with Non-linear Motion0
St4RTrack: Simultaneous 4D Reconstruction and Tracking in the World0
TAPNext: Tracking Any Point (TAP) as Next Token PredictionCode0
Show:102550
← PrevPage 1 of 4Next →

Benchmark Results

#ModelMetricClaimedVerifiedStatus
1LocoTrack-BAverage Jaccard69.4Unverified
2BootsTAPIRAverage Jaccard66.2Unverified
3CoTrackerAverage Jaccard65.9Unverified
#ModelMetricClaimedVerifiedStatus
1PIPs++Survival50.47Unverified
2PIPs+Survival49.88Unverified
#ModelMetricClaimedVerifiedStatus
1LocoTrack-BAverage Jaccard64.8Unverified
2CoTrackerAverage Jaccard62.2Unverified
#ModelMetricClaimedVerifiedStatus
1BootsTAPIRAverage Jaccard61.4Unverified
2LocoTrack-BAverage Jaccard59.1Unverified
#ModelMetricClaimedVerifiedStatus
1LocoTrack-BAverage Jaccard52.3Unverified
2CoTrackerAverage Jaccard48.8Unverified
#ModelMetricClaimedVerifiedStatus
1BootsTAPIRAverage Jaccard72.4Unverified
2LocoTrack-BAverage Jaccard70.8Unverified
#ModelMetricClaimedVerifiedStatus
1Static BaselineAverage Jaccard0.36Unverified
#ModelMetricClaimedVerifiedStatus
1PIPs++MTE4.6Unverified