SOTAVerified

Visual Tracking

Visual Tracking is an essential and actively researched problem in the field of computer vision with various real-world applications such as robotic services, smart surveillance systems, autonomous driving, and human-computer interaction. It refers to the automatic estimation of the trajectory of an arbitrary target object, usually specified by a bounding box in the first frame, as it moves around in subsequent video frames.

Source: Learning Reinforced Attentional Representation for End-to-End Visual Tracking
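
The setup described above (a target given by a bounding box in the first frame, then followed through later frames) can be sketched as a small loop. This is only an illustrative sketch: the `Tracker` interface, the `StaticTracker` baseline, the `run_tracker` helper, and the `(x, y, width, height)` box layout are hypothetical names and conventions chosen for this example, not taken from any paper listed here.

```python
from typing import Any, Iterable, List, Protocol, Tuple

# Assumed box layout for this sketch: (x, y, width, height) in pixels.
Box = Tuple[float, float, float, float]


class Tracker(Protocol):
    """Hypothetical minimal interface for a single-object tracker."""

    def init(self, frame: Any, box: Box) -> None: ...

    def update(self, frame: Any) -> Box: ...


class StaticTracker:
    """Trivial baseline: always predicts the first-frame box."""

    def init(self, frame: Any, box: Box) -> None:
        self._box = box

    def update(self, frame: Any) -> Box:
        return self._box


def run_tracker(tracker: Tracker, frames: Iterable[Any], init_box: Box) -> List[Box]:
    """Initialise on the first frame, then predict one box per later frame."""
    frame_iter = iter(frames)
    first_frame = next(frame_iter)  # raises StopIteration on an empty video
    tracker.init(first_frame, init_box)
    trajectory = [init_box]  # the first-frame box is given, not predicted
    for frame in frame_iter:
        trajectory.append(tracker.update(frame))
    return trajectory
```

The returned list is the estimated trajectory, one box per frame. A real tracker would replace `StaticTracker.update` with a model that localises the target in each new frame.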

Papers

Showing 1-25 of 525 papers

Title | Status | Hype
SAMURAI: Adapting Segment Anything Model for Zero-Shot Visual Tracking with Motion-Aware Memory | Code | 9
Local All-Pair Correspondence for Point Tracking | Code | 3
TAPIR: Tracking Any Point with per-frame Initialization and temporal Refinement | Code | 3
A Distractor-Aware Memory for Visual Object Tracking with SAM2 | Code | 3
Universal Instance Perception as Object Discovery and Retrieval | Code | 3
You Only Demonstrate Once: Category-Level Manipulation from Single Visual Demonstration | Code | 2
XTrack: Multimodal Training Boosts RGB-X Video Object Trackers | Code | 2
Tracking Meets LoRA: Faster Training, Larger Model, Stronger Performance | Code | 2
R1-Track: Direct Application of MLLMs to Visual Object Tracking via Reinforcement Learning | Code | 2
Learning an Adaptive and View-Invariant Vision Transformer for Real-Time UAV Tracking | Code | 2
Long-term Frame-Event Visual Tracking: Benchmark Dataset and Baseline | Code | 2
Towards Real-World Visual Tracking with Temporal Contexts | Code | 2
Fast Online Object Tracking and Segmentation: A Unifying Approach | Code | 2
VastTrack: Vast Category Visual Object Tracking | Code | 2
Autoregressive Visual Tracking | Code | 2
Visible-Thermal Multiple Object Tracking: Large-scale Video Dataset and Progressive Fusion Approach | Code | 2
Event Stream-based Visual Object Tracking: A High-Resolution Benchmark Dataset and A Novel Baseline | Code | 2
Exploring Enhanced Contextual Information for Video-Level Object Tracking | Code | 2
Cross-Modal Object Tracking via Modality-Aware Fusion Network and A Large-Scale Dataset | Code | 2
Joint Feature Learning and Relation Modeling for Tracking: A One-Stream Framework | Code | 2
An Experimental Study on Exploring Strong Lightweight Vision Transformers via Masked Image Modeling Pre-Training | Code | 2
ODTrack: Online Dense Temporal Token Learning for Visual Tracking | Code | 2
Similarity-Guided Layer-Adaptive Vision Transformer for UAV Tracking | Code | 2
Ad2Attack: Adaptive Adversarial Attack on Real-Time UAV Tracking | Code | 1
Deeper and Wider Siamese Networks for Real-Time Visual Tracking | Code | 1

Benchmark Results

# | Model | Metric | Claimed | Verified | Status
1 | ARTrack-L | AUC | 60.3 | | Unverified
2 | UNINEXT-H | AUC | 59.3 | | Unverified
3 | JointNLT | AUC | 56.9 | | Unverified
4 | OSTrack | AUC | 55.9 | | Unverified
5 | TransT | AUC | 50.7 | | Unverified
6 | AdaSwitcher | AUC | 42 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TAPIR (Panning MOVi-E) | Average Jaccard | 61.3 | | Unverified
2 | TAPIR (MOVi-E) | Average Jaccard | 59.8 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TAPIR (Panning MOVi-E) | Average Jaccard | 57.2 | | Unverified
2 | TAPIR (MOVi-E) | Average Jaccard | 57.1 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TAPIR (Panning MOVi-E) | Average Jaccard | 84.7 | | Unverified
2 | TAPIR (MOVi-E) | Average Jaccard | 84.3 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TAPIR (MOVi-E) | Average Jaccard | 66.2 | | Unverified
2 | TAPIR (Panning MOVi-E) | Average Jaccard | 62.7 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TATrack-L | AUC | 71.1 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SiamFC-lu (Ours) | AUC | 0.32 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | SiamFC-lu (Ours) | AUC | 0.66 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | MDNet | Score | 0.64 | | Unverified

# | Model | Metric | Claimed | Verified | Status
1 | TATrack-L | ACCURACY | 0.85 | | Unverified
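
The tables above do not say how each AUC value was computed. In single-object tracking benchmarks, AUC usually denotes the area under the success plot: the fraction of frames whose predicted box overlaps the ground truth by more than a threshold, averaged over thresholds from 0 to 1. The sketch below follows that common convention under stated assumptions: boxes are `(x, y, width, height)`, there are 21 evenly spaced thresholds, and the comparison is strict. The function names `iou` and `success_auc` are illustrative, and individual benchmarks may differ in these details.

```python
from typing import List, Sequence, Tuple

# Assumed box layout for this sketch: (x, y, width, height).
Box = Tuple[float, float, float, float]


def iou(a: Box, b: Box) -> float:
    """Intersection over union of two axis-aligned boxes."""
    ax, ay, aw, ah = a
    bx, by, bw, bh = b
    inter_w = max(0.0, min(ax + aw, bx + bw) - max(ax, bx))
    inter_h = max(0.0, min(ay + ah, by + bh) - max(ay, by))
    inter = inter_w * inter_h
    union = aw * ah + bw * bh - inter
    return inter / union if union > 0 else 0.0


def success_auc(pred: Sequence[Box], gt: Sequence[Box], num_thresholds: int = 21) -> float:
    """Mean success rate over evenly spaced overlap thresholds in [0, 1]."""
    if len(pred) != len(gt) or not pred:
        raise ValueError("pred and gt must be non-empty and of equal length")
    overlaps: List[float] = [iou(p, g) for p, g in zip(pred, gt)]
    thresholds = [i / (num_thresholds - 1) for i in range(num_thresholds)]
    # Success rate at threshold t: share of frames with overlap strictly above t.
    success = [sum(o > t for o in overlaps) / len(overlaps) for t in thresholds]
    return sum(success) / len(success)
```

This returns a value between 0 and 1. The tables appear to mix percentage-style entries (such as 60.3) with fraction-style entries (such as 0.66), so a result may need to be multiplied by 100 before it is compared with a percentage-style entry.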